Reduces a HTML document (or fragment) to basic HTML tags and attributes – clean HTML
Also, converts a Word document (.docx) to a clean HTML