HTML cleanup tool

Reduces a HTML document (or fragment) to basic HTML tags and attributes – clean HTML

Also, converts a Word document (.docx) to a clean HTML

Paste HTML to clean it up

What exactly does it do?

  • Fixes or removes non-well formed tags and attributes (e.g. adds alt attributes to images if missing)
  • Converts the markup to HTML5 (if it is XHTML for example)
  • Reduces the markup to: <a href>, <body>, <h1>, <h2>, <h3>, <h4>, <h5>, <h6>, <head>, <hr>, <html>, <i>, <img src width height alt>, <li>, <ol>, <p>, <ruby>, <strong>, <table>, <tbody>, <td colspan rowspan>, <th colspan rowspan>, <title>, <tr>, <ul>
  • Replaces: <b> to <strong>, <div> to <p>
  • Reformats the HTML (line breaks, indents)

Example

Input: <p class="funny" onlick="alert('LOL')">bla bla</p>
Will be simplified to: <p>bla bla</p>