HTML parsing and cleaning

IN