Office HTML Cleansing Revisited

Another file full of villainous Word-generated HTML crossed the geekroom desk today. It had footnotes, which worked fine in Firebird but not at all in Internet Explorer. In that browser, the first footnote number looked like:

[1]

That’s ugly!

Following John’s suggestion from last time this issue came up, I installed Microsoft’s Office HTML Filter to remove all that weird Office-specific markup, and ran it on the file. The results were very agreeable, and the file now renders correctly in IE as well as Firebird.
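
For the curious, here is a rough idea of the kind of cleanup involved. This is only a sketch in Python, not how the Office HTML Filter actually works: the function name `strip_office_markup` is made up for illustration, and the regular expressions cover just a handful of common Word constructs (conditional comments, `o:`/`v:`/`w:` namespaced tags, `mso-` style declarations and `Mso` class names). Regexes are a crude way to handle HTML, so treat this as a demonstration rather than a replacement for a real filter.

```python
import re


def strip_office_markup(html: str) -> str:
    """Remove a few common Word-specific constructs from an HTML string.

    Hypothetical helper for illustration; not the Office HTML Filter's logic.
    """
    # Hidden conditional comments: <!--[if gte mso 9]> ... <![endif]-->
    html = re.sub(r"<!--\[if[^\]]*\]>.*?<!\[endif\]-->", "", html,
                  flags=re.DOTALL | re.IGNORECASE)
    # Visible conditional markers: <![if !supportFootnotes]> and <![endif]>
    # (only the markers are dropped, the content between them is kept)
    html = re.sub(r"<!\[(?:if[^\]]*|endif)\]>", "", html, flags=re.IGNORECASE)
    # Namespaced Office tags such as <o:p> and </o:p> (tags only)
    html = re.sub(r"</?[ovw]:[^>]*>", "", html, flags=re.IGNORECASE)
    # mso-* declarations inside style attributes
    html = re.sub(r"\s*mso-[^:;\"']+:[^;\"']*;?", "", html, flags=re.IGNORECASE)
    # style attributes that are now empty
    html = re.sub(r"\s+style=(\"\s*\"|'\s*')", "", html)
    # class attributes holding a single Word class name, quoted or not
    html = re.sub(r"\s+class=(\"?)Mso\w+\1", "", html)
    return html
```

Fed a made-up Word-style footnote paragraph, with its conditional markers, an empty `<o:p>` pair and an `mso-` style, the function returns a plain `<p>` element with just the footnote number in it.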

1 Comment »

  1. toke

    February 14, 2004 @ 5:24 am

    nice
