How can I remove ms Word crud from HTML, but still keep the bolds?

Giganews Newsgroups
Subject: How can I remove ms Word crud from HTML, but still keep the bolds?
Posted by:  ship (shiph…@gmail.com)
Date: 12 Apr 2007

DW8 / ms Word2003 / Windows XP Pro

Hi

Can anyone tell me how to remove all the msWord (2003)  formatting
from HTML?
I have tried the "Clean up Word HTML" utility in DW8 but it is pretty
useless.

Basically I need to strip out all marking up apart for bolds (and
possibly bullets)

....The problem is that I have pages and pages to do. And they are all
now pasted into a database so mass string changes is impossible.

e.g.
I need to remove lines like
  <li class="MsoNormal"><span style="font-size: 10pt; font-family:
arial; ">
either completely (!) or [ideally] to just leave the
  <li>
tag behind.

i.e. I want to remove all classes, and all fonts, all spans but leave
<b>  and </b>

- Any thoughts?

Ship
Shiperton Henethe

P.S. I'm sorry, I know this topic must have been discussed innumerable
times, but I couldnt find any *recent* solutions!

Replies