Subject: | How can I remove ms Word crud from HTML, but still keep the bolds? |
Posted by: | ship (shiph…@gmail.com) |
Date: | 12 Apr 2007 |
DW8 / ms Word2003 / Windows XP Pro
Hi
Can anyone tell me how to remove all the msWord (2003) formatting
from HTML?
I have tried the "Clean up Word HTML" utility in DW8 but it is pretty
useless.
Basically I need to strip out all marking up apart for bolds (and
possibly bullets)
....The problem is that I have pages and pages to do. And they are all
now pasted into a database so mass string changes is impossible.
e.g.
I need to remove lines like
<li class="MsoNormal"><span style="font-size: 10pt; font-family:
arial; ">
either completely (!) or [ideally] to just leave the
<li>
tag behind.
i.e. I want to remove all classes, and all fonts, all spans but leave
<b> and </b>
- Any thoughts?
Ship
Shiperton Henethe
P.S. I'm sorry, I know this topic must have been discussed innumerable
times, but I couldnt find any *recent* solutions!