I have a regular html document that encodes certain characters. For
example a quote " is represented as " and a greater sing > is >
and so on.
I am looking for a java API to automatically decode this back. This
seems like something that should be outthere but so far I could not
findd it anywhere. I would appreaciate if you could advise me on where
to find an HTML decoder to do that. Thanks
Martin - 20 Jan 2006 19:22 GMT
Did you look at java.util.regex.Pattern and java.util.regex.Matcher?
Roedy Green - 20 Jan 2006 21:27 GMT
>I am looking for a java API to automatically decode this back
see http://mindprod.com/products1.html#ENTITIES
it will take entities out or put them back. It will also strip tags
leaving your raw text. Putting them back is more problematic. :-)

Signature
Canadian Mind Products, Roedy Green.
http://mindprod.com Java custom programming, consulting and coaching.