Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsWhite Papers
Discussion GroupsFirst AidDatabasesJavaBeansGUIJava 3DVirtual MachineCORBASecurityToolsGeneral
Java DirectoryOpen Source ProjectsSample Book ChaptersUser GroupsWeb Resources
Related Topics
Databases.NETMore Topics ...

Java Forum / General / January 2006

Tip: Looking for answers? Try searching our database.

decoding html in java

Thread view: 
IgorD - 20 Jan 2006 15:16 GMT
I have a regular html document that encodes certain characters. For
example a quote " is represented as " and a greater sing > is >
and so on.

I am looking for a java API to automatically decode this back. This
seems like something that should be outthere but so far I could not
findd it anywhere. I would appreaciate if you could advise me on where
to find an HTML decoder to do that. Thanks
Martin - 20 Jan 2006 19:22 GMT
Did you look at java.util.regex.Pattern and java.util.regex.Matcher?
Roedy Green - 20 Jan 2006 21:27 GMT
>I am looking for a java API to automatically decode this back

see http://mindprod.com/products1.html#ENTITIES

it will take entities out or put them back. It will also strip tags
leaving your raw text. Putting them back  is more problematic. :-)
Signature

Canadian Mind Products, Roedy Green.
http://mindprod.com Java custom programming, consulting and coaching.



Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2009 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.