Java Forum / General / April 2006

Parsing HTML files

Tom - 10 Apr 2006 10:57 GMT
Hi all,

I would like to remove all whitespace between HTML tags and
comments in an HTML file. Is there a good HTML parser API I could
use to achieve that?

I'd appreciate any input.
Tajonis - 10 Apr 2006 14:28 GMT
If you simply want to clean up the HTML itself, have a look
at http://jtidy.sourceforge.net/

Otherwise, if you really want to parse out HTML elements, you can
look at http://java-source.net/open-source/html-parsers
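
For the narrow case in the original question, a parser may not be needed at all. Below is a minimal sketch, not taken from JTidy or any of the linked parsers, that uses a regular expression from java.util.regex to drop whitespace-only runs between a closing '>' and the next '<'. The class and method names (TagWhitespace, stripBetweenTags) are made up for illustration.

```java
import java.util.regex.Pattern;

public class TagWhitespace {
    // Matches a '>' followed by one or more whitespace characters and then a '<'.
    // Assumption: every '>' ends a tag or comment and every '<' starts one.
    private static final Pattern BETWEEN_TAGS = Pattern.compile(">\\s+<");

    /** Removes whitespace-only runs between adjacent tags or comments. */
    public static String stripBetweenTags(String html) {
        return BETWEEN_TAGS.matcher(html).replaceAll("><");
    }

    public static void main(String[] args) {
        String html = "<ul>\n  <li>one</li>\n  <!-- note -->\n  <li>two words</li>\n</ul>";
        // Text inside elements ("two words") is left untouched.
        System.out.println(stripBetweenTags(html));
    }
}
```

This is only safe for simple markup. It also removes the space in something like "<b>a</b> <i>b</i>", which changes how the page renders, and it does not protect the contents of <pre>, <script> or <textarea>. For those cases a real parser or a tool such as JTidy is the better choice.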
Roedy Green - 10 Apr 2006 17:59 GMT
>I would like to remove all whitespace between HTML tags and
>comments in an HTML file. Is there a good HTML parser API I could
>use to achieve that?

http://mindprod.com/products1.html#COMPACTOR will take out excess
white space.

http://mindprod.com/products1.html#ENTITIES will take out tags and
convert entities.

Canadian Mind Products, Roedy Green.
http://mindprod.com Java custom programming, consulting and coaching.
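
To illustrate what "take out tags and convert entities" involves, here is a rough sketch. It is not the code behind the mindprod.com utilities, and the names (TagStripper, toPlainText) are hypothetical. It assumes well-formed tags and handles only four named entities.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class TagStripper {
    // A deliberately tiny entity table; real HTML defines many more.
    private static final Map<String, String> ENTITIES = new LinkedHashMap<>();
    static {
        ENTITIES.put("&lt;", "<");
        ENTITIES.put("&gt;", ">");
        ENTITIES.put("&quot;", "\"");
        // "&amp;" goes last so that "&amp;lt;" becomes "&lt;" rather than "<".
        ENTITIES.put("&amp;", "&");
    }

    /** Drops comments and tags, then converts the entities listed above. */
    public static String toPlainText(String html) {
        String text = html
                .replaceAll("(?s)<!--.*?-->", "") // comments first; they may contain '>'
                .replaceAll("<[^>]*>", "");       // then anything that looks like a tag
        for (Map.Entry<String, String> e : ENTITIES.entrySet()) {
            text = text.replace(e.getKey(), e.getValue());
        }
        return text;
    }

    public static void main(String[] args) {
        System.out.println(toPlainText("<p>a &lt; b <!-- x > y -->&amp; c</p>"));
    }
}
```

A '>' inside an attribute value or a script block will confuse the tag pattern, so treat this as a starting point for simple files only.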


