Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsWhite Papers
Discussion GroupsFirst AidDatabasesJavaBeansGUIJava 3DVirtual MachineCORBASecurityToolsGeneral
Java DirectoryOpen Source ProjectsSample Book ChaptersUser GroupsWeb Resources
Related Topics
Databases.NETMore Topics ...

Java Forum / General / December 2006

Tip: Looking for answers? Try searching our database.

Ignoring tags when extracting data from xhtml

Thread view: 
Damo_Suzuki - 08 Dec 2006 00:03 GMT
hi again,
I'm traversing an org.w3c.dom.Document to extract data.Say I'm going
through the following line:

<h2 class=r>
<a class=l href="http://www.java.com/" onmousedown="return
clk(this.href,'','','res','1','')">
<b>java</b>.com: Hot Games, Cool Apps</a></h2>

I look for the h2 node. I want it to print out, just, "java.com:Hot
Games, Cool Apps". At the moment it doesnt print anything. I thik its
because of the <b></b> tags in the middle. Is there anyway I can ignore
tags after I find the h2 tag
thanks
Damo_Suzuki - 08 Dec 2006 01:37 GMT
hi,
I just noticed JTidy has a method getDropFontTags() (oddly named!!)
,but has no documentation of how to use it. If you call it from a new
instance of a tidy object , how does it know what file to remove the
tags from? Has anyone ever used this method and if so could you show me
how?
Thanks


Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.