I have been having trouble parsing this string.
<UserValue value="John O"Hare 
 101 N. Main St"
title="ConsumedText"></UserValue>
The " is being parsed correctly, but the 
 is being converted
into a space instead of a linefeed/newline. The newline is crutial to
formatting in the application.
I have been searching for a couple days now and tried many different
methods and have not been able to get the parser to stop converting
this character set into a single space. I believe the issue is
occurring because it is an attribute value and not text in the node.
Please do not critique the format of the xml, as I know it is
undesireable. I am just trying to implement a short term solution.
I should also mention that searching the xml file before parsing is
not feasible.
Tech being used: JRE 1.4.2.10, Xerces DOMParser
Thanks in advance for any advice.
--Clowtown
Pavel Lepin - 08 Jun 2007 14:45 GMT
clowtown <clowtown@gmail.com> wrote in
<1181309270.407258.274280@q69g2000hsb.googlegroups.com>:
> I have been having trouble parsing this string.
>
[quoted text clipped - 5 lines]
> into a space instead of a linefeed/newline. The newline
> is crutial to formatting in the application.
You might want to read W3C's XML 1.0 4E, 3.3.3
(Attribute-Value Normalization).
Follow-ups set to comp.text.xml, this has very little to do
with Java, and everything to do with XML specs.

Signature
Pavel Lepin