Hi Everyone,
I'm looking to build a piece of software that reads large XML files
and stores their contents in a database.
The trick is that the XML files share a similar overall structure but
differ in their repeating fields, and I have to be able to store each
XML file programmatically as I receive it.
Basically, I want a system that can:
(1) Read large XML files (SAX parser).
(2) Take any XML and automatically determine a schema to store it.
(3) Possibly create the tables automatically.
(4) Preferably be Java-based.
I have found:
http://www.skyhawksystems.com (sales team unresponsive after 3 weeks)
http://www.rpbourret.com/xmldbms/index.htm (very nice, but it uses DOM,
i.e. it can only process small XML files)
Thanks,
Jeff
Example XML.
XML 1
------------
<xml>
<customer>
<customer_name>Elvis Presley</customer_name>
<residences>
<residence>
<address>1234 Graceland</address>
<zip>98521</zip>
</residence>
</residences>
</customer>
...
XML 2
----------
<xml>
<customer>
<customer_name>Elvis Presley</customer_name>
<account_number>000999</account_number>
<age></age>
</customer>
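Requirements (1) to (3) above can be illustrated with a minimal sketch,
not a definitive implementation. It streams each document through the
JDK's SAX parser, so memory use stays flat regardless of file size, and
it applies one assumed mapping rule: an element that contains only text
becomes a column of its parent element's table. Columns seen across
several documents are merged, which covers the case of files with
different repeating fields. The class name SchemaSketch, the helper
fromStrings, the surrogate "id" column, and the VARCHAR(255) column type
are all hypothetical choices made for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch: infers "table -> columns" from XML using SAX only.
class SchemaSketch extends DefaultHandler {
    // Table name -> column names, both kept in first-seen order.
    final Map<String, Set<String>> tables = new LinkedHashMap<>();
    // Names of the currently open elements, innermost on top.
    private final Deque<String> path = new ArrayDeque<>();
    // For each open element: has it contained a child element so far?
    private final Deque<Boolean> hasChild = new ArrayDeque<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        if (!hasChild.isEmpty()) {
            // The enclosing element now has at least one child element.
            hasChild.pop();
            hasChild.push(true);
        }
        path.push(qName);
        hasChild.push(false);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        String name = path.pop();
        boolean container = hasChild.pop();
        // Assumed rule: a leaf element becomes a column of its parent's table.
        if (!container && !path.isEmpty()) {
            tables.computeIfAbsent(path.peek(), k -> new LinkedHashSet<>()).add(name);
        }
    }

    // Streams one document through the handler; may be called repeatedly
    // so that columns from differently shaped files accumulate.
    void scan(InputStream in) throws Exception {
        SAXParserFactory.newInstance().newSAXParser().parse(in, this);
    }

    // Convenience helper for small in-memory documents.
    static SchemaSketch fromStrings(String... docs) throws Exception {
        SchemaSketch sketch = new SchemaSketch();
        for (String doc : docs) {
            sketch.scan(new ByteArrayInputStream(doc.getBytes(StandardCharsets.UTF_8)));
        }
        return sketch;
    }

    // Emits one CREATE TABLE statement per inferred table. Element names are
    // used as identifiers unchanged, which assumes they are valid SQL names.
    String ddl() {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, Set<String>> table : tables.entrySet()) {
            sb.append("CREATE TABLE ").append(table.getKey())
              .append(" (id INTEGER PRIMARY KEY");
            for (String column : table.getValue()) {
                sb.append(", ").append(column).append(" VARCHAR(255)");
            }
            sb.append(");\n");
        }
        return sb.toString();
    }
}
```

For a real file, scan(new FileInputStream(...)) would be called once per
document and ddl() executed over JDBC afterwards. The sketch ignores
attributes, does not link child tables such as residence back to their
parent with a foreign key, and treats an empty container element as a
column, so those cases would need extra rules.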
Daud Lee Lambert - 13 Apr 2006 15:22 GMT
> Hi Everyone,
>
[quoted text clipped - 10 lines]
> (3) Possibly create the tables automatically.
> (4) Preferably be Java-based.
There is a student at Wayne State University who has developed a system to
do something like that. He recently gave a presentation to the department
about XML processing in general; here are links to the abstract and the
slideshow he presented:
http://www.cs.wayne.edu/courses/gradsem_w06/abstracts/Mustafa%20Atay.doc
http://www.cs.wayne.edu/courses/gradsem_w06/slides/Atay_XML2REL_SeminarWSU.ppt
Another student asked him whether his software would be able to read in a
10 GB bioinformatics database. He replied that he believed it would,
but that he had not tried it; the largest file he had processed with this
algorithm was about 100 MB.
jeffrey.h.hill@gmail.com - 18 Apr 2006 16:56 GMT
Thanks for the post. I've contacted Mustafa. I'm sure others could
find this, but to speed up anyone else's search: his site describes
all of the different components being worked on.
(The Schema Mapping and Data Mapping components are of interest to me.)
http://www.cs.wayne.edu/~matay/
Jeff