
Java Forum / General / November 2006


Design of a pipelined architecture/framework for handling large data sets

nish - 30 Nov 2006 19:08 GMT
I am facing an inconvenience that I believe other Java developers must
have faced before, but I am finding it difficult to articulate it in
keywords so that Google will give me the right answers, so here goes.

1. I am using the Eclipse IDE with multiple Java projects, each one sourced
from a CVS repository on an external server on the local LAN.
2. Almost all of these projects handle big data sets (read: 100-500 MB
of XML and text files), which is data crawled from the web. Each project
acts on the data, transforms it in some way, and then passes it along
for other projects to act on. Some of the data is in single big
files and some of it is in hundreds of small files inside a single
directory.

Basically, what I am looking for is a better way to handle this data.
Currently, if I put the data in CVS, it is not that efficient; plus
there needs to be some central lookup for all the data. I guess this is
partly a Java design question and partly ignorance on my part about
the right tools for this job.
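
To make the "central lookup" idea concrete, here is a minimal sketch of what I have in mind, not a finished design. The class name `DataSetCatalog`, the data set name, and the path are all made up for illustration; the only point is that each project would ask for a data set by a logical name instead of hard-coding where the files live.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical central lookup: maps a logical data set name to its location.
// A location may be a single big file or a directory of many small files.
final class DataSetCatalog {
    private final Map<String, Path> locations = new LinkedHashMap<>();

    // Registers a data set; rejects a second registration under the same name.
    void register(String name, Path location) {
        if (locations.putIfAbsent(name, location) != null) {
            throw new IllegalArgumentException("duplicate data set: " + name);
        }
    }

    // Returns the location, or an empty Optional if the name is unknown.
    Optional<Path> lookup(String name) {
        return Optional.ofNullable(locations.get(name));
    }

    public static void main(String[] args) {
        DataSetCatalog catalog = new DataSetCatalog();
        // Assumed example path; the real data would live outside CVS.
        catalog.register("crawl-2006-11", Paths.get("/data/crawl/2006-11"));
        System.out.println(catalog.lookup("crawl-2006-11"));
    }
}
```

In practice the entries would presumably be loaded from some shared file or service rather than registered in code, but that part is exactly what I am unsure about.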

Thanks for any help.
nish - 30 Nov 2006 19:19 GMT
Another issue I can think of:

3. I should be able to specify how a data set is archived. For
example, some large data I don't want revisioned in CVS
because it is not going to change; other data I might want
checked into CVS so that it gets revisioned.
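
As a rough illustration of point 3, the archiving choice could be declared per data set in a small manifest. This is only a sketch under my own assumptions: the names `ArchivePolicy` and `ArchiveManifest`, the two policy values, and the `name=POLICY` line format are invented here and do not come from any existing tool.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Locale;
import java.util.Map;
import java.util.Properties;
import java.util.TreeMap;

// Hypothetical archiving choices: VERSIONED data is checked into CVS,
// STATIC data never changes and is kept out of version control.
enum ArchivePolicy { VERSIONED, STATIC }

final class ArchiveManifest {
    // Parses lines such as "crawl-raw=STATIC" into a name -> policy map.
    // An unknown policy value makes valueOf throw IllegalArgumentException.
    static Map<String, ArchivePolicy> parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        Map<String, ArchivePolicy> policies = new TreeMap<>();
        for (String name : props.stringPropertyNames()) {
            String value = props.getProperty(name).trim().toUpperCase(Locale.ROOT);
            policies.put(name, ArchivePolicy.valueOf(value));
        }
        return policies;
    }
}
```

A build or sync step could then consult the map to decide whether a data set goes through CVS or is simply copied from a shared location.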

> I am facing an inconvenience which I believe should have been faced
> before by other Java developers but I am finding it difficult to
[quoted text clipped - 17 lines]
>
> Thanks for any help.

