Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsWhite Papers
Discussion GroupsFirst AidDatabasesJavaBeansGUIJava 3DVirtual MachineCORBASecurityToolsGeneral
Java DirectoryOpen Source ProjectsSample Book ChaptersUser GroupsWeb Resources
Related Topics
Databases.NETMore Topics ...

Java Forum / General / February 2007

Tip: Looking for answers? Try searching our database.

search source code project

Thread view: 
cmk128@hotmail.com - 15 Feb 2007 09:14 GMT
Hi
   Here is my project:

1) Apache + tomcat + mysql (have to use JSP, no php)
2) 3 servers : Linux FC6 , 2GB ram , 2.8Ghz
3) Goal : to build a website to let people search the source code from
over 1 million files.

It is unable to use mysql to build the index from the source code
because people can search "int x=0" rather than english word, so all
the indexing server doesn't word. I guess every time the user submit a
search, i use grep command to search all the 1 million files. But it
is too slow, how can i improve it?

thanks
from Peter (cmk128@hotmail.com)
cmk128@hotmail.com - 15 Feb 2007 09:41 GMT
Forgot to say, in my xeon 3Ghz server, i need 10 seconds to grep 20000
files. If i need to grep 1 million files, then i need at least 500
seconds, it is too slow.
thanks
from Peter
dnass - 15 Feb 2007 13:33 GMT
On 15 fév, 10:41, cmk...@hotmail.com wrote:
> Forgot to say, in my xeon 3Ghz server, i need 10 seconds to grep 20000
> files. If i need to grep 1 million files, then i need at least 500
> seconds, it is too slow.
> thanks
> from Peter

Hi,

Have you tried Lucene http://lucene.apache.org/java/docs/

you are welcome
Dnas
Chris Uppal - 15 Feb 2007 16:20 GMT
> 3) Goal : to build a website to let people search the source code from
> over 1 million files.

Have you looked at OpenGrok ?

(It uses Lucene as its text search engine internally, but adds stuff for
specialising to searching and displaying code.)

   -- chris


Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.