Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsWhite Papers
Discussion GroupsFirst AidDatabasesJavaBeansGUIJava 3DVirtual MachineCORBASecurityToolsGeneral
Java DirectoryOpen Source ProjectsSample Book ChaptersUser GroupsWeb Resources
Related Topics
Databases.NETMore Topics ...

Java Forum / General / March 2008

Tip: Looking for answers? Try searching our database.

Want to implement a specific Website search engine

Thread view: 
Alok Guha [ India ] - 11 Mar 2008 09:18 GMT
hi all,
        I got assignment to make a search engine specifin to a
website. In this way user will only enter search string and url of
website, and my application will be expected as to return result as
google [link to pages to that site].
       Please help me out.. I am really puzzeled.. Thanks in advance.

Thanks and Regards
Alok S. Guha
Andrew Thompson - 11 Mar 2008 09:47 GMT
> hi all,
>          I got assignment to make a search engine specifin to a
> website.

Give it (the assignment) back to whoever you
got it from. *

>..In this way user will only enter search string and url of
> website, and my application will be expected as to return result as
> google [link to pages to that site].

Google offers such a service already.

>         Please help me out.. I am really puzzeled.. Thanks in advance.

I suggest you hire someone.

* Anybody with a remote chance of being able to
implement a spider, organise and collate the
information (in any slightly efficient form)
and deliver that information back to users
based upon search strings, would not be 'really
puzzeled' by such a task.

--
Andrew T.
PhySci.org
GArlington - 11 Mar 2008 10:52 GMT
> hi all,
>          I got assignment to make a search engine specifin to a
> website. In this way user will only enter search string and url of
> website, and my application will be expected as to return result as
> google [link to pages to that site].
>         Please help me out.. I am really puzzeled.. Thanks in advance.

I am puzzled by what you want?

> Thanks and Regards
> Alok S. Guha

Are you expected to process random string and random (NOT predefined)
URL?
Are you going to implement real-time search engine? Then good luck -
some sites span thousands of pages and NO site directories, I hope you
will have fun following ALL links in ALL pages...
Maybe you should hire somebody to do that for you as Andrew
suggested...
Alok Guha [ India ] - 12 Mar 2008 05:31 GMT
> > hi all,
> >          I got assignment to make a search engine specifin to a
[quoted text clipped - 15 lines]
> Maybe you should hire somebody to do that for you as Andrew
> suggested...

As i specified .. i want to perform search in a specific site like...
ther is one site.. www.mysite.com
so my system will be able to search in every public page of this site.

can anyone having idea about "how to generate sitemap of any site in
Java ?"
Please let me know as soon as possible..

Thanks in advance...

-Alok
Andrew Thompson - 12 Mar 2008 07:25 GMT
...
> .. i want to perform search in a specific site like...
> ther is one site..www.mysite.com

<html>
<head>
<title>KISS</title>
</head>
<body>

<form method='GET' action="http://www.google.com/search">
<input type='text' value="" name='as_q'>
<input type='hidden' value="www.mysite.com" name='as_sitesearch'>
<input type='submit' value="MySite Search">
</form>

</body>
</html>

> so my system will be able to search in every public page of this site.
>
> can anyone having idea about "how to generate sitemap of any site in
> Java ?"

Visit Indonesia, catch a ferry to the island
of Java - then do what I did above.

If you want to use the Java programming language
for this task, and actually have *control* of the
site, JavaHelp might be a good way to index the
pages - though in the 'KISS' principle, I would
still probably just use the type of form shown
above, and simply ask Google to use the site's
own stylesheets to give the search results the
same 'look' as my site.  (I have never tried
it, but Google did offer that at one stage.)

> Please let me know as soon as possible..

Oh it's urgent is it?  What's your budget?

BTW - please don't go splitting off new threads
simply becasue an existing thread is not going
as well as you'd hoped.

--
Andrew T.
PhySci.org
Roedy Green - 12 Mar 2008 10:50 GMT
On Tue, 11 Mar 2008 01:18:31 -0700 (PDT), "Alok Guha [ India ]"
<aloksguha@gmail.com> wrote, quoted or indirectly quoted someone who
said :

>         I got assignment to make a search engine specifin to a
>website. In this way user will only enter search string and url of
>website, and my application will be expected as to return result as
>google [link to pages to that site].
>        Please help me out.. I am really puzzeled.. Thanks in advance.

You could attack it with a form that passes to content of a field as a
parm in the HTTP get.

See http://mindprod.com/jgloss/jgloss.html to give you a hint.

You could write a program that generates a get that would be just like
what a browser would send.  You compose it dynamically.

See http://mindprod.com/jgloss/http.html

your code might be similar to
http://mindprod.com/applet/submitter.html

to snoop on what a browser and engine say to each other, see
http://mindprod.com/jgloss/sniffer.html

See http://mindprod.com/jgloss/htmlcheat.html#FORMS
to learn how forms work in HTML and what sorts of message they
generate.
--

Roedy Green Canadian Mind Products
The Java Glossary
http://mindprod.com


Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.