Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsWhite Papers
Discussion GroupsFirst AidDatabasesJavaBeansGUIJava 3DVirtual MachineCORBASecurityToolsGeneral
Java DirectoryOpen Source ProjectsSample Book ChaptersUser GroupsWeb Resources
Related Topics
Databases.NETMore Topics ...

Java Forum / General / August 2006

Tip: Looking for answers? Try searching our database.

doing some ocr stuff with java

Thread view: 
mambenanje@gmail.com - 22 Aug 2006 10:54 GMT
I need help on a little practise project I am undertaking right now. I
have a couple of question papers with diagrams and text and I wish to
scan the papers to either pdf format or jpeg png format. my main
problem is to read the file using java and extract the images and
questions. can some one help me or give me a clue on this
Ingo R. Homann - 22 Aug 2006 11:01 GMT
Hi,

> I need help on a little practise project...

"OCR" is nothing you will solve in "a little practise project". Forget
it. (Use an existing OCR-Software instead or type it into your computer
by yourself). There is nothing to say more about that...

Ciao,
Ingo
Brandon McCombs - 22 Aug 2006 23:26 GMT
> I need help on a little practise project I am undertaking right now. I
> have a couple of question papers with diagrams and text and I wish to
> scan the papers to either pdf format or jpeg png format. my main
> problem is to read the file using java and extract the images and
> questions. can some one help me or give me a clue on this

As Ingo said, OCR isn't something you can just throw together. It is
very sensitive and your results can vary (especially when dealing with
OCR software for faxes, which is where I've dealt with it, but that's
another issue). Do you need to search the text that is on your diagrams
or be able to copy/paste it into another application? If not, then you
don't even need OCR but just something that can read the existing file
format and convert to your pdf/jpg/png format.

OCR actually reads the text in your source and separates it from the
image version of the text so you can manipulate it as text but if you
don't actually have a need to manipulate it like that then you don't
need OCR.
mambenanje@gmail.com - 27 Aug 2006 12:36 GMT
ok thanks for the help,
this is what I want to do
1) scan the question paper to any file format
2) get the text and pictures found on the file analyse them with no
human copying and pasting
3) send the information into a database

this will help me work with several question papers and when papers
come in future I only have to scan then pass it thru the application. I
cannot use another OCR tool for this, well maybe I cant cos I dont know


Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.