Text parser for multiple-choice questions and answers: find the question, build an array of answers

Closed - This job posting has been filled and work has been completed.

Job Description

You will have text like the multiple-choice questions shown in these image search results:

https://www.google.com/search?q=multiple+choice+questions&hl=en&tbo=d&rlz=1C5CHFA_enUS503US503&source=lnms&tbm=isch&sa=X&ei=zxDAUPr8CrK40AG3uICICA&ved=0CAcQ_AUoAA&biw=1349&bih=662


The input will already have been converted to text via OCR.

You need to determine which text is the question, and then build an array of all the possible answers.
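A minimal sketch of one way to approach this in Node.js, not a finished solution. It assumes the OCR output has one answer per line and that each answer starts with a letter label such as "A.", "b)" or "(c)". Everything before the first labelled line is treated as the question. The names `parseQuestion`, `OPTION_RE` and `QUESTION_NUMBER_RE` are made up for illustration, and real OCR output will likely need more tolerant patterns.

```javascript
// Assumption: an answer line starts with a single letter label A-H,
// written like "A.", "b)", "(c)" or "D:", followed by whitespace.
const OPTION_RE = /^\(?([A-Ha-h])[.):]\s+(.*)$/;

// Assumption: the question may start with a number such as "3." or "12)".
const QUESTION_NUMBER_RE = /^\d+\s*[.)]\s*/;

function parseQuestion(ocrText) {
  // Split into trimmed, non-empty lines.
  const lines = ocrText
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);

  const questionParts = [];
  const answers = [];

  for (const line of lines) {
    const match = OPTION_RE.exec(line);
    if (match) {
      // A new labelled answer starts here.
      answers.push(match[2].trim());
    } else if (answers.length === 0) {
      // No answer seen yet, so this line still belongs to the question.
      questionParts.push(line);
    } else {
      // Unlabelled line after an answer: treat it as wrapped answer text.
      answers[answers.length - 1] += " " + line;
    }
  }

  // Join wrapped question lines and drop a leading question number, if any.
  const question = questionParts.join(" ").replace(QUESTION_NUMBER_RE, "");
  return { question, answers };
}

const sample = [
  "1. What is the capital of France?",
  "A. Paris",
  "B. London",
  "(c) Berlin",
  "d) Madrid",
].join("\n");

console.log(JSON.stringify(parseQuestion(sample)));
```

The function returns an object with a `question` string and an `answers` array of strings, so the result can be printed as JSON and consumed from any scripting language.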

If you can also do the OCR part, I will pay extra. I plan to install an open-source OCR tool on my Ubuntu server (https://help.ubuntu.com/community/OCR), so if you can do the whole thing, I'll pay $50-100 extra.

Any programming language can be used, but JavaScript is preferred since we're using Node.js. Alternatively, a compiled C, C++, or Java app may be preferable, since we can call it from the command line from any scripting language. That way it's very fast.

P.S. I hear good things about this OCR engine:
http://code.google.com/p/tesseract-ocr/
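
For the optional OCR part, here is a rough sketch of calling Tesseract from Node.js. It assumes the `tesseract` command-line tool is installed on the server and on the PATH, and that the installed version accepts `stdout` as the output base (so the recognized text is printed instead of written to a file). The helper names `buildTesseractArgs` and `ocrImage` are hypothetical.

```javascript
const { execFile } = require("child_process");

// Builds the argument list for the tesseract CLI.
// Assumption: "stdout" as the output base makes tesseract print the text.
function buildTesseractArgs(imagePath, lang = "eng") {
  return [imagePath, "stdout", "-l", lang];
}

// Runs tesseract on one image and resolves with the recognized text.
// Rejects if the tool is missing or exits with an error.
function ocrImage(imagePath, lang = "eng") {
  return new Promise((resolve, reject) => {
    execFile("tesseract", buildTesseractArgs(imagePath, lang), (error, stdout) => {
      if (error) {
        reject(error);
      } else {
        resolve(stdout);
      }
    });
  });
}
```

A call such as `ocrImage("scan.png").then((text) => console.log(text))` would print the recognized text, which could then be handed to whatever code picks out the question and the answers. The file name `scan.png` is only a placeholder.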