Job Facts

Date Posted
April 22, 2009
Start Date
April 22, 2009
Type
Hourly
Main Category
Software Development
Sub Category
Scripts & Utilities
Skills
Python, Regular Expressions, Parsing semi-structured text
Estimated Workload
As needed - Less than 10 hrs/week
Estimated Duration
Less than 1 week
Last Buyer Activity
April 26, 2009
Last Date Worked
April 29, 2009
Offline Hours
0%

Buyer Facts

Score
(4.96 of 5)
Member Since
August 6, 2008
Country
United States (GMT-05)
City
Cambridge
Jobs Posted
191
Jobs Filled
186
Jobs Not Yet Filled
1
Current Team size
8
Hours billed, last 30 days
14
Total oDesk Hours
272

Job Description:

I have a large text document containing biographical sketches for over 50,000 individuals (small sample attached).  I want a Python script that goes through this document and identifies all the individuals. For a first iteration, the program can just put the biographical sketch for each person in a string.  Once I validate the the script is capturing everyone, I'd like to parse the biographical string and initiate a "person" class with attributes depending upon what is in the string. ...

More info about this job…