Someone to turn raw unstructured data, into nice structured fields.
Our data is captured in raw HTML. We need to parse that data into fields and scrub for data consistency. This step is often done in a scraper, however we dont need to scrape, we already have the data. We just need it parsed.
Attention to detail is important. Sampling hundreds of fields to make sure the parser is creating the correct data is important.
Choice of script language is up to you. Ruby, Python etc.
* Choice of language
* Experience with Data Mining.
* Relevant Projects
Please reply with the words "Structured Data Mining" in the first line of your email