Creating spiders/scrappers for certain web sites
Closed - This job posting has been filled and work has been completed.
We need a PHP programmer for writing parsers (spiders) for relatively simple websites which contain the lists of structured data (for instance, the lists of people along with their names and addresses).
When gathering data for each entry the spider should simply call a PHP function that we provide and which will do the rest (save data in certain format that we need). We will give you an example of a ready made fully functional spider, so you'll be able to simply adjust it to new sites.
If we're satisfied with your work we may provide about a hundred sites to create spiders for.
Based on our experience, a typical time required to create a spider for one site is 2-20 hours, 100-300 lines of simple code. The cost is negotiated per spider beforehand.
A candidate must understand how HTTP protocol works (GET and POST requests, cookies). Experience of working with XPath and regular expressions would be a plus.
Skills: writing, rest