Site scrapper

Site scrapper

Cancelled

Job Description

A simple content scrapper (article scraper from 2 or 3 sources) written with Scrapy and using XPath. I would like to have the option to reorder the output at the sentence and paragraph level using spyntax format. Optional, it would be great to have the option to translate it through Bing API.

I would like to get the code to be able to reuse it to add some more sites as content sources and also a minimal GUI.