Less than 1 week –
10-30 hrs/week –
Web scraper is a Java tool that extracts the required data from a site and stored them in a database. For each site -need to be crawled- a “spider” class should be implemented to parse the site hierarchy and extract its structure data.
The scraping steps is as follow:
1. Extract the web site directories.
2. Extract the directory’s pages.
3. Extract the page details
According to the type of data being listed in the directory, the result will ...