Webscrape to automate PDF downloading
Closed - This job posting has been filled and work has been completed.
I have two large samples of thousands of PDFs that are currently contained on two unique websites. I do not want to download these PDFs manually so I would like you to create code for each website to automatically download. Each website contains PDFs based on stock tickers, and I will supply the list of tickers.
I am not sure on the exact number, but there will be around 100,000 - 200,000 PDFs to download. However, the page layout is the same for each of these so the code should be fairly easy to automate.
I will provide the login for each website and detailed instructions for the specific PDF downloads I want you to automate.
The code will need to be re-usable, so you must design it so that it can be automatically updated and is easy to run. Please provide written instructions explaining the code so that I can understand your method.
I do not care if you download the PDFs or just provide the code so I can download them as long as the code works.
I am not an expert on webscrapes but I think this is a moderate difficult scrape and not extremely complex.
I require that any potential contractor review each website first to determine if they can successfully complete this project or not. At that point, I will pay 20% up front and we can negotiate milestone payments. Once you have completed the code, I will pay the full amount.
Other open jobs by this client
- Hourly – Data Assistant for SEC Filings