Project Title: Scrape Bio Pages of Actors from IMDB
Deliverable is a zip archive of all the bio pages of actors on IMDB.
They should be in folders, like that:
Actor ID1 directory/bio.html
Actor ID2 directory/bio.html
This is a list of all actors: https://datasets.imdbws.com/name.basics.tsv.gz
There are about 10 million pages.Actor ID is in the first column in the file above.
Pattern for pages that need to be downloaded:
Here is an example of a page:
What would be the cost?
Thank you and look forward to hearing from you.
For similar work requirement feel free to email us on firstname.lastname@example.org.