Project Title: Extract Specific Information from Goodreads
I am looking for a way to extract specific information from Goodreads.com.
Every book page on Goodreads has a Q&A page, e.g: https://www.goodreads.com/book/2.Harry_Potter_and_the_Order_of_the_Phoenix/questions
I want the timestamp for the first question posted on this page, along with its answer timestamp.
As you can see, the timestamp appears in a relative way (“6 years ago”).
I wanted to get a sense if there is any way to extract this information more precisely.
One way is by using the Internet Archive: (see for example http://web.archive.org/web/20160505114338/https://www.goodreads.com/book/2.Harry_Potter_and_the_Order_of_the_Phoenix/questions).
If I have ~100,000 book titles, I want to get an estimate for the feasibility and cost of combining goodreads + internet archive, and getting month level precision on timestamp information for the first question and its answer.
For similar work requirement feel free to email us on firstname.lastname@example.org.
I need you to scrape the coming soon books from amazon weekly. You will need to save images and information in a CSV file.
I would like to start creating a DB of Book Authors and the books they have written. Should be scraped from https://www.fictiondb.com/author/book-lists-by-author~a.htm
I need someone to scrape between 1 to 2 million pages from ebooks from the free samples.
You will go through all ebook categories and scrape the free sample from all books in that category.
The scraped content should be in a text file for each ebook.
I need to collect books information from a website: https://www.pearson.com/.
I’m not sure about the number of books available. Can you please let me know the estimated number? Also, advise us the best quote for this service.
Can you provide me quotation to build a tool to scrape book details from https://www.flipkart.com/books-store?