Project Title: Scrape Poetry Websites
Project Description:
I’m looking for someone who can scrape these websites for me:
1. http://ukrlit.org/tvory/poeziia_poemy_virshi
2. http://users.belgacom.net/babowal/
They both follow the same structure: Poet names -> Book titles -> Poem titles -> Poems
For every poet, the output should be a folder (named after the poet) containing one folder per book (named after the book title), and each book folder should contain that book's poems as plain .txt files.
Ideally each book folder should have a .json file with metadata about the book.
Here’s how the output should look:
Poet Name
Poem Title
Poem Content
Book Name
Book Title
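To make the expected layout concrete, here is a minimal Python sketch of how one scraped book might be written to disk. It covers only the output step, not the scraping itself. The function names (safe_name, write_book), the metadata file name (metadata.json), and the metadata keys are assumptions chosen for illustration, not requirements of the project.

```python
import json
import re
import tempfile
from pathlib import Path


def safe_name(name: str) -> str:
    """Replace characters that are not allowed in file or folder names."""
    cleaned = re.sub(r'[\\/:*?"<>|]', "_", name).strip().rstrip(".")
    return cleaned or "untitled"


def write_book(root: Path, poet: str, book: str, poems: dict[str, str]) -> Path:
    """Write one book as <root>/<poet>/<book>/<poem title>.txt plus metadata.json.

    `poems` maps each poem title to its full text.
    """
    book_dir = root / safe_name(poet) / safe_name(book)
    book_dir.mkdir(parents=True, exist_ok=True)

    # One plain-text file per poem, UTF-8 so Cyrillic text survives intact.
    for title, text in poems.items():
        (book_dir / f"{safe_name(title)}.txt").write_text(text, encoding="utf-8")

    # Hypothetical metadata layout; the real fields are up to the client.
    metadata = {"poet": poet, "book": book, "poems": list(poems)}
    (book_dir / "metadata.json").write_text(
        json.dumps(metadata, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return book_dir
```

Calling write_book(Path("output"), poet_name, book_title, poems) once per scraped book would build up the Poet -> Book -> Poem folder tree described above. Titles are sanitized because poem and book titles can contain characters such as "?" or ":" that some file systems reject.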
Please contact me using the contact information provided to discuss the project further.
For similar work requirements, feel free to email us at info@logicwis.com.