Project Title: Scrape Poetry Websites

Project Description:

I’m looking for someone who can scrape these website for me:

1.http://ukrlit.org/tvory/poeziia_poemy_virshi,

2.http://users.belgacom.net/babowal/

They both follow the same structure: Poet names -> Book titles -> Poem title -> Poem

For every poet the output should be a folder (Poet name) that includes a list of folders (Book titles) that include a list of plain .txt files (Poems).

Ideally each book folder should have a .json file with metadata about the book.

Here’s how the output should look:
Poet Name
Poem Title
Poem Content
Book Name
Book Title

Please contact me on the given contact information to discuss more regarding the project.

For similar work requirement feel free to email us on info@logicwis.com.