Thanks to visit codestin.com
Credit goes to github.com

Skip to content

dmerullo/project-espanol

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

Description

Project ESPAÑOL: An Exploration Of Spanish-language Poetry And Nuances Of Language

by Devin Merullo

View the application at: http://dmerullo.pythonanywhere.com/

Project ESPAÑOL is an application that leverages high-throughput analysis of classical Spanish-language poems to create custom lesson plans and guide discovery of new texts for practice and study. The dataset contains 10,000 Spanish-language poems in the public domain from 200 authors across 500 years, totaling 2.5 million words. An unsupervised clustering algorithm in Python grouped the poems into 4 difficulty levels by analyzing the frequencies of 10,000+ verb forms conjugated across 18 grammatical tenses for the 550+ most common verbs. The full texts, metadata, and grammatical statistics can be accessed in an interactive Plotly Dash application.

I scraped the poems from https://poesi.as/ with the R package rvest. I only included poems whose authors lived before the start of World War II (1939)-- I thought this was a safe time to only include texts in the public domain, but this date may change as I learn more about copyright laws.

In creating this project, I came across a GitHub repository from the POSTDATA Project that similarly contains classical Spanish-language poems. Although I did all of the work here independently, I want to reference this project because it predates Project ESPAÑOL.

Release History

  • 2024-12-21 Version 0.0: Minimum viable product!

Coming Soon

  • Improved interface for application
  • Detailed analytics on poem dataset
  • Recommendation algorithm
  • Incorporation of additional grammatical features for computational analysis
  • Addition of more poems and authors

Community

If you would like to support this and future projects, please send a donation!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published