Description

Project ESPAÑOL: An Exploration Of Spanish-language Poetry And Nuances Of Language

View the application at: http://dmerullo.pythonanywhere.com/

Project ESPAÑOL is an application that leverages high-throughput analysis of classical Spanish-language poems to create custom lesson plans and guide discovery of new texts for practice and study. The dataset contains 10,000 Spanish-language poems in the public domain from 200 authors across 500 years, totaling 2.5 million words. An unsupervised clustering algorithm in Python grouped the poems into 4 difficulty levels by analyzing the frequencies of 10,000+ verb forms conjugated across 18 grammatical tenses for the 550+ most common verbs. The full texts, metadata, and grammatical statistics can be accessed in an interactive Plotly Dash application.

I scraped the poems from https://poesi.as/ with the R package rvest. I only included poems whose authors lived before the start of World War II (1939)-- I thought this was a safe time to only include texts in the public domain, but this date may change as I learn more about copyright laws.

In creating this project, I came across a GitHub repository from the POSTDATA Project that similarly contains classical Spanish-language poems. Although I did all of the work here independently, I want to reference this project because it predates Project ESPAÑOL.

Release History

2024-12-21 Version 0.0: Minimum viable product!

Coming Soon

Improved interface for application
Detailed analytics on poem dataset
Recommendation algorithm
Incorporation of additional grammatical features for computational analysis
Addition of more poems and authors

Community

If you would like to support this and future projects, please send a donation!

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Description

Release History

Coming Soon

Community

About

Uh oh!

Releases

Packages

dmerullo/project-espanol

Folders and files

Latest commit

History

Repository files navigation

Description

Release History

Coming Soon

Community

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages