This package is used to grab data from the spanish linguistic corpus PRESEEA and generate a basic report from it. The generated report is a .csv file. The aim is to rapidly retrieve data in a structured format on the researchers computer for further linguistic investigation. The researcher should be able to concentrate on a language model instead of copying that data from the Browser. Some basic analyses are done as well and added to the report.
PRESEEA is a spanish language corpus: https://preseea.linguas.net/Corpus.aspx
The example.py script shows a simple usage of the package.
Python 3.7
python -m unittest discover
pip install -i https://test.pypi.org/simple/ preseeapy
- Paul Obermayer - Initial work - twitter
This project is licensed under the Mozilla Public License Version 2.0 - see the LICENSE.md file for details