Official repository of the KIPoS2020 shared task on KIParla Part of Speech tagging at Evalita 2020
The DEVELOPMENT SET for KIPOS2020 includes data from the KIParla corpus tagged according to the UD Part of Speech tagset for Italian and manually revised. They can be considered as the gold standard reference for training the systems that participate to the task.
The SILVER SET for KIPOS2020 includes a larger portion of data from the KIParla corpus automatically annotated but not manually revised (except for what concerns some systematic error). They can be also used for training and developing the systems that participate to the task.
It includes data to be used for testing systems that participate to the task. Data are tokenized like in the DEVELOPMENT and SILVER SETs withe a few exceptions regarding the tokenization of multiple tokens words and the heading lines that in the DEVELOPMENT and SILVER SETs introduce each conversation turn and have been removed in the TEST SET data. For all details see the KIPOS website.
The DEVELOPMENT SET and the SILVER SET for KIPOS2020 are available for donwload (file KIPOS2020-DS_rel290520+silverrel030720.zip in this repository). These datasets are both covered by the Creative Commons license provided in this repository and they are released with a password-protected zip archive. By filling in this form you can accept the license and send us the request for the password for unzip the archive. It will be sent (by email) in a few time after filling in the form.
Data are organized in files that correspond to single conversations. The name of each file is composed according to the following pattern:
- 
the first two letters correspond to the city where the recording was collected: TO (Torino) and BO (Bologna) 
- 
following two characters corresponding to the type of activity: A1 (office hours), A3 (random conversation), C1 (exams), D1 (lessons) and D2 (interviews); for the aim of KIPOS2020 C1, A1 and D1 are considered FORMAL, while A3 and D2 are considered INFORMAL. 
For more information on the task, see also the guidelines available in this repository (LAST UPDATE JULY 3rd!!!), and the web page of KIPOS2020.
You can also join our googlegroup at [email protected]. For any question or problem, please start a topic on this mailing list.