Releases: DT-UCPH/sp
SP with corrected phrase boundaries
Some corrections in phrase boundaries.
SP with phrase boundaries
The dataset contains the text of the SP with word level annotations, phrase atom boundaries and phrase boundaries.
ETCBC parsing
This release comes with a few corrections to the existing data as well as a parsing of all words according to the ETCBC conventions. The new feature is labelled ETCBC_parsing.
Corrections
This release comes with new corrections of the data. The release also contains a new feature ETCBC_parsing, a representation of the word according to the ETCBC conventions. The feature, however, needs further corrections and testing and is currently only reliable for Genesis.
Minor update
This release merges the phrase atom dataset (TF 4.0) with corrections made at an earlier stage.
Phrase atoms
This release contains a major extension of the dataset, now including phrase atom boundaries on top of the existing word level annotations.
Small fixes
Corrections of a few annotations in the SP data.
Minor updates
Text-Fabric dataset of the Samaritan Pentateuch.
The features are similar to those of the Biblia Hebraica Stuttgartensia Amstelodamensis (BHSA), so we refer to the BHSA feature documentation for more explanation of the features.
The text was provided by the Samaritanus-project based at Martin-Luther-Universität Halle-Wittenberg, directed by Stefan Schorch, and is based on a transcription MS Dublin Chester Beatty Library 751 (Gen 1-Deut 32:36) + MS Garizim 1 (Deut 32:36b-34), cf. Stefan Schorch (ed.), The Samaritan Pentateuch: A critical editio maior. Berlin: de Gruyter, 2018-.
The features released in this version include:
g_cons
lex
sp
g_vbs
g_pfm
g_lex
g_vbe
g_nme
g_uvf
g_prs
vt
ps
prs_ps
nu
prs_nu
gn
prs_gn
Features gn and prs_gn
Text-Fabric dataset of the Samaritan Pentateuch.
The features are similar to those of the Biblia Hebraica Stuttgartensia Amstelodamensis (BHSA), so we refer to the BHSA feature documentation for more explanation of the features.
The text was provided by the Samaritanus-project based at Martin-Luther-Universität Halle-Wittenberg, directed by Stefan Schorch, and is based on a transcription MS Dublin Chester Beatty Library 751 (Gen 1-Deut 32:36) + MS Garizim 1 (Deut 32:36b-34), cf. Stefan Schorch (ed.), The Samaritan Pentateuch: A critical editio maior. Berlin: de Gruyter, 2018-.
The features released in this version include:
g_cons
lex
sp
g_vbs
g_pfm
g_lex
g_vbe
g_nme
g_uvf
g_prs
vt
ps
prs_ps
nu
prs_nu
gn
prs_gn
minor changes in 6 features
A few changes in nu, trailer, g_nme, g_nme_utf8, g_lex and g_lex_utf8