Published October 9, 2025 | Version v1.0
Dataset
Open
MicrobELP - annotated training sets and unannotated test set documents
Authors/Creators
Contributors
Annotator (5):
Researcher (3):
Description
This submission contains three sets of data:
- machine_train_with_ref.zip
- machine_train_without_ref.zip
- test_without_annotations.zip
The training set is provided with and without references and was machine-annotated using the pipeline code in the GitHub repository.
The test set documents are unannotated and can be used for performance benchmarking on Codabench.
All files are in BioC-JSON format, generated with Auto-CORPus.
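To get a first look at the archives, the files can be read directly from the zip without unpacking. The sketch below is only an illustration and rests on assumptions not stated in this record: that each archive member ending in `.json` is a BioC collection whose top level is an object with a `documents` list, and that each document holds `passages` carrying an optional `annotations` list (the usual BioC-JSON layout). The function names `iter_bioc_documents` and `summarize` are hypothetical and are not part of the microbELP code.

```python
import json
import zipfile
from typing import Dict, Iterator, Tuple


def iter_bioc_documents(zip_source) -> Iterator[Tuple[str, dict]]:
    """Yield (member_name, document) for each BioC document in a zip archive.

    Assumption: every .json member is a BioC collection object with a
    top-level "documents" list. Other members are skipped.
    """
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(".json"):
                continue  # skip directories and non-JSON members
            with archive.open(name) as handle:
                collection = json.load(handle)
            for document in collection.get("documents", []):
                yield name, document


def summarize(zip_source) -> Dict[str, int]:
    """Count documents, passages, and annotations found in the archive."""
    counts = {"documents": 0, "passages": 0, "annotations": 0}
    for _, document in iter_bioc_documents(zip_source):
        counts["documents"] += 1
        for passage in document.get("passages", []):
            counts["passages"] += 1
            # Unannotated test files are expected to contribute zero here.
            counts["annotations"] += len(passage.get("annotations", []))
    return counts
```

Calling `summarize("machine_train_with_ref.zip")` on a downloaded archive would return the three counts as a dictionary. If the internal layout differs from the assumptions above, the key names need adjusting.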
Files
machine_train_with_ref.zip
Additional details
Related works
- Is metadata for: Publication: 10.1101/2025.08.29.671515v1 (DOI)
Software
- Repository URL: https://github.com/omicsNLP/microbELP
- Development Status: Active