This repository contains the source code for the following publication:
Zhang, R., Hristovski, D., Schutte, D., Kastrin, A., Fiszman, M., & Kilicoglu, H. (2021). Drug repurposing for COVID-19 via knowledge graph completion. Journal of Biomedical Informatics, 115, 103696. https://doi.org/10.1016/j.jbi.2021.103696

Requirements:
- Python 3.6 with packages lxml, numpy, and pandas
- Perl 5 with module Text::NSP
- AWK
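
The requirements above can be checked before running anything. The following is a minimal sketch, not part of the repository: the function names are hypothetical, it treats Python 3.6 as a minimum version (an assumption), and it only checks that `perl` and `awk` are on the PATH. It does not verify that the Text::NSP Perl module is installed.

```python
import importlib.util
import shutil
import sys

# Names taken from the requirements list above.
REQUIRED_PACKAGES = ("lxml", "numpy", "pandas")
REQUIRED_TOOLS = ("perl", "awk")


def python_is_recent_enough(minimum=(3, 6)):
    """Return True if the running interpreter is at least `minimum`.
    Assumption: 3.6 is a lower bound rather than an exact requirement."""
    return sys.version_info[:2] >= minimum


def missing_packages(names=REQUIRED_PACKAGES):
    """Return the package names that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def missing_tools(names=REQUIRED_TOOLS):
    """Return the command-line tools that are not found on the PATH."""
    return [name for name in names if shutil.which(name) is None]
```

Calling `missing_packages()` and `missing_tools()` with no arguments returns the entries from the list above that still need to be installed; empty lists mean these checks found nothing missing.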

Repository structure:
- ./data directory contains input files
- ./preprocessing directory contains scripts for preparing data
- ./filtering directory contains scripts for filtering predications with BERT
- ./models directory contains scripts for knowledge graph completion
- ./predictions directory contains output files from graph completion models
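
As a quick sanity check of a local checkout, the layout described above could be verified with a few lines of Python. This is an illustrative sketch only; `missing_dirs` is a hypothetical helper, and the directory names are copied from the list above.

```python
from pathlib import Path

# Directory names as listed above, relative to the project's root folder.
EXPECTED_DIRS = ("data", "preprocessing", "filtering", "models", "predictions")


def missing_dirs(root="."):
    """Return the expected directories that do not exist under `root`."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
```

Note that ./data is expected to be missing in a fresh checkout, since it is created by hand before the input files are added.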

Usage:
- Download and set up SemMedDB
- Create ./data directory in project's root folder
- Prepare sub_rel_obj_pyear_edat_pmid_sent_id_sent.tsv.gz file and place it into the ./data/SemMedDB directory
- Download SemRepped CORD-19 dataset and extract files into ./data/cord-19 directory
- Prepare SemMedDB and CORD-19 data using the ./preprocessing/run.sh file
- Run Python notebooks in the ./filtering directory
- Run Python notebooks in the ./models directory
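
The data-preparation steps above can be sketched in Python as follows. This is a hedged illustration, not code from the repository: the helper names are hypothetical, the file is assumed to be UTF-8 encoded and tab-separated (inferred from the .tsv.gz extension), and nothing is assumed about its columns or about how ./preprocessing/run.sh consumes it.

```python
import csv
import gzip
from pathlib import Path

# File name taken from the steps above.
SEMMEDDB_FILE = "sub_rel_obj_pyear_edat_pmid_sent_id_sent.tsv.gz"


def prepare_data_dirs(root="."):
    """Create ./data/SemMedDB and ./data/cord-19 under `root` if missing."""
    data = Path(root) / "data"
    semmeddb, cord19 = data / "SemMedDB", data / "cord-19"
    for directory in (semmeddb, cord19):
        directory.mkdir(parents=True, exist_ok=True)
    return semmeddb, cord19


def missing_inputs(root="."):
    """List the inputs from the steps above that are not yet in place."""
    data = Path(root) / "data"
    problems = []
    if not (data / "SemMedDB" / SEMMEDDB_FILE).is_file():
        problems.append(f"missing {SEMMEDDB_FILE}")
    cord19 = data / "cord-19"
    if not cord19.is_dir() or not any(cord19.iterdir()):
        problems.append("no extracted CORD-19 files")
    return problems


def peek_rows(path, limit=3):
    """Return up to `limit` rows of a gzipped TSV as lists of strings.
    Assumption: UTF-8 text, tab-separated, no quoting."""
    with gzip.open(path, "rt", encoding="utf-8", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        return [row for _, row in zip(range(limit), reader)]
```

Once `missing_inputs()` returns an empty list, the remaining steps (running ./preprocessing/run.sh and then the notebooks) proceed as listed above. `peek_rows` is only a convenience for eyeballing the first few lines of the prepared file without decompressing it in full.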
Halil Kilicoglu (halil (at) illinois.edu)