FAIR_apps

FAIR Data in -> FAIR Application run -> FAIR Data out.

Today is a Learning exercise – and many are coming from many different angles – if we achieve even a portion of this ambitious objective we will have done a lot!

The learning objectives for this hackathon I have gathered from folks is as follows:

Learning about FAIR Applications – what does it mean and how does it happen? FAIR Data In, FAIR Applications, FAIR data out and moving forward
Learning about containers
Learning about workflows

The set up:

The concept is we want to connect the mouse models to human disease.

This is an idealized scenario, and for pedagogical principles, we are going through the exercise of the round robin of data from mouse to application to data from human. We may or may not succeed in actually connecting the dots, but that may not matter for this exercise.

The exact details of this question are to be suggestive and not necessarily conclusive as the best way to go about this problem, but the goal is to be as useful and pragmatic as possible as we wrap our heads around what are the most important things about FAIR that we need to concern ourselves with.

Example Phenotype Data set:

Please find attached the data (and I will be depositing these in github as our starting point).

Beginning with phenotype measures collected from a study, Chesler3, details were downloaded as json files from the phenome.jax.org using the RESTFul API. The measures for that study are detailed in the table that was downloaded as well, measures.csv and details for the study are found in studies.csv.

The attached R script reads this data in and merges the Chesler3 data with the measurements details.

The idea:

Beginning with mouse data, data from a specific mouse strain, subject to specific conditions, what are the measurements, the phenotypes that tell us about that mouse in this particular experiment. With this gene set use Geneweaver to find the homology to human gene sets.

The idea is to identify a mouse strain that has contrasting phenotypes for a syndrome of interest.

Making, keeping the data in FAIR:

Mouse gene list

Making, keeping the application FAIR:

Homology service to make a homologous gene list given mouse gene lists go to human gene lists and vice versa

Making, keeping the data out FAIR:

Human gene list

Beginning with human data, a new exome is obtained for an individual with a disease, and the exomiser is used to obtain a prioritized list of pathogenic regulatory variants in Mendialian disease. From these, a gene list could be derived.

Discussion:

Testing the data that are going into an application regarding whether that data adheres to the principles of FAIR, as determined by the FAIRShake application we will be introduced to today – Testing the application for its FAIRness

The path from mouse to human is the path we want to walk.

The path takes us from the Mouse Phenotype Database (MPD =phenome.jax.org) data with the mouse strains data for specific experiments.

From the mouse strain, we see the genes that are differentiated, affected with the strain. With the genes, we can look at the pathway and we get

Please find attached the studies that are available from the Mouse Phenotype Database, studies.csv And we have the measurements that we can get from the

Data Sources:

https://phenome.jax.org/downloads

https://phenome.jax.org/about/api

https://phenome.jax.org/about/termsofuse https://osp.od.nih.gov/scientific-sharing/policies/ https://phenome.jax.org/about/citing

Background:

Ontoforce with their tool DisQover
Lifebit.ai with their tool deploit
Databiology with their integrated platform.

Here are some details about containers:

https://biocontainers.pro/#/

http://biocontainers-edu.biocontainers.pro/en/latest/best_practices.html

http://biocontainers-edu.biocontainers.pro/en/latest/best_practices.html https://www.sylabs.io/2018/10/whitepaper-containerized-genomic-workflows-with-singularity-part-1/ https://www.sylabs.io/2018/11/whitepaper-containerized-genomic-workflows-with-singularity-part-2/ https://www.sylabs.io/2018/11/whitepaper-containerized-genomic-workflows-with-singularity-part-3/

https://github.com/nf-core

All applications and data will be given a FAIRShake

https://fairshake.cloud/

Herding cats

DisQover.com

http://54.211.41.249/#start

Lifebit.ai

https://deploit.lifebit.ai/public/jobs/5cb5f8d8b91c3200b25075dc

databiology.com

https://www.demo.databiology.net/dbe/userlab/show-project.html?tab=workunits&projectId=7

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.vs/config		.vs/config
data		data
databiology_apps/app-dbio-exomiser		databiology_apps/app-dbio-exomiser
deploit_apps/docker-exomiser		deploit_apps/docker-exomiser
docs		docs
jpg		jpg
mousemine		mousemine
results		results
starting_sets		starting_sets
validator		validator
.gitignore		.gitignore
BioITWorld2019HackathonJAXTeamWrapUp.pdf		BioITWorld2019HackathonJAXTeamWrapUp.pdf
BioITWorld2019HackathonJAXTeamv1.pptx		BioITWorld2019HackathonJAXTeamv1.pptx
LICENSE		LICENSE
README.md		README.md
Slide01.jpeg		Slide01.jpeg
Slide02.jpeg		Slide02.jpeg
Slide03.jpeg		Slide03.jpeg
Slide04.jpeg		Slide04.jpeg
Slide05.jpeg		Slide05.jpeg
Slide06.jpeg		Slide06.jpeg
Slide07.jpeg		Slide07.jpeg
Slide08.jpeg		Slide08.jpeg
Slide09.jpeg		Slide09.jpeg
Slide10.jpeg		Slide10.jpeg
Slide11.jpeg		Slide11.jpeg
Slide12.jpeg		Slide12.jpeg
Slide13.jpeg		Slide13.jpeg
Slide14.jpeg		Slide14.jpeg
Slide15.jpeg		Slide15.jpeg
Slide16.jpeg		Slide16.jpeg
Slide17.jpeg		Slide17.jpeg
Slide18.jpeg		Slide18.jpeg
Slide19.jpeg		Slide19.jpeg
Slide20.jpeg		Slide20.jpeg
Slide21.jpeg		Slide21.jpeg
Slide22.jpeg		Slide22.jpeg
Slide23.jpeg		Slide23.jpeg
Slide24.jpeg		Slide24.jpeg
Slide25.jpeg		Slide25.jpeg
Slide26.jpeg		Slide26.jpeg
Slide27.jpeg		Slide27.jpeg
Slide28.jpeg		Slide28.jpeg
google7924481fe858362f.html		google7924481fe858362f.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FAIR_apps

FAIR Data in -> FAIR Application run -> FAIR Data out.

The set up:

Example Phenotype Data set:

The idea:

Discussion:

Data Sources:

Background:

Here are some details about containers:

All applications and data will be given a FAIRShake

Herding cats

DisQover.com

Lifebit.ai

databiology.com

Final presentation

About

Uh oh!

Releases

Packages

Contributors 5

Uh oh!

Languages

License

NCBI-Hackathons/FAIR_apps

Folders and files

Latest commit

History

Repository files navigation

FAIR_apps

FAIR Data in -> FAIR Application run -> FAIR Data out.

The set up:

Example Phenotype Data set:

The idea:

Discussion:

Data Sources:

Background:

Here are some details about containers:

All applications and data will be given a FAIRShake

Herding cats

DisQover.com

Lifebit.ai

databiology.com

Final presentation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Uh oh!

Languages

Packages