Processing KidsFirst data for input in the C2M2 model to produce Level 1 tables

Download data from the KidsFirst (KF) portal

Visit the KF portal website: https://portal.kidsfirstdrc.org/dashboard
Log in using your Orchid ID (preferred), Gmail, or Facebook.
Select the File Repository tab on the main navigation bar at the top of the website.
Download the data:
- Click the columns option and select all columns. Click Export TSV.
- Click Download and choose the option File Manifest at the bottom of the dropdown menu.
Initial preprocessing: remove all the columns with no headers.
Select KF column names that correspond to the right C2M2 table IDs.

Building 'green' tables from core entity tables

This term-scanner script is used to auto-generate the green tables for the C2M2 Model Level 1 model. Currently, this script generates four of the five green tables for Level 1.

Default paths direct to the HMP example tsv files.

Inputs

It currently takes in biosample.tsv and file.tsv (two of the core-entity ETL instance TSVs, aka two of the three black tables) from the --draftDir (default is ../draft-C2M2_example_submission_data/HMP__sample_C2M2_Level_1_bdbag.contents)

It will load OBO and ontology files from --cvRefDir (default is external_CV_reference_files):

EDAM.version_1.21.tsv
OBI.version_2019-08-15.obo
uberon.version_2019-06-27.obo

Outputs

It will produce these four green tables for Level 1: file_format.tsv,data_type.tsv, assay_type.tsv, and anatomy.tsv. The outputs are saved in --outDir (default is ./007_HMP-specific_CV_term_usage_TSVs).

Run script

The term-scanner script is named build_term_tables.py and you can run it like so:

# with default directory locations: change directory to `model`
cd ./model
python build_term_tables.py

# full command, if not using any default paths
./build_term_tables.py --draftDir [path/to/tsv/file/dir] --cvRefDir [path/to/external/CV/ref/files/dir] --outDir [dir/path/where/you/want/outputs/saved]

Run it with -h for command line help.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
KF_sample_C2M2_Level_1_bdbag.contents		KF_sample_C2M2_Level_1_bdbag.contents
model		model
r_script		r_script
raw_data		raw_data
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Processing KidsFirst data for input in the C2M2 model to produce Level 1 tables

Download data from the KidsFirst (KF) portal

Building 'green' tables from core entity tables

Inputs

Outputs

Run script

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Processing KidsFirst data for input in the C2M2 model to produce Level 1 tables

Download data from the KidsFirst (KF) portal

Building 'green' tables from core entity tables

Inputs

Outputs

Run script

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages