v0 (Genomic Data)
The v0 loader expects an ontology, with mutation and clinical data in the MAF format. As the ontology data you must use ${MEDCO_SETUP_DIR}/test/data/genomic/tcga_cbio/clinical_data.csv and ${MEDCO_SETUP_DIR}/test/data/genomic/tcga_cbio/mutation_data.csv. For clinical data you can keep using the same two files or a subset of the data (e.g. 8_clinical_data.csv). More information about how to generate sample data files can be found below. After the following script is executed all the data is encrypted and deterministically tagged in compliance with the MedCo data model.

How to use

Ensure you have downloaded the data before proceeding to the loading.
The following examples show you how to load data into a running MedCo deployment. Adapt accordingly the commands your use-case.

Examples

Loading the three nodes on the dev-local-3nodes profile

1
export MEDCO_SETUP_DIR=~/medco \
2
MEDCO_DEPLOYMENT_PROFILE=dev-local-3nodes
3
cd "${MEDCO_SETUP_DIR}/deployments/${MEDCO_DEPLOYMENT_PROFILE}"
4
docker-compose -f docker-compose.tools.yml run medco-loader-srv0 v0 \
5
--ont_clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
6
--sen /data/genomic/sensitive.txt \
7
--ont_genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
8
--clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
9
--genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
10
--output /data/
11
docker-compose -f docker-compose.tools.yml run medco-loader-srv1 v0 \
12
--ont_clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
13
--sen /data/genomic/sensitive.txt \
14
--ont_genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
15
--clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
16
--genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
17
--output /data/
18
docker-compose -f docker-compose.tools.yml run medco-loader-srv2 v0 \
19
--ont_clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
20
--sen /data/genomic/sensitive.txt \
21
--ont_genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
22
--clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
23
--genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
24
--output /data/
Copied!

Loading one node on a network-test profile

1
export MEDCO_SETUP_DIR=~/medco \
2
MEDCO_DEPLOYMENT_PROFILE=test-network-xxx-node0
3
cd "${MEDCO_SETUP_DIR}/deployments/${MEDCO_DEPLOYMENT_PROFILE}"
4
docker-compose -f docker-compose.tools.yml run medco-loader v0 \
5
--ont_clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
6
--sen /data/genomic/sensitive.txt \
7
--ont_genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
8
--clinical /data/genomic/tcga_cbio/8_clinical_data.csv \
9
--genomic /data/genomic/tcga_cbio/8_mutation_data.csv \
10
--output /data/
Copied!

Explanation of the command's arguments

1
NAME:
2
medco-loader v0 - Load genomic data (e.g. tcga_bio dataset)
3
4
USAGE:
5
medco-loader v0 [command options] [arguments...]
6
7
OPTIONS:
8
--group value, -g value UnLynx group definition file
9
--entryPointIdx value, --entry value Index (relative to the group definition file) of the collective authority server to load the data
10
--sensitive value, --sen value File containing a list of sensitive concepts
11
--dbHost value, --dbH value Database hostname
12
--dbPort value, --dbP value Database port (default: 0)
13
--dbName value, --dbN value Database name
14
--dbUser value, --dbU value Database user
15
--dbPassword value, --dbPw value Database password
16
--ont_clinical value, --oc value Clinical ontology to load
17
--ont_genomic value, --og value Genomic ontology to load
18
--clinical value, --cl value Clinical file to load
19
--genomic value, --gen value Genomic file to load
20
--output value, -o value Output path to the .csv files
Copied!

Test that the loading was successful

To check that it is working you can query for:
-> MedCo Gemomic Ontology -> Gene Name -> BRPF3
For the small dataset 8_xxxx you should obtain 3 matching subjects (one at each site).
Last modified 8mo ago