# Data Loading

The current version of the loader offers two different loading alternatives: (*v0*) loading of clinical and genomic data based on MAF datasets; and (*v1*) loading of generic i2b2 data. Currently these two loaders support each one dataset:

* *v0*: a genomic dataset (*tcga\_cbio* publicly available in [cBioPortal](http://www.cbioportal.org/))
* *v1*: the [i2b2 demodata](https://www.i2b2.org/software/repository.html?t=demo\&p=15).

Future releases of this software will allow for other arbitrary data sources, given that they follow a specific structure (e.g. BAM format).

## Pre-Requisite: Download test data

From the *medco-deployment* folder, execute the resources/data/download.sh script to download the test datasets.

```bash
cd ${MEDCO_SETUP_DIR}/resources/data
bash download.sh
```

## Dummy Generation

The provided example data set files come with dummy data pre-generated. Those data are random dummy entries whose purpose is to prevent frequency attacks. In a future release, the generation will be done dynamically by the loader.
