System Administrator Guide

This guide explains the deployment and configuration of MedCo instances.

Specifications
Deployment

Specifications

We recommend the following specifications for running MedCo:

Network Bandwidth: >100 Mbps (ideal), >10 Mbps (minimum), symmetrical
Ports Opening and IP Restrictions: see Network Architecture
Hardware
CPU: 8 cores (ideal), 4 cores (minimum)
RAM: >16 GB (ideal), >8 GB (minimum)
Software
OS: Any flavor of Linux, physical or virtualized (tested with Ubuntu 16.04, 18.04, Fedora 29)
Software: OpenSSL, Docker (tested with Docker 18.09.1) and docker-compose

Deployment

These pages explain how to deploy MedCo in different scenarios. Each deployment scenario corresponds to a deployment profile, as described below. All these instructions use the deployment scripts from the repository.

If you are new to MedCo…

… and want to try to deploy the system on a single machine to test it, you should follow the Local Test Deployment guide.

… and want to create or join a MedCo network, you should follow the Network Test Deployment guide.

… and want to develop around MedCo, you should follow the Local Development Deployment guide.

Deployment Profiles

A deployment profile is composed of two things:

a compose profile in ~/medco-deployment/compose-profiles/<profile name>/: docker-compose file and parameters like ports to expose, log level, etc.
a configuration profile in ~/medco-deployment/configuration-profiles/<profile name>/: files mounted in the docker containers, containing the cryptographic keys, the certificates, etc.
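
As a quick sanity check of the layout described above, the two parts of a profile can be located with a few lines of Python. This is a minimal sketch, not part of the MedCo tooling: the function names are hypothetical, and it assumes the repository is checked out in ~/medco-deployment as in the rest of this guide.

```python
from pathlib import Path

# Checkout location used throughout this guide (an assumption of this sketch).
DEFAULT_ROOT = Path.home() / "medco-deployment"


def profile_dirs(profile_name, root=DEFAULT_ROOT):
    """Return the compose and configuration directories of a deployment profile."""
    root = Path(root)
    return {
        "compose": root / "compose-profiles" / profile_name,
        "configuration": root / "configuration-profiles" / profile_name,
    }


def missing_profile_dirs(profile_name, root=DEFAULT_ROOT):
    """List which of the two profile parts do not exist on disk."""
    return [kind for kind, path in profile_dirs(profile_name, root).items()
            if not path.is_dir()]
```

Calling missing_profile_dirs("test-local-3nodes") returns an empty list when both directories are present.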

Some profiles are provided by default, for development or testing purposes. They must not be used in a production scenario with real data: their private keys are the publicly distributed defaults and are therefore not private. Other types of profiles must be generated using the scripts in ~/medco-deployment/resources/profile-generation-scripts/<profile name>/.

The different profiles are the following:

test-local-3nodes
- for tests on a single machine (used, for example, by the MedCo public demo)
- 3 nodes on any host

The database is pre-loaded with some encrypted test data using a key that is pre-generated from the combination of all the participating nodes’ public keys. For the test-network deployment profile this data will not be correctly encrypted, since the public key of each node is generated independently, and, as such, the data must be re-loaded.

Local Test Deployment

Profile test-local-3nodes

This test profile deploys 3 MedCo nodes on a single machine for test purposes. It can be used either on your local machine or on any other machine to which you have access. The docker images used are the latest released versions. This profile is used, for example, for the MedCo public demo.

MedCo Stack Deployment

The first step is to get the latest release of MedCo Deployment.

The next step is to download the docker images:

The default configuration of the deployment is suitable if the stack is deployed on your local host and you do not need to modify the default passwords. Otherwise, edit the file ~/medco-deployment/compose-profiles/test-local-3nodes/.env to reflect your configuration. For example:

MEDCO_NODE_URL should be the fully qualified domain name of the host, and HTTP_SCHEME should be http or https. The other fields control the default passwords of the various services. Note that setting the passwords this way works only on the first deployment. If the passwords need to be updated later, use each component's own mechanism for changing its password.
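
Before starting the stack, the two settings named above can be checked programmatically. The following is a minimal sketch under stated assumptions: it treats the .env file as plain KEY=VALUE lines, the function names are hypothetical, and medco.example.org in the usage note is a placeholder host. It only knows about MEDCO_NODE_URL and HTTP_SCHEME; the password fields are left alone.

```python
def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def check_env(values):
    """Return a list of problems with the two settings described in this guide."""
    problems = []
    if values.get("HTTP_SCHEME") not in ("http", "https"):
        problems.append("HTTP_SCHEME must be http or https")
    if not values.get("MEDCO_NODE_URL"):
        problems.append("MEDCO_NODE_URL must be the fully qualified domain name of the host")
    return problems
```

For instance, check_env(parse_env("MEDCO_NODE_URL=medco.example.org\nHTTP_SCHEME=https")) reports no problems.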

Follow HTTPS Configuration to set up the certificates needed for HTTPS. If you are deploying without HTTPS on a host other than the local host, take note of the following: Disabling HTTPS requirement for external connections.

The final step is to run the nodes; all three will run simultaneously:

Wait for the containers to finish initializing, i.e. until the message “i2b2-medco-srv… - Started x of y services (z services are lazy, passive or on-demand)” appears; this can take up to 10 minutes. Subsequent startups will be faster. To stop the containers, hit Ctrl+C in the active window.
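
Spotting the startup message in a long log can be automated. The sketch below assumes only the wording of the message quoted above; the function names and the sample numbers in the usage note are hypothetical.

```python
import re

# Matches the "Started x of y services" part of the startup message.
READY_PATTERN = re.compile(r"Started (\d+) of (\d+) services")


def is_ready_line(line):
    """True if a log line contains the startup-complete message."""
    return READY_PATTERN.search(line) is not None


def first_ready_line(lines):
    """Return the first log line announcing that services started, or None."""
    for line in lines:
        if is_ready_line(line):
            return line
    return None
```

Feeding it the captured console output line by line, first_ready_line returns None until a line such as "i2b2-medco-srv0 - Started 10 of 12 services (2 services are lazy, passive or on-demand)" shows up.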

You can use the command docker-compose up -d instead to run MedCo in the background, which leaves the console free. In that case, use docker-compose stop to stop the containers.

Keycloak Configuration

Follow the instructions from Keycloak Configuration to be able to use Glowing Bear.

Test the deployment

In order to test that the local test deployment of MedCo is working, access Glowing Bear in your web browser at http(s)://<domain name> and use the credentials previously configured during the Keycloak Configuration. If you are new to Glowing Bear you can watch the video.

By default, MedCo loads a specific set of test data; refer to its description for the expected results of queries. To load a dataset, follow the Loading Data guide. For reference, the database address (host) to use during loading is <domain name>:5432 and the databases are i2b2medcosrv0, i2b2medcosrv1 and i2b2medcosrv2.
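
The loading targets follow a regular pattern, one database per node on the same host and port. A small sketch, with a hypothetical function name, that enumerates them:

```python
def local_test_databases(domain_name, node_count=3, port=5432):
    """Return (host address, database name) pairs, one per node of the local profile."""
    return [(f"{domain_name}:{port}", f"i2b2medcosrv{index}")
            for index in range(node_count)]
```

With the default arguments this yields the three databases listed above for the given domain name.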

Network Test Deployment

Profile test-network

This test profile deploys an arbitrary set of MedCo nodes independently on different machines that together form a MedCo network. This deployment assumes each node is deployed on a single dedicated machine. All the machines must be able to reach each other. Nodes should agree beforehand on a network name and on individual indexes (each node is assigned a unique ID). The node with index 0 is the central node, which is the only one running Glowing Bear, PICSURE and Keycloak.

The next set of steps must be executed individually by each node of the network.

Preliminaries

The first step is to get the latest release of MedCo Deployment on each node.

Generation of the Deployment Profile

Next, the compose and configuration profiles must be generated using a script, executed in two steps.

Step 1: each node generates its keys and certificates, and shares its public information with the other nodes
Step 2: each node collects the public keys and certificates of all the other nodes

For step 1, the network name should be common to all the nodes. <node DNS name> corresponds to the machine domain name where the node is being deployed. As mentioned before the different parties should have agreed beforehand on the members of the network, and assigned an index to each different node to construct its UID (starting from 0, to n-1, n being the total number of nodes). Remember that node 0 is the central node.

This script will generate the compose profile and part of the configuration profile, including a file srv<node index>-public.tar.gz. This file should be shared with the other nodes, and all of them need to place it in their configuration profile folder (~/medco-deployment/configuration-profiles/test-network-<network name>-node<node index>).
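
Before moving on to step 2, each node can verify that it has received the public archive of every member of the network. The sketch below only relies on the naming conventions given above (test-network-<network name>-node<node index> and srv<node index>-public.tar.gz); the function names are hypothetical and it does not inspect the archive contents.

```python
from pathlib import Path


def network_profile_name(network_name, node_index):
    """Name of the profile generated for one node of a test network."""
    return f"test-network-{network_name}-node{node_index}"


def public_archive_name(node_index):
    """Name of the public archive a node shares with the others."""
    return f"srv{node_index}-public.tar.gz"


def missing_public_archives(profile_dir, node_count):
    """List the archives (indexes 0 to n-1) not yet present in the profile folder."""
    profile_dir = Path(profile_dir)
    return [public_archive_name(index) for index in range(node_count)
            if not (profile_dir / public_archive_name(index)).is_file()]
```

An empty result from missing_public_archives means all n archives are in place in the configuration profile folder.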

Once all nodes have shared their srv<node index>-public.tar.gz file with all other nodes, step 2 can be executed:

At this point, it is possible to edit the default configuration generated in ~/medco-deployment/configuration-profiles/test-network-<network name>-node<node index>/.env. This is needed if you want to modify the default passwords. When editing this file, be careful to change only the passwords and not the other values. Note that setting the passwords this way works only on the first deployment. If the passwords need to be updated later, use each component's own mechanism for changing its password.

The deployment profile is now ready to be used.

MedCo Stack Deployment

The next step is to download the docker images and run the node. The process is different for the central node and for the other nodes. If you manage the central node, run the following:

If you manage a node other than the central one (index > 0), run the following:

Wait for the containers to finish initializing; this can take up to 10 minutes. Subsequent startups will be faster. You can use docker-compose -f docker-compose... stop to stop the containers.

Keycloak Configuration

Follow the instructions from Keycloak Configuration; you should then be able to log in to Glowing Bear.

Data Loading

Contrary to the other deployment profiles, the default test data will not work (queries will fail), since it is not encrypted with the collective key that was generated (the encryption key derived from all the nodes’ public keys). Run the MedCo loader (see Loading Data) to be able to test this deployment. For reference, the database address (host) to use during loading is <domain name>:5432 and the database is i2b2medco.

Test the deployment

In order to test that the network deployment of MedCo is working, access Glowing Bear in your web browser at http://<node domain name> and use the credentials previously configured during the Keycloak Configuration. If you are new to Glowing Bear you can watch the video.

Note that by default the certificates generated by the script are self-signed, so the browser will issue a security warning when using Glowing Bear. To use your own valid certificates, see HTTPS Configuration.

Local Development Deployment

Profile dev-local-3nodes

This deployment profile deploys 3 MedCo nodes on a single machine for development purposes. It is meant to be used only on your local machine, i.e. localhost. The tags of the docker images used are all dev, i.e. the ones built from the development version of the different source codes. They are available either through Docker Hub, or built locally.

MedCo Stack Deployment (except Glowing Bear)

The first step is to clone the medco-deployment repository with the correct branch. This example puts the repository in the home directory of the current user, but that can be changed.

The next step is to build the docker images:

Note that instead of building the dev docker images locally, it is possible to download them from Docker Hub by using docker-compose pull, but there is no guarantee on the current status of those images, as they are built automatically.

The next step is to run the nodes. They will run simultaneously, and the logs of the running containers will keep the console captive. No configuration changes are needed in this scenario before running the nodes. To run them:

Glowing Bear Deployment

The first step is to clone the glowing-bear repository with the correct branch.

Glowing Bear is deployed separately for development, as we use its convenient live development server:

Note that the first run takes a significant amount of time, as everything has to be built.

In order to stop the containers, simply hit Ctrl+C in all the active windows.

Keycloak Configuration

Follow the instructions from Keycloak Configuration to be able to use Glowing Bear.

Test the deployment

In order to test that the development deployment of MedCo is working, access Glowing Bear in your web browser at http://localhost:4200 and use the credentials previously configured during the Keycloak Configuration. If you are new to Glowing Bear you can watch the video.

By default, MedCo loads a specific set of test data; refer to its description for the expected results of queries. To load a dataset, follow the Loading Data guide. For reference, the database address (host) to use during loading is localhost:5432 and the databases are i2b2medcosrv0, i2b2medcosrv1 and i2b2medcosrv2.

Configuration

Keycloak Configuration

Here follow some MedCo-specific instructions for the administration of Keycloak. For anything else, please refer to the Keycloak Server Administration Guide.

Accessing the web administration interface

In the case of the development profile dev-local-3nodes (i.e. without reverse proxy), the address is http://localhost:8081/auth/admin. In the other cases (with the reverse proxy), the address is http://<node domain name>/auth/admin. The credentials are:

User: keycloak
Password: keycloak by default, or whatever else was configured at the initial deployment.
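
The address rule above can be written down as a small helper, for example for use in a bookmark generator or a health-check script. This is only a sketch of the two cases stated in this section; the function name is hypothetical.

```python
def keycloak_admin_url(profile, node_domain_name="localhost"):
    """Return the Keycloak administration address for a deployment profile."""
    if profile == "dev-local-3nodes":
        # Development profile: no reverse proxy, Keycloak is reached on port 8081.
        return "http://localhost:8081/auth/admin"
    # Other profiles: Keycloak sits behind the reverse proxy of the node.
    return f"http://{node_domain_name}/auth/admin"
```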

Disabling HTTPS requirement for external connections

When deploying the test-local-3nodes profile without HTTPS on a machine other than localhost, the administration interface will refuse to load. To solve this, access pgAdmin (see Administration with PgAdmin) and execute the following SQL on the keycloak database:

You need to restart the Keycloak docker container for the changes to take effect.

Import MedCo Default Settings

Import the provided realm configuration into Keycloak. This will create the MedCo client with the appropriate roles.

Go to the Import menu
Click on Select file and select the file keycloak-medco-realm.json that you will find in ~/medco-deployment/resources/configuration.

Configure the MedCo OpenID Connect client

In the Settings tab, fill Valid Redirect URIs according to the following table:

In the same tab, fill Web Origins with + and save.

User Management

Add a user

Go to the configuration panel Users, click on Add user.
Fill the Username field, toggle the Email Verified button to ON and click Save.

Give query permissions to a user

Go to the configuration panel Users, search for the user you want to give authorization to and click on Edit.
Go to the Role Mappings tab, and select medco (or another client ID set up for the MedCo OIDC client) in the Client Roles.
Add the roles you wish to give the user; each role maps to a query type.

Configuring SwitchAAI Authentication

This guide walks you through the process of configuring Keycloak as a Service Provider to one or more SwitchAAI identity provider(s), in order for MedCo to rely on SwitchAAI for user authentication.

Prerequisites

A MedCo network is up and running, with one or more functional Keycloak instances within the network.

Loading Data

The current version offers two loading alternatives: (v0) loading of clinical and genomic data based on MAF datasets; and (v1) loading of generic i2b2 data. Currently, each of these loaders supports one dataset:

v0: a genomic dataset (tcga_cbio, publicly available)
v1: a generic i2b2 dataset

Future releases of this software will allow for other arbitrary data sources, given that they follow a specific structure (e.g. BAM format).

Pre-Requisites

Download test data

From the medco-deployment folder, execute the resources/data/download.sh script to download the test datasets.

v0 (Genomic Data)

The v0 loader expects an ontology, with mutation and clinical data in the MAF format. As the ontology data you must use ~/medco-deployment/resources/data/genomic/tcga_cbio/clinical_data.csv and ~/medco-deployment/resources/data/genomic/tcga_cbio/mutation_data.csv. For clinical data you can keep using the same two files or a subset of the data (e.g. 8_clinical_data.csv). More information about how to generate sample datafiles can be found below. After the following script is executed all the data is encrypted and ‘deterministically tagged’ in compliance with the MedCo data model.

Example

The following example loads data into node 0 of a running MedCo development deployment (dev-local-3nodes). To load the two other nodes of this profile, adapt the docker-compose service being run accordingly.

Explanation of the arguments:

Data Manipulation

Inside ~/medco-loader/data/scripts/ you can find a small Python application to extract (or replicate) data out of the original tcga_cbio dataset. You can decide which patients you want to include in your ‘new’ dataset or simply pick a random sample.

To check that it is working you can query for:

-> MedCo Genomic Ontology -> Gene Name -> BRPF3

For the small dataset 8_xxxx you should obtain 3 matching subjects (one at each site).

Command-Line Interface (CLI)

MedCo provides a client command-line interface (CLI) to interact with the medco-connector APIs.

Prerequisites

To use the CLI, you must first follow one of the deployment guides. However, the version of the CLI documented here is the one shipped with the Local Development Deployment.

How to use it

To show the CLI manual, run:

For a start, you can use the credentials of the default user (username: test, password: test).

You will also need to specify the URL of the MedCo connector node you want to interact with by setting the MEDCO_CONNECTOR_URL variable in compose-profiles/docker-compose-definitions.yml.

query

You can use this command to query the MedCo network.

This is the syntax of an example query using the pre-loaded default test data.

You will get something like this:

Note that, in the queries, the OR operator has the highest priority, so 1 AND 2 OR 3 AND 2 is evaluated as (1) AND (2 OR 3) AND (2).
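
To make the precedence rule concrete, the following sketch groups a query the same way: it splits on AND first and on OR second, so that OR binds tighter. It is an illustration of the rule only, not the CLI's parser; the function names are hypothetical and the numeric items stand for whatever query items you use.

```python
def group_query(query):
    """Split a query into AND-ed groups of OR-ed items (OR binds tighter than AND)."""
    groups = []
    for and_term in query.split(" AND "):
        groups.append([item.strip() for item in and_term.split(" OR ")])
    return groups


def evaluate(groups, truth):
    """Evaluate grouped items against a dict mapping each item to True or False."""
    return all(any(truth[item] for item in group) for group in groups)
```

group_query("1 AND 2 OR 3 AND 2") returns [["1"], ["2", "3"], ["2"]]. A subject matching items 2 and 3 but not item 1 is therefore excluded, whereas it would be included if AND had the higher priority.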

genomic-annotations-get-values

You can use this command to get the values of the genomic annotations that MedCo nodes make available for queries.

To do some tests, you may want to load some data first (see Loading Data).

Then, for example, if you want to know which genomic annotations of type "protein_change" containing the string "g32" are available, you can run:

You will get:

The matching is case-insensitive and it is not possible to use wildcards. At the moment only three types of genomic annotations are available: variant_name, protein_change and hugo_gene_symbol.
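
The matching behaviour described above (case-insensitive, substring, no wildcards) can be mimicked locally as follows. This is a sketch of the semantics only, not of the server implementation; the function name and the sample values in the usage note are hypothetical.

```python
# The three annotation types currently available, as listed above.
ANNOTATION_TYPES = ("variant_name", "protein_change", "hugo_gene_symbol")


def matching_values(available, annotation_type, fragment):
    """Return the values containing the fragment, ignoring case; '*' is not a wildcard."""
    if annotation_type not in ANNOTATION_TYPES:
        raise ValueError(f"unknown annotation type: {annotation_type}")
    needle = fragment.lower()
    return [value for value in available if needle in value.lower()]
```

For example, matching_values(["G325R", "A10T"], "protein_change", "g32") returns ["G325R"], while the fragment "g3*" matches nothing because the asterisk is compared literally.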

genomic-annotations-get-variants

You can use this command to get the variant ID of a certain genomic annotation.

To do some tests, you may want to load some data first (see Loading Data).

Then, for example, if you want to know the variant ID of the genomic annotation "HTR5A" of type "hugo_gene_symbol" with zygosity "heterozygous" or "homozygous", you can run:

You will get:

The matching is case-insensitive and it is not possible to use wildcards. If you request the ID of an annotation which is not available (e.g., "HTR5" in the previous example), you will get an error message. At the moment only three types of genomic annotations are available: variant_name, protein_change and hugo_gene_symbol.
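
Unlike the value search, this lookup needs the full annotation name: a prefix such as "HTR5" is an error, not a partial match. A minimal sketch of that behaviour, assuming a hypothetical in-memory index from annotation names to variant IDs (the IDs in the usage note are made up):

```python
def variant_ids(index, annotation):
    """Look up variant IDs by full annotation name, ignoring case."""
    lowered = {name.lower(): ids for name, ids in index.items()}
    try:
        return lowered[annotation.lower()]
    except KeyError:
        # Mirrors the error returned for an annotation that is not available.
        raise LookupError(f"annotation {annotation!r} is not available") from None
```

With index = {"HTR5A": ["id-1", "id-2"]}, variant_ids(index, "htr5a") returns both IDs and variant_ids(index, "HTR5") raises LookupError.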

HTTPS Configuration

HTTPS is supported for the profiles test-local-3nodes and test-network.

Certificate

The certificates are held in the configuration profile folder (e.g., ~/medco-deployment/configuration-profiles/test-local-3nodes):

certificate.key: private key
certificate.crt: certificate of own node
srv0-certificate.crt, srv1-certificate.crt, …: certificates of all nodes of the network
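
A configuration profile can be checked for the files listed above before the deployment is (re)started. The sketch below only tests for their presence, not their validity; the function names are hypothetical.

```python
from pathlib import Path


def expected_certificate_files(node_count):
    """Own key and certificate, plus one certificate per node of the network."""
    own = ["certificate.key", "certificate.crt"]
    return own + [f"srv{index}-certificate.crt" for index in range(node_count)]


def missing_certificate_files(profile_dir, node_count):
    """List the expected certificate files absent from the configuration profile folder."""
    profile_dir = Path(profile_dir)
    return [name for name in expected_certificate_files(node_count)
            if not (profile_dir / name).is_file()]
```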

Enable HTTPS for the Test Local Deployment

To enable HTTPS for the profile test-local-3nodes, replace the files certificate.key and certificate.crt in the configuration profile folder with your own versions. Such a certificate can be obtained, for example, from a certificate authority trusted by browsers.

Then edit the file .env of the compose profile, change HTTP_SCHEME from http to https, and restart the deployment.

Configure HTTPS for the Test Network Deployment

For this profile, HTTPS is mandatory. The profile generation script generates and uses default self-signed certificates for each node. Those are perfectly fine to use, but because they are self-signed, an HTTPS warning will be displayed to users in their browser when accessing Glowing Bear. There are two ways of avoiding this warning:

Configuring the browsers of your users to trust this certificate. This procedure is specific to the browsers and operating systems used at your site.
Using a certificate issued by an authority trusted by your users' browsers: see below.

If you wish to use a certificate of your own, gather its key and the certificate itself. Note that using your own certificate is only needed on the central node (as it is the one hosting the web application Glowing Bear). In the configuration profile of the central node (~/medco-deployment/configuration-profiles/test-network-<network name>-node<node index>/), copy the certificate and its key to the files certificate.crt and certificate.key respectively. Then copy the file certificate.crt to srv0-certificate.crt. Restart the deployment and the central node configuration is ready.

Now the other nodes need to get this certificate in order to trust it. Copy the srv0-certificate.crt file into the configuration profile directory of each of the other nodes, and restart all the deployments. The configuration of HTTPS is now ready.
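
The file operations of this procedure can be scripted. The sketch below is an illustration under stated assumptions: the function names are hypothetical, and the second function assumes the other nodes' configuration profile directories are reachable as local paths, which in a real network means the file is first transferred to each machine. Restarting the deployments is left to the operator.

```python
import shutil
from pathlib import Path


def install_central_certificate(cert_path, key_path, central_profile_dir):
    """Place a custom certificate and key in the central node's configuration profile."""
    profile = Path(central_profile_dir)
    shutil.copyfile(cert_path, profile / "certificate.crt")
    shutil.copyfile(key_path, profile / "certificate.key")
    # The central node has index 0, so its certificate is also srv0-certificate.crt.
    shutil.copyfile(profile / "certificate.crt", profile / "srv0-certificate.crt")


def share_central_certificate(central_profile_dir, other_profile_dirs):
    """Copy srv0-certificate.crt into the configuration profile of each other node."""
    source = Path(central_profile_dir) / "srv0-certificate.crt"
    for other in other_profile_dirs:
        shutil.copyfile(source, Path(other) / "srv0-certificate.crt")
```

Only the certificate is shared with the other nodes; the private key never leaves the central node's profile.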
