Data Productivity Cloud Custom Connector

Extract case, exposure, diagnosis and gene expression quantification files from The Cancer Genome Atlas (TCGA)

This Data Productivity Cloud Custom Connector extracts and loads open access data from the Genomic Data Commons Data Portal for analysis.

Image ofExtract from the Genomic Data Commons Data Portal
Extract from the Genomic Data Commons Data Portal

Authentication

No authentication is required for open access data.

Endpoints

Parameters

The cases endpoint can be configured by setting query parameters:

The geq_files endpoint is a “Search and Retrieval” request that extracts a list of gene expression quantification file names. Users are expected to subsequently download the files one by one or in bulk using the GDC API’s file download functionality. Filters and field selection can be changed by editing the POST body as per the documentation.

Downloads

Licensed under: Matillion Free Subscription License

Download TCGA-GDC.json

Installation instructions

How to Install a Data Productivity Cloud Custom Connector

Author: Matillion