Working with Cancer Genomics Cloud datasets in a PostgreSQL database (Part 1)

Posted on Mon 12 October 2020 in SQL • Tagged with Bioinformatics, gene expression quantification, copy number variation, Windows

Introduction

Recently I have been looking for publicly available genomics datasets to test machine learning models in Python. While searching for such a “toy dataset”, I came upon the Cancer Genomics Cloud (CGC) initiative.

Anyone can register with CGC and access massive open-access public datasets, like The …


Working with Cancer Genomics Cloud datasets in a PostgreSQL database (Part 2)

Posted on Mon 12 October 2020 in SQL • Tagged with Bioinformatics, gene expression quantification, copy number variation, Windows

Introduction

Recently I have been looking for publicly available genomics datasets to test machine learning models in Python. While searching for such a “toy dataset”, I came upon the Cancer Genomics Cloud (CGC) initiative.

Anyone can register with CGC and access massive open-access public datasets, like The …
