Welcome to the Galaxy Human Cell Atlas project

Human Cell Atlas

The Human Cell Atlas Galaxy setup comprises of analysis tools and workflows for the analysis of Single Cell RNA-Seq data. It includes a module that connects to the Matrix Service API of the HCA’s Data Coordination Platform that enables retrieval of gene expression matrices from any data sets in the Human Cell Atlas. The instance is based on the Galaxy framework, which guarantees simple access, easy extension, flexible adaption to personal and security needs, and sophisticated analyses independent of command-line knowledge.

This setup aims to give users access to as much granularity as possible in terms of the downstream analysis steps provided by the major software for single cell data analysis: Scanpy, SC3, Scater and Seurat. For each of these tools, this Galaxy instance has decomposed modules for each the main functionalities: ingestion from 10x/loom, filtering (by cells or genes), scaling, normalisation, clustering, marker genes, and dimensionality reduction, among others. In the short term we expect to have interoperability between these tools through the Loom exchange format. Additionally, we provide specialised viewers for single cell clustering data: UCSC CellBrowser and cellxgene interactive tool (contributed by the great Galaxy community).

Tools available under HCA-Single Cell section were mainly brought to Galaxy by the Gene Expression Team at EMBL-EBI and the Teichmann Team at the Wellcome Sanger Institute.

Content

Get started

Are you new to Galaxy, or returning after a long time, and looking for help to get started? Take a guided tour through Galaxy’s user interface.

Available Workflows

WorkflowDescription
Human Cell Atlas - Scanpy - CellBrowserRetrieve data from the Human Cell Atlas matrix service, analysis with Scanpy and visualisation with UCSC CellBrowser
EBI Single Cell Expression Atlas - Scanpy - CellBrowserRetrieve expression matrices from Single Cell Expression Atlas, analysis with Scanpy and visualisation with UCSC CellBrowser
EBI Single Cell Expression Atlas Scanpy Prod 1.3Workflow used for clustering data in the release 6 to 9 of Single Cell Expression Atlas
EBI Single Cell Expression Atlas Tertiary Analysis Rel 10Workflow used for clustering data in the release 10 of Single Cell Expression Atlas
EBI Single Cell Expression Atlas Tertiary Analysis Rel 11Workflow used for clustering data in the releases 11 and 12 of Single Cell Expression Atlas
EBI Single Cell Expression Atlas Tertiary Analysis Rel 13Workflow used for downstream analysis in the releases 13 to 15 of Single Cell Expression Atlas.

Available tools

In this section we list all tools that have been integrated in the RNA workbench. The list is likely to grow as soon as further tools and workflows are contributed. To ease readability, we divided them into categories.

Single Cell Galaxy Tools developed for the Human Cell Atlas

Data retrieval from Single Cell data Repositories

ToolDescriptionReference
hca_matrix_downloaderHuman Cell Atlas Matrix Downloader retrieves expression matrices and metadata from the Human Cell Atlas.Regev et al. 2018
retrieve_scxaEBI SCXA Data Retrieval downloads expression matrices and metadata from the EBI Single Cell Expression Atlas (SCXA)Papatheodorou et al. 2018

10x files produced by these tools can be consumed by 10x reader modules in the tools below.

Visualisation

ToolDescriptionReference
ucsc_cell_browserUCSC Cell Browser displays single-cell clusterized data in an interactive web application.cells.ucsc.edu

Scanpy

Granular tools for accessing the main Scanpy functionalities.

ToolDescriptionReference
scanpy_read_10xScanpy Read10x into hdf5 object handled by scanpyWolf et al. 2018
scanpy_filter_genesScanpy FilterGenes based on counts and numbers of cells expressedWolf et al. 2018
scanpy_filter_cellsScanpy FilterCells based on counts and numbers of genes expressedWolf et al. 2018
scanpy_normalise_dataScanpy NormaliseData to make all cells having the same total expressionWolf et al. 2018
scanpy_find_variable_genesScanpy FindVariableGenes based on normalised dispersion of expressionWolf et al. 2018
scanpy_scale_dataScanpy ScaleData to make expression variance the same for all genesWolf et al. 2018
scanpy_run_pcaScanpy RunPCA for dimensionality reductionWolf et al. 2018
scanpy_compute_graphScanpy ComputeGraph to derive kNN graphWolf et al. 2018
scanpy_find_clusterScanpy FindCluster based on community detection on KNN graphWolf et al. 2018
scanpy_find_markersScanpy FindMarkers to find differentially expressed genes between groupsWolf et al. 2018
scanpy_run_tsneScanpy RunTSNE visualise cell clusters using tSNEWolf et al. 2018
scanpy_run_umapScanpy RunUMAP visualise cell clusters using UMAPWolf et al. 2018

Seurat

Granular tools for accessing the main Seurat functionalities. These tools received contributions from Maria Doyle @mblue9.

ToolDescriptionReference
seurat_read10xSeurat Read10x Loads 10x data into a serialized seurat R objectSatija et al. 2015
seurat_create_seurat_objectSeurat CreateSeuratObject create a Seurat objectSatija et al. 2015
seurat_filter_cellsSeurat FilterCells filter cells in a Seurat objectSatija et al. 2015
seurat_normalise_dataSeurat NormaliseData normalise dataSatija et al. 2015
seurat_find_variable_genesSeurat FindVariableGenes identify variable genesSatija et al. 2015
seurat_scale_dataSeurat ScaleData scale and center genesSatija et al. 2015
seurat_run_pcaSeurat RunPCA run a PCA dimensionality reductionSatija et al. 2015
seurat_find_clustersSeurat FindClusters find clusters of cellsSatija et al. 2015
seurat_find_markersSeurat FindMarkers find markers (differentially expressed genes)Satija et al. 2015
seurat_dim_plotSeurat Plot dimension reduction graphs the output of a dimensional reduction technique (PCA by default). Cells are colored by their identity class.Satija et al. 2015
seurat_run_tsneSeurat RunTSNE run t-SNE dimensionality reductionSatija et al. 2015
seurat_export_cellbrowserSeurat Export2CellBrowser produces files for UCSC CellBrowser import.Satija et al. 2015

Scater

Granular tools for accessing the main Scater functionalities. Normally used in combination with SC3.

ToolDescriptionReference
scater_read_10x_resultsScater read 10x data Loads 10x data into a serialized scater R objectMcCarthy et al. 2017
scater_calculate_qc_metricsScater CalculateQcMetrics based on expression values and experiment informationMcCarthy et al. 2017
scater_filterScater Filter cells and genes based on pre-calculated stats and QC metricsMcCarthy et al. 2017
scater_is_outlierScater DetectOutlier cells based on expression metricsMcCarthy et al. 2017
scater_calculate_cpmScater CalculateCPM from raw countsMcCarthy et al. 2017
scater_normalizeScater Normalise expression values by library size in log2 scaleMcCarthy et al. 2017

SC3

Granular tools for accessing the main SC3 functionalities. Normally used in combination with Scater.

ToolDescriptionReference
sc3_prepareSC3 Prepare a sc3 SingleCellExperiment objectKisilev et al. 2017
sc3_calc_consensSC3 Calculate Consensus from multiple runs of k-means clusteringKisilev et al. 2017
sc3_calc_transfsSC3 Calculate Transformations of distances using PCA and graph LaplacianKisilev et al. 2017
sc3_calc_biologySC3 DiffExp calculates DE genes, marker genes and cell outliersKisilev et al. 2017
sc3_estimate_kSC3 Estimate the number of clusters for k-means clusteringKisilev et al. 2017
sc3_calc_distsSC3 Calculate Distances between cellsKisilev et al. 2017
sc3_kmeansSC3 K-Means perform k-means clusteringKisilev et al. 2017

…plus all the great tools normally available at the usegalaxy.eu.

Contributors

Our Data Policy

Registered UsersUnregistered UsersFTP DataGDPR Compliance
For all different storage options, for detailed explanations of data retention policies, and ways how to increase your quota please refer to our dedicated storage site.Processed data will only be accessible during one browser session, using a cookie to identify your data. This cookie is not used for any other purposes (e.g. tracking or analytics). If UseGalaxy.eu service is not accessed for 90 days, those datasets will be permanently deleted.Any user data uploaded to our FTP server should be imported into Galaxy as soon as possible. Data left in FTP folders for more than 3 months, will be deleted.The Galaxy service complies with the EU General Data Protection Regulation (GDPR). You can read more about this on our Terms and Conditions.