Human Genetics working group T2 update
Expanding Galaxy’s ability to analyze protected human data
Human Genetics
This domain-focused working group emphasizes applying Galaxy in human research.
- Google Drive
- Goals Slide
- Leadership: Enis Afgan
The Human Genetics working group focuses on expanding Galaxy’s ability to analyze protected human data. Currently, this is made possible via AnVIL, where Galaxy is deployed in a FedRAMP-certified environment alongside 3 PB of data. This is the only such environment in the world where anyone can sign up and start working with this data within minutes.
For the term starting in May 2021 and running through August 2021, the working group focused on the following five objectives:
- Add support for Galaxy Interactive tools (GxITs). GxITs allow rich applications, such as IOBIO or HiGlass, to be used within Galaxy, effectively expanding analysis and visualization capabilities beyond what is available from installed tools. However, they are complicated to set up and require special handling. In partnership with the Deployment and Cancer Informatics working groups, the necessary components for deploying GxITs have been codified. Some restrictions within AnVIL still need to be worked out with the underlying platform, such as running GxITs on a subpath instead of a subdomain and having the AnVIL infrastructure proxy the requests. These will be the goals for the next term.
- Galaxy framework enhancements for AnVIL. AnVIL supports additional applications besides Galaxy, allowing users to switch between batch analysis in Terra and interactive analysis in RStudio/Bioconductor or Galaxy. Being able to move data between those applications is critical. Previously, we added the ability to export complete histories from Galaxy to Terra, but the exported histories were cumbersome to use in other applications. This term, we added a tool that allows individual datasets to be exported from Galaxy into Terra.
- Presence at the Galaxy Community Conference. If you were interested in using Galaxy on AnVIL or learning how it works, GCC was the place to learn! But it’s not too late: videos and abstracts are available covering an overview of AnVIL, the deployment architecture for working in a regulated environment, and enabling GxITs.
- Explore analysis costing and cost estimates. Cost estimation remains one of the largest questions researchers have as they look to migrate their analyses to the cloud. We have embarked on an effort to review historic tool usage data and identify the most popular tools, then benchmark these tools in a controlled environment, and finally estimate costs. You can check out an interactive dashboard with the historic and benchmarking data on ObservableHQ.
- Support operations of Galaxy on AnVIL. Keeping AnVIL in lockstep with the general Galaxy releases is critical for disseminating the latest features developed by the broader Galaxy community. Each new Galaxy release therefore requires live testing and validation on AnVIL specifically. AnVIL is currently running Galaxy 21.05 with all the settings captured in an AnVIL version of the Galaxy Helm chart: GalaxyKubeMan.
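To make the cost estimation objective more concrete, here is a minimal Python sketch of the three steps it describes: rank tools by historic usage, attach benchmark figures, and estimate cost. It is not the working group's actual method. The tool names, usage counts, runtimes, and the per-vCPU-hour price are all hypothetical placeholders, and the model is a simplifying assumption that covers compute time only, ignoring memory, storage, and data transfer.

```python
from dataclasses import dataclass


@dataclass
class ToolBenchmark:
    """Usage and benchmark figures for one tool (all values hypothetical)."""
    tool: str
    runs_per_month: int        # from historic usage data
    mean_runtime_hours: float  # from benchmarking in a controlled environment
    vcpus: int                 # vCPUs allocated per run


def most_popular(benchmarks, top_n):
    """Return the top_n tools ranked by historic run count, highest first."""
    return sorted(benchmarks, key=lambda b: b.runs_per_month, reverse=True)[:top_n]


def monthly_cost(b, price_per_vcpu_hour):
    """Estimate monthly compute cost as runs x runtime x vCPUs x unit price."""
    return b.runs_per_month * b.mean_runtime_hours * b.vcpus * price_per_vcpu_hour


# Placeholder data; not taken from the working group's dashboard.
benchmarks = [
    ToolBenchmark("tool_a", runs_per_month=100, mean_runtime_hours=0.5, vcpus=4),
    ToolBenchmark("tool_b", runs_per_month=400, mean_runtime_hours=0.25, vcpus=2),
    ToolBenchmark("tool_c", runs_per_month=20, mean_runtime_hours=2.0, vcpus=8),
]

ASSUMED_PRICE_PER_VCPU_HOUR = 0.05  # assumed placeholder rate, not a real quote

for b in most_popular(benchmarks, top_n=2):
    cost = monthly_cost(b, ASSUMED_PRICE_PER_VCPU_HOUR)
    print(f"{b.tool}: ~${cost:.2f}/month")
```

A real estimate would replace the placeholder records with measured usage and benchmark data and use the cloud provider's published pricing for the machine types involved.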