
An update on Galaxy Genome Annotation

Galaxy as a platform for the annotation of genomes


As presented at GCC2023 and EGD2023, many exciting new developments have been made in Galaxy for the annotation of genomes!

This work was done within the GGA community of practice, which you are very welcome to join if you are interested in genome annotation within Galaxy (as a user, developer, trainer, ...).

New tools!

We've worked hard to integrate many state-of-the-art genome annotation tools into Galaxy! Here's the complete, categorized list:

Repeated elements detection

Tool            Version      UseGalaxy.eu
RepeatModeler   2.0.4        link
RepeatMasker    4.1.5        link
Red             2018.09.10   link

Gene detection for prokaryotes

Tool     Version   UseGalaxy.eu
Prokka   1.14.6    link
Bakta    1.8.2     link

Gene detection for eukaryotes

Tool          Version   UseGalaxy.eu
BRAKER3       3.0.3     link
BRAKER2       2.1.6     link
Helixer       0.3.2     link
Funannotate   1.8.15    link
MAKER         2.31.11   link
TSEBRA        1.1.1     Available soon

Functional annotation

Tool            Version   UseGalaxy.eu
antiSMASH       6.1.1     link
InterProScan    5.59-91   link
EggNOG-Mapper   2.1.8     link
DeepSig         1.2.5     link

Evaluation methods

Tool                           Version   UseGalaxy.eu
Genome annotation statistics   0.8.4     link
BUSCO                          5.4.6     link

Annotation comparison

Tool                  Version   UseGalaxy.eu
AEGeAN                0.16.0    link
Funannotate compare   1.8.15    link

General alignment tools

Tool        Version   UseGalaxy.eu
BLAST       2.14.1    link
Exonerate   2.4.0     link
Diamond     2.0.15    link
Miniprot    0.12      link

Long non-coding RNA prediction

Tool     Version   UseGalaxy.eu
FEELnc   0.2.1     link

Visualisation

Tool           Version   UseGalaxy.eu
JBrowse        1.16.11   link
Circos         0.69.8    link
GeneNoteBook   0.4.8     link

New workflows!

Over the last few years we have collected many workflows for different use cases. All the GTN tutorials below, for example, are also available as workflows. However, they can be hard to find, and as a consequence people may be tempted to rebuild them from scratch. To fix that, we have started to submit our workflows to the Intergalactic Workflow Commission (IWC), adding workflow tests and publishing them to workflowhub.eu and Dockstore for broader dissemination.

You can see the first IWC workflow from the GGA community at workflowhub or on GitHub as part of the IWC organisation.

Next up are the functional annotation workflow and our GTN workflows - see below.

New visualisations!

Galaxy is much more than just a workflow system, and one area that we are trying to improve is visualisation!

Check out our tutorials about this:

JBrowse

Circos

We also added support for GeneNoteBook, a nice way to present functional annotation data as an easy-to-navigate website:

GeneNoteBook

New and updated training material!

We have updated and added new training material for everyone who wants to learn genome annotation. If you don't know where to start, check out our recommended learning path: from "Introduction to Genome Annotation" to "Refining Genome Annotations with Apollo", it covers everything that you need to know about genome annotation.

Alternatively, you can jump right into your favorite topic:

Manual curation and collaborative annotation made easy

Apollo is a web service for collaborative annotation, a “Google Docs of genome annotation” if you like to think in Google terms. It allows real-time, collaborative annotation, editing, and review of your favorite genome. Genome annotation requires manual curation and review, and many hands are needed: biology is a team sport! Apollo is a wonderful tool to support your team.

Galaxy Apollo integration

To make your life easy, we worked closely with the Apollo team and bridged Galaxy with Apollo. You can do automatic annotation with Galaxy, using its powerful workflow system, and afterwards send the data to Apollo for manual curation and collaborative editing! A typical Galaxy-Apollo use case might look like this:

  1. Fetch Data
  2. Analyse in Galaxy
  3. Send to Apollo
  4. Collaboratively Annotate → repeat
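
For readers who prefer scripting to clicking, steps 1 to 3 above could be driven through the Galaxy API. The following is only a minimal sketch, not the GGA team's own method: it assumes the BioBlend Python library (not mentioned in this post), an annotation workflow already imported into your account that ends with the Galaxy Apollo tools, and a workflow whose genome input sits at step index "0". The server URL, API key, file name and workflow ID in the usage comment are placeholders.

```python
"""Sketch: fetch data, analyse in Galaxy, and hand over to Apollo via a workflow."""


def workflow_inputs(dataset_id, step_index="0"):
    """Map one history dataset onto a workflow input step.

    The default step index "0" is an assumption; check your own workflow.
    """
    return {step_index: {"src": "hda", "id": dataset_id}}


def run_annotation(gi, genome_path, workflow_id, history_name="genome-annotation"):
    """Upload a genome and start an annotation workflow on it.

    `gi` is expected to behave like bioblend.galaxy.GalaxyInstance.
    """
    # 1. Fetch data: create a history and upload the genome FASTA into it.
    history = gi.histories.create_history(name=history_name)
    upload = gi.tools.upload_file(genome_path, history["id"])
    dataset_id = upload["outputs"][0]["id"]

    # 2. Analyse in Galaxy (and 3. send to Apollo, provided the workflow
    #    itself ends with the Apollo tools): invoke the workflow.
    invocation = gi.workflows.invoke_workflow(
        workflow_id,
        inputs=workflow_inputs(dataset_id),
        history_id=history["id"],
    )
    return {
        "history_id": history["id"],
        "dataset_id": dataset_id,
        "invocation_id": invocation["id"],
    }


# Usage (placeholders, requires `pip install bioblend` and a Galaxy API key):
#
#   from bioblend.galaxy import GalaxyInstance
#   gi = GalaxyInstance(url="https://usegalaxy.eu", key="YOUR_API_KEY")
#   print(run_annotation(gi, "genome.fasta", workflow_id="YOUR_WORKFLOW_ID"))
```

Step 4, the collaborative curation itself, still happens interactively in Apollo. Passing the Galaxy connection in as an argument keeps the function easy to try against a different server or a stub.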


The Earth BioGenome Project

Building on the assembly workflows that have been developed as part of the VGP/BGE/ERGA (in short: EBP), we are planning to deliver high-quality workflows and interactive tools for your genome annotation efforts. Ideally, you will be able to run assembly and annotation workflows one after the other 😊

What's next?

Is any tool or workflow missing from our catalog? Please tell us and we will do our best to make it available!

We already have these items on our TODO list, so stay tuned:

  • JBrowse2
  • GALBA
  • OMArk
  • BUSCO update
  • InterProScan update

Join us on our chat channel to keep in touch!