← Back to news

New Preprint: PathoGFAIR, a collection of FAIR and adaptable (meta)genomics workflows for (foodborne) pathogens detection and tracking

We are happy to share our new preprint bioRXiv: PathoGFAIR: a collection of FAIR and adaptable (meta)genomics workflows for (foodborne) pathogens detection and tracking.

Graphical abstract showing in which pathogen investigation step the workflows are applied, on the left side. Then in the middle, is a summary of all 5 workflows forming PathoGFAIR (Preprocessing, Taxonomy Profiling, Gene-based Pathogen Identification and Allele-based Pathogen Identification). Finally on the right side, the workflows features are highlighted

Background

Food contamination by pathogens poses a global health threat, affecting an estimated 600 million people annually. During a foodborne outbreak investigation, microbiological analysis of food vehicles detects responsible pathogens and traces contamination sources. Metagenomic approaches offer a comprehensive view of the genomic composition of microbial communities, facilitating the detection of potential pathogens in samples. Combined with sequencing techniques like Oxford Nanopore sequencing, such metagenomic approaches become faster and easier to apply. A key limitation of these approaches is the lack of accessible, easy-to-use, and openly available pipelines for pathogen identification and tracking from (meta)genomic data.

PathoGFAIR

PathoGFAIR is a collection of 5 Galaxy-based FAIR workflows employing state-of-the-art tools to detect and track pathogens from metagenomic Nanopore sequencing. Although initially developed for foodborne pathogen data, the workflows can be applied to any metagenomic Nanopore pathogenic data. PathoGFAIR incorporates visualisations and reports for comprehensive results.

We tested PathoGFAIR on 130 benchmark samples containing different pathogens from multiple hosts under various experimental conditions. Workflows have successfully detected and tracked expected pathogens at least at the species rank in both pathogen-isolated and non-pathogen-isolated samples with sufficient Colony-forming unit and Cycle Threshold values.

PathoGFAIR detects the pathogens or the subspecies of the pathogens in any sample, regardless of whether the sample is isolated or incubated before sequencing.

Importantly, PathoGFAIR is easy to use and can be straightforwardly adapted and extended for other types of analysis and sequencing techniques, making it usable in various pathogen detection scenarios.

PathoGFAIR Webpage

Workflows

Workflow Name WorkflowHub Dockstore Galaxy Servers
Nanopore Preprocessing (v 0.1) ID 1061 v 0.1 nanopore-pre-processing/main:v0.1 European Galaxy Server, United States Galaxy Server, Australian Galaxy Server
Taxonomy Profiling and Visualization with Krona (v 0.1) ID 1059 v 0.1 taxonomy-profiling-and-visualization-with-krona/main:v0.1 European Galaxy Server, United States Galaxy Server, Australian Galaxy Server
Gene-based Pathogen Identification (v 0.1) ID 1062 v 0.1 gene-based-pathogen-identification/main:v0.1 European Galaxy Server, United States Galaxy Server, Australian Galaxy Server
Allele-based Pathogen Identification (v 0.1) ID 1063 v 0.1 allele-based-pathogen-identification/main:v0.1 European Galaxy Server, United States Galaxy Server, Australian Galaxy Server
Pathogen Detection PathoGFAIR Samples Aggregation and Visualisation (v 0.1) ID 1060 v 0.1 pathogen-detection-pathogfair-samples-aggregation-and-visualisation/main:v0.1 European Galaxy Server, United States Galaxy Server, Australian Galaxy Server
PathoGFAIR 5in1 (v 0.1) Soon Soon European Galaxy Server, United States Galaxy Server, Australian Galaxy Server

Training Material

To assist users in understanding and using the workflows, we have provided extensive training material as a tutorial via the Galaxy Training Network (GTN).

This tutorial has been used as training support in multiple workshops and conferences' training sessions:

  1. A 1-day workshop early 2023 in FHNW in Muttenz, Switzerland that received highly positive feedback
  2. A 1-day session in several High-Throughput Data Analysis with Galaxy workshops delivered in Freiburg with the next iteration will be on July 26th 2024.
  3. A 2-hour workshop during the Galaxy Community Conference GCC 2024.

Funding

The PathoGFAIR project was funded via EOSC life industry call 2021 (news about the award)