January 2014 Galaxy Update

Galaxy Updates

Welcome to the January 2014 Galaxy Update, a monthly summary of what is going on in the Galaxy community. Galaxy Updates complement the Galaxy Development News Briefs which accompany new Galaxy releases and focus on Galaxy code updates.

Galaxy Community Hubs

December saw the launch of two Galaxy community hubs - one to describe Galaxy deployments, and one to document admin and deployment experiences with Galaxy. Thanks to everyone who has described their experiences in the first month.

Galaxy Community Log Board

Galaxy Community Log Board

The Galaxy Community Log Board is a place to share how you addressed a particular task in your Galaxy deployment. Log entries describe specific solutions to particular tasks, such as the details of what steps were taken to deploy Galaxy on particular platforms, or specific stumbling blocks when configuring Galaxy. Log entries are an easy way to help others (and learn from others) by sharing what you've already done. If you have figured out or documented how to do something, then this is the ideal forum for sharing that information with the community.

   Share your experience now   
C'mon, it will make you feel good!

The Galaxy Community Log Board describes how things can be done, and it complements the (also recently announced) Galaxy Deployments Catalog, which describes different Galaxy installations. Both the Log Board and the Deployments Catalog came out of discussions at the GCC2013 GalaxyAdmins Birds-of-a-Feather gathering. Thanks to everyone who attended that BoF and to everyone who taken the time to log their experiences so far.

Galaxy Deployment Catalog

Galaxy Deployment Catalog
All the cool researchers are doing it!

The Galaxy Deployment Catalog is a place to describe individual Galaxy deployments, and to learn how others in the community are implementing Galaxy in production environments.

If you have a Galaxy deployment then please take a few minutes to describe your instance to the community.

2013 Events Archive

2013 Events
200+ Talks, workshops, tutorials, ...
90+ Events
80% with slides/video
75% presented by the Galaxy Community

The Galaxy Events for 2013 have been moved to the Events Archive, where they can still be accessed. There are 160 items in the 2013 Events list. That count goes well over 200 items if you follow some links and include the 13 talks at Galaxy Day; and all the talks (25), posters (26), lightning talks (19) and Training Day Sessions (15) at GCC2013; and the 20+ items at ISMB 2013; and ... Furthermore, at least 3/4 of those items were presented by people outside the core Galaxy Team.

While those numbers are just plain cool, the really useful bit is that there are slides, exercises, and videos for over 4/5 of those presentation. If you have a question about Galaxy, chances are someone talked about it in 2013, and that you can find their slides in the Events Archive.

Many, many thanks to everyone who talked about Galaxy last year and for making their material available online.

New Papers

# Tag    # Tag    # Tag    # Tag
3 Cloud 2 Project 4 Tools 1 UsePublic
2 HowTo 1 RefPublic - UseCloud 2 Visualization
4 IsGalaxy 3 Reproducibility 2 UseLocal 18 Workbench
20 Methods 2 Shared 12 UseMain

52 papers were added to the Galaxy CiteULike Group in December. Some papers that may be particularly interesting to the Galaxy community:

This brings the total number of papers to 1330.

Who's Hiring

Please Help! Yes you!

The Galaxy is expanding! Please help it grow.

Got a Galaxy-related opening? Send it to outreach@galaxyproject.org and we'll put it in the Galaxy News feed and include it in next month's update.


GCC2014: June 30 - July 2, Baltimore

GCC2014: June 30 - July 2

The 2014 Galaxy Community Conference (GCC2014) will be held June 30 through July 2, at the Homewood Campus of Johns Hopkins University, in Baltimore, Maryland, United States.

Training Day Topic Voting, January 6-17

GCC2014 will start with a Training Day on June 30. As for GCC2013, the specific topics offered will be determined by you, the Galaxy Community. Topic nomination was open December 2 through December 20. The organizers are now collating the nominations into a single list that the community (yes, that means you) will vote on these topics January 6-17. The final schedule of topics and instructors will be published no later than February 10, when registration opens.

Watch the Galaxy Mailing Lists for announcements and links to the voting form.

Training Day is an opportunity to learn about all things Galaxy including using Galaxy, deploying and managing Galaxy, extending Galaxy, and Galaxy internals. There will be 4 parallel tracks, each with 3 sessions, with each of those sessions two and half hours long. That's 12 sessions and 30 hours of workshop material.



EMC Isilon

We are pleased to announce several new sponsorships for GCC2014. First, EMC Isilon will be a Gold Sponsor for the third year in a row, and SGI will a Gold Sponsor for the first time.

Also please welcome three new Silver Sponsors:

Globus Genomics GigaScience Journal
BioMed Central, The Open Access Publisher

The Platinum, Gold and Silver sponsors will have exhibit booths at the event. There are still Silver and Bronze sponsorships still available. Please contact the Organizers if your organization would like to help sponsor this event.


In 2014 we are also adding non-sponsor exhibit spaces in addition to the sponsor exhibits. This will significantly increase the size of the exhibit floor. Please contact the Organizers if your organization would like to have an exhibit space at GCC2014.

2013 Galaxy Day Report

Galaxy Day

The 2013 Galaxy Day event took place at Institut Curie in Paris, 4 December. Program and slides for all presentations are available online. A meeting summary is also available.

Other Events

There are several events of interest in the next few months. Also see the [Galaxy Events Google Calendar](http://bit.ly/gxycal) for details on other events of interest to the community.
Galaxy @ Plant and Animal Genome (PAG 2014) 2014 GMOD Meeting   Analyse Bioinformatique de séquences pour l'amélioration des plantes RNA-Seq & ChIP-Seq analysis course using Galaxy
Date Topic/Event Venue/Location Contact
January 11-15 Galaxy for NGS Data Analysis: A Hands-on Computer Demo Plant and Animal Genome XXII (PAG 2014), San Diego, California, United States Dave Clements, Anushka Brownley
Galaxy Cloudman: A Gentle Introduction to Data Analysis on the Cloud
Part of the GMOD Workshop
Dave Clements, Scott Cain
Plus 2 more talks and 4 posters See list
January 16-17 2014 GMOD Meeting San Diego, California, United States Dave Clements, Scott Cain
February 10-14 Analyse Bioinformatique de séquences pour l'amélioration des plantes Campus la Gaillarde, Montpellier, France Yolande OLIVIER
February 11-13 RNA-Seq & ChIP-Seq analysis course using Galaxy PRABI, Lyon, France Navratil V., Oger C., Veber P., Deschamps C., Perriere G.
February 12-15 Advances in Genome Biology and Technology (AGBT) Marco Island, Florida, United States Anton Nekrutenko
February 26-28 GMOD Malaysia 2014 Kuala Lumpur, Malaysia Amelia Ireland
February 27 Introduction to NGS Bioinformatics Nowgen Centre, Manchester, United Kingdom Jan Taylor
March 24-25 Galaxy Australasia Workshop Australian Synchrotron and Melbourne Convention Centre, Melbourne, Australia Registration
March 26-30 Galaxy toolset for Drosophila genomics
and one-on-one help in the Flybase Demonstrations Room
Drosophila Research Conference, San Diego, California, United States Anton Nekrutenko, Jen Jackson
May 27 Enabling Data Analysis with Galaxy CloudMan workshop MIPRO 2014, Opatija, Croatia Enis Afgan
June 9-10 Informatics on High Throughput Sequencing Data Workshop Toronto, Canada Francis Ouellette
June 30 - July 2 2014 Galaxy Community Conference (GCC2014) Homewood Campus of Johns Hopkins University, Baltimore, Maryland, United States Organizers

New Public Servers

One new server was added to the list of publicly accessible Galaxy servers last month.


CAPER is "An Interactive, Configurable, and Extensible Workflow-Based Platform to Analyze Data Sets from the Chromosome-centric Human Proteome Project" at Beijing Proteome Research Center, China.

The Chromosome-centric Human Proteome Project (C-HPP) aims to map and annotate the entire human proteome by the "chromosome-by-chromosome" strategy. As the C-HPP proceeds, the increasing volume of proteomic data sets presents a challenge for customized and reproducible bioinformatics data analyses for mining biological knowledge. To address this challenge CAPER 2.0 − an interactive, configurable and extensible workflow-based platform is developed for C-HPP data analyses.


In addition to the previous visualization functions of track-view and heatmap-view, CAPER 2.0 presents a powerful toolbox for C-HPP data analyses and also integrates Galaxy, allowing users to easily conduct their own C-HPP proteomic data analyses and visualization. CAPER 2.0 presents four specific workflows for finding missing proteins, mapping peptides to chromosomes for genome annotation, integrating peptides with transcription factor binding sites from ENCODE data sets, and functionally annotating proteins.

See "CAPER 2.0: An Interactive, Configurable, and Extensible Workflow-Based Platform to Analyze Data Sets from the Chromosome-centric Human Proteome Project" by Wang, et al., J. Proteome Res., DOI: 10.1021/pr400795c, Publication Date (Web): November 22, 2013.

Tutorials and support ([Dong Li](mailto:lidong DOT bprc AT foxmail DOT com), Dan Wang) are offered. CAPER 2.0 is sponsored by Beijing Proteome Research Center (BPRC), the Human Liver Proteome Project (HLPP), and the Chromosome-centric Human Proteome Project (C-HPP)

Galaxy Distributions

The most recent Galaxy Distribution was November 04, 2013.

The latest version of CloudMan was released in July.

ToolShed Contributions

Galaxy ToolShed

New Repositories in the Galaxy Project ToolShed

  • redup: Remove exact duplicate reads from paired-end fastq files
  • rgttable: Generate FPKM tables from cuffdiff read_group_tracking files
  • dgidb_annotator: Annotate tabular file with information from Drug-Gene Interaction DB
  • gatk2: The Genome Analysis Toolkit in Version 2
  • table_annovar: Annotate VCF file using ANNOVAR annotations to create a tabular file that can be filtered
  • spectral_hk: Convert InChIs to spectral hashkeys.
  • snpeff: snpEff tools from Pablo Cingolani
  • split_file_on_column: Split file on specific column; every group is saved in a separate file.

Other News