January 2014 Galaxy Update
Welcome to the January 2014 Galaxy Update, a monthly summary of what is going on in the Galaxy community. Galaxy Updates complement the Galaxy Development News Briefs which accompany new Galaxy releases and focus on Galaxy code updates.
December saw the launch of two Galaxy community hubs - one to describe Galaxy deployments, and one to document admin and deployment experiences with Galaxy. Thanks to everyone who has described their experiences in the first month.
The Galaxy Community Log Board is a place to share how you addressed a particular task in your Galaxy deployment. Log entries describe specific solutions to particular tasks, such as the details of what steps were taken to deploy Galaxy on particular platforms, or specific stumbling blocks when configuring Galaxy. Log entries are an easy way to help others (and learn from others) by sharing what you've already done. If you have figured out or documented how to do something, then this is the ideal forum for sharing that information with the community.
The Galaxy Community Log Board describes how things can be done, and it complements the (also recently announced) Galaxy Deployments Catalog, which describes different Galaxy installations. Both the Log Board and the Deployments Catalog came out of discussions at the GCC2013 GalaxyAdmins Birds-of-a-Feather gathering. Thanks to everyone who attended that BoF and to everyone who taken the time to log their experiences so far.
The Galaxy Deployment Catalog is a place to describe individual Galaxy deployments, and to learn how others in the community are implementing Galaxy in production environments.
If you have a Galaxy deployment then please take a few minutes to describe your instance to the community.
|200+||Talks, workshops, tutorials, ...|
|75%||presented by the Galaxy Community|
The Galaxy Events for 2013 have been moved to the Events Archive, where they can still be accessed. There are 160 items in the 2013 Events list. That count goes well over 200 items if you follow some links and include the 13 talks at Galaxy Day; and all the talks (25), posters (26), lightning talks (19) and Training Day Sessions (15) at GCC2013; and the 20+ items at ISMB 2013; and ... Furthermore, at least 3/4 of those items were presented by people outside the core Galaxy Team.
While those numbers are just plain cool, the really useful bit is that there are slides, exercises, and videos for over 4/5 of those presentation. If you have a question about Galaxy, chances are someone talked about it in 2013, and that you can find their slides in the Events Archive.
Many, many thanks to everyone who talked about Galaxy last year and for making their material available online.
52 papers were added to the Galaxy CiteULike Group in December. Some papers that may be particularly interesting to the Galaxy community:
- "Galaxy tools to study genome diversity" by Bedoya-Reina, et al., GigaScience, Vol. 2, No. 1. (2013), 17, doi:10.1186/2047-217x-2-17
- "Detection of candidate tumor driver genes using a fully integrated Bayesian approach" by Yang, et al., Statistics in Medicine (1 December 2013), doi:10.1002/sim.6066
- "In silico to in vivo splicing analysis using splicing code models" by Gazzara, et al., Methods (December 2013), doi:10.1016/j.ymeth.2013.11.006
This brings the total number of papers to 1330.
The Galaxy is expanding! Please help it grow.
Integrated Data Analysis for Improved Understanding of Disease Life Sciences studentship, Queen Mary University of London
Stage Master2 "Annotation du transcriptome de l’esturgeon" Bordeaux, France
- Informatics Software Support Specialist at the University of Minnesota, United States
- CDD Gif-sur-Yvette NGS et Variation Structurales UMR de Génétique Végétale, INRA-Université Paris Sud-CNRS
- Statistical Genomics Postdoc opening in the Makova lab at Penn State
- The Galaxy Project is hiring software engineers and post-docs, including positions at George Washington University.
GCC2014 will start with a Training Day on June 30. As for GCC2013, the specific topics offered will be determined by you, the Galaxy Community. Topic nomination was open December 2 through December 20. The organizers are now collating the nominations into a single list that the community (yes, that means you) will vote on these topics January 6-17. The final schedule of topics and instructors will be published no later than February 10, when registration opens.
Watch the Galaxy Mailing Lists for announcements and links to the voting form.
Training Day is an opportunity to learn about all things Galaxy including using Galaxy, deploying and managing Galaxy, extending Galaxy, and Galaxy internals. There will be 4 parallel tracks, each with 3 sessions, with each of those sessions two and half hours long. That's 12 sessions and 30 hours of workshop material.
Also please welcome three new Silver Sponsors:
- Globus Genomics, for the 3rd year
- GigaScience Journal and BioMed Central, for the 2nd year
- ABgenomica, for the first time.
The Platinum, Gold and Silver sponsors will have exhibit booths at the event. There are still Silver and Bronze sponsorships still available. Please contact the Organizers if your organization would like to help sponsor this event.
In 2014 we are also adding non-sponsor exhibit spaces in addition to the sponsor exhibits. This will significantly increase the size of the exhibit floor. Please contact the Organizers if your organization would like to have an exhibit space at GCC2014.Galaxy Events Google Calendar for details on other events of interest to the community.
One new server was added to the list of publicly accessible Galaxy servers last month.
The Chromosome-centric Human Proteome Project (C-HPP) aims to map and annotate the entire human proteome by the "chromosome-by-chromosome" strategy. As the C-HPP proceeds, the increasing volume of proteomic data sets presents a challenge for customized and reproducible bioinformatics data analyses for mining biological knowledge. To address this challenge CAPER 2.0 − an interactive, configurable and extensible workflow-based platform is developed for C-HPP data analyses.
In addition to the previous visualization functions of track-view and heatmap-view, CAPER 2.0 presents a powerful toolbox for C-HPP data analyses and also integrates Galaxy, allowing users to easily conduct their own C-HPP proteomic data analyses and visualization. CAPER 2.0 presents four specific workflows for finding missing proteins, mapping peptides to chromosomes for genome annotation, integrating peptides with transcription factor binding sites from ENCODE data sets, and functionally annotating proteins.
See "CAPER 2.0: An Interactive, Configurable, and Extensible Workflow-Based Platform to Analyze Data Sets from the Chromosome-centric Human Proteome Project" by Wang, et al., J. Proteome Res., DOI: 10.1021/pr400795c, Publication Date (Web): November 22, 2013.
Tutorials and support (Dong Li, Dan Wang) are offered. CAPER 2.0 is sponsored by Beijing Proteome Research Center (BPRC), the Human Liver Proteome Project (HLPP), and the Chromosome-centric Human Proteome Project (C-HPP)
- redup: Remove exact duplicate reads from paired-end fastq files
- rgttable: Generate FPKM tables from cuffdiff read_group_tracking files
- dgidb_annotator: Annotate tabular file with information from Drug-Gene Interaction DB
- gatk2: The Genome Analysis Toolkit in Version 2
- table_annovar: Annotate VCF file using ANNOVAR annotations to create a tabular file that can be filtered
- spectral_hk: Convert InChIs to spectral hashkeys.
- snpeff: snpEff tools from Pablo Cingolani
- split_file_on_column: Split file on specific column; every group is saved in a separate file.
- Take the ABRF DNA Sequencing Research Group survey on nucleic acid quantification methods & best practices
- The first 30 minutes of the "Galaxy Code and Storage Architecture" GCC2013 Training Day session is now on Vimeo and YouTube. Thanks to Samuel Lampa for recording and making this available.
- http://gmod.org/wiki/|GMOD is looking into running a GMOD workshop in Malaysia, February 26-28 -- please email firstname.lastname@example.org if you're interested in attending.
- Aussie researchers can now launch their own Galaxy instance on Nectar. See the guide at the Genome Virtual Lab