The October 2015 Galactic News!
Welcome to the October 2015 Galactic News, a summary of what is going on in the Galaxy community. It's been, and will be, a busy few months. If you have anything to include in the October News, please send it to Galaxy Outreach.
60 new papers referencing, using, extending, and implementing Galaxy were added to the Galaxy CiteULike Group in September. Highlights include:
WiseScaffolder: an algorithm for the semi-automatic scaffolding of Next Generation Sequencing data BMC Bioinformatics, Vol. 16, No. 1. (03 September 2015), 281, doi:10.1186/s12859-015-0705-y, by Gregory K. Farrant, Mark Hoebeke, Frédéric Partensky, Gwendoline Andres, Erwan Corre, Laurence Garczarek
Discriminating somatic and germline mutations in tumor DNA samples without matching normals Genome Research, Vol. 25, No. 9. (01 September 2015), pp. 1382-1390, doi:10.1101/gr.183053.114. Saskia Hiltemann, Guido Jenster, Jan Trapman, Peter van der Spek, Andrew Stubbs
VING: a software for visualization of deep sequencing signals BMC Research Notes, Vol. 8, No. 1. (07 September 2015), 419, doi:10.1186/s13104-015-1404-5. Marc Descrimes, Yousra B. Zouari, Maxime Wery, Rachel Legendre, Daniel Gautheret, Antonin Morillon
Hammock: A Hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets Bioinformatics (05 September 2015), btv522, doi:10.1093/bioinformatics/btv522. Adam Krejci, Theodore Hupp, Matej Lexa, Borivoj Vojtesek, Petr Muller
The new papers were tagged with:
The next GalaxyAdmins meetup on October 15 will feature David Kovalic speaking on Galaxy at Analome – Outreach, Opportunity and Challenges: At Analome Galaxy has found another home and we like it for numerous reasons. In this talk we will present the multifaceted role Galaxy plays at Analome and how it supports ongoing bioinformatics projects while at the same time providing opportunities for social good.
GalaxyAdmins is a special interest group for Galaxy community members who are responsible for Galaxy installations.
Hope to see you there.
The 2016 Galaxy Community Conference (GCC2016) features two full days of training on June 26-27. Each day will have multiple tracks, with each track featuring several sessions throughout the day. GCC2016 will be held at Indiana University in Bloomington, Indiana, United States.
GCC2016 Training topics are nominated by you, the Galaxy Community. Please give some thought to what topics you would like to see offered, or that you would like to offer, and then submit them as topic nominations. Any topic of interest to the Galaxy Community can be nominated and you are encouraged to nominate more than one topic. If you are looking for ideas, see
- Topics nominated for GCC2015
- Topics nominated for GCC2014
- Topics nominated for GCC2013
- The Galaxy Events page, which lists what training the community is offering outside of GCCs.
- Topics that have already been nominated for GCC2016
Nominated topics will be published as they come in. Topic nomination closes Friday, October, 16.. Topics will be compiled into a uniform list by the GCC2016 Organizing Committee, and topics will be voted on by the Galaxy Community from October 26 through November 6.
Topics will then be selected and scheduled based on topic interest, and the organisers' ability to confirm instructors for each session. Some very popular sessions may be scheduled more than once. The final schedule will be posted before registration opens.
GCC2017 needs a host!
(and a logo)
We are now seeking proposals from organizations interested in hosting GCC2017.
GCC draws 200+ participants from the full range of data-intensive life science research. Participants come from around the world, from all career stages, and do research spanning the tree of life. Universities, hospitals and medical schools, research organizations, and industry are all represented, including several of the largest and most influential research organizations in the world.
Hosts get copious (and enthusiastic) organizational support from Galaxy's Outreach Team (who have been helping to organize GCC since 2011), and even more copious gratitude (even adulation!) from the Galaxy Community.
See the full call for host proposals for more details on what should be covered in proposals. Proposals are due by the end of
30 October 30 November 2015. If you are interested in possibly hosting GCC2017 then please contact Galaxy Outreach with any questions.
Hoping to work with you in 2017!
A Galaxy tool developers to hack on dataset collections hackathon was held on September 17th and 18th.
Dataset collections enable MapReduce style workflows in Galaxy and have come a long way over the last year+. Several groups are using dataset collections and newer tools to express workflows of various degrees of complexity that were not possible in Galaxy before.
See the full report details.
|Designates a training event offered by GTN member(s)|
The Galaxy is expanding! Please help it grow.
- The Galaxy Project is hiring software engineers and post-docs
1 new publicly accessible Galaxy servers was added in September, bringing the total dangerously close to 80.
- Available tools:
- MicRhoDE workflow: places query sequences into MicRhoDE tree (type-1 rhodopsin phylogeny).
- Renkan Blast: blast query sequences on 18S and 28S radiolaria databases (blastn).
- WiseScaffolder: genome scaffolding of pre-assembled contigs using mate-pair data.
- HECTAR: HEterokont subCellular TARgeting is a statistical prediction method designed to assign proteins to five different categories of subcellular targeting: Signal peptides, type II signal anchors, chloroplast transit peptides, mitochondrion transit peptides and proteins which do not possess any N-terminal target peptide.
- Available tools:
- User support:
- For any question or request for tools or account, send an email at ABiMS Support
There are two active discussions happening on the Galaxy-Training mailing list.
We are considering offering a Software Carpentry / Data Carpentry Train the Trainer workshop immediately before GCC2016. This workshop would run for two days, and accommodate 24 people.
This idea came up at the GCC2015 GTN BoF. Software and Data Carpentry (S/DC) teach scientists core essentials of programming and data management. Their model is to train and certify trainers, and then provide a common curriculum and teaching infrastructure that enables those trainers to teach scientists how to program and manage data well. If you think we should (or should not) offer this training at GCC2016, please contribute to the [discussion on the mailing list. ## Galaxy Training Network and GOBLET
GTN became a member of the Global Organisation for Bioinformatics & Training (GOBLET) earlier this year. The 2015 GOBLET Meeting will be held 18-20 November, in Cape Town, South Africa. Fourie Joubert of the Centre for Bioinformatics and Computational Biology at the University of Pretoria will be representing GTN at GOBLET 2015.
If you have anything you'd like to have discussed at the meeting, then please join this conversation.
The Galaxy Project now has a Developer Code of Conduct.
From the introduction: This code of conduct outlines our expectations for participants within the Galaxy community, as well as steps to reporting unacceptable behavior. We are committed to providing a welcoming and inspiring community for all and expect our code of conduct to be honored. Anyone who violates this code of conduct may be banned from the community.
The Code of Conduct includes
- Guidelines on interaction,
- Commitment to diversity,
- and procedures for addressing issues if they arrise.
Thanks to everyone who helped put these together.
The July 2015 Galaxy Release is out! Release Notes v 15.07
- Interactive Environments - The interactive environments (IE) framework features several updates this release and Galaxy is now distributed with a new IE for RStudio implemented by Helena Rasche.
- Workflow Editor Enhancements - The workflow editor has been significantly revamped - it now uses newer tool form components and allow construction of workflows with tools explicitly producing output collections.
- Policies for Committers and Pull Requests - The process for adding committers to the project and fielding pull requests has been formalized and documented in the source code. Three new committers have been added to the project - Björn Grüning, Nicola Soranzo, and Helena Rasche.
- And much more
We just released an update to Galaxy CloudMan. CloudMan offers an easy way to get a personal and completely functional instance of Galaxy in the cloud in just a few minutes, without any manual configuration or imposed quotas. Once running, you have complete control over Galaxy, including the ability to install new tools.
This is the most complete update of the Galaxy on the Cloud platform to date with all the cloud components being built from the ground up. The following is a summary of the major changes:
- Galaxy on the Cloud: all new galaxy and indices file systems, with Galaxy 15.07 release, ~200 preinstalled Galaxy tools, and nearly half a terrabyte of updated reference genome data.
- Galaxy on the Cloud: switched to a different Cloud Launch application, available at https://launch.usegalaxy.org/
- Galaxy on the Cloud: Integrated Galaxy IPython Interactive Environments
- CloudMan: Creating cluster shares is now fast (~30 seconds vs. hours)
- Created a wealth of new documentation, including a description of the all new automated method for building private/custom instances.
Three security vulnerabilities were announced and patched in August. The Galaxy Committers have released a number of fixes which have been rolled in to affected versions of Galaxy dating back to the 14.10 release.
- Galaxy XSS Reflection Vulnerability
- Galaxy LDAP Authentication Vulnerability
- Tool Shed Unauthorized Upload Vulnerability
BioBlend 0.6.0 and 0.6.1 BioBlend version 0.6.1 was released in July. BioBlend is a python library for interacting with CloudMan and the Galaxy API(CloudMan offers an easy way to get a personal and completely functional instance of Galaxy in the cloud in just a few minutes, without any manual configuration.) From the release CHANGELOG:
Pulsar Pulsar 0.5.0 was released in May. Pulsar is a Python server application that allows a Galaxy server to run jobs on remote systems (including Windows) without requiring a shared mounted file systems. Unlike traditional Galaxy job runners - input files, scripts, and config files may be transferred to the remote system, the job is executed, and the results are transferred back to the Galaxy server - eliminating the need for a shared file system.
blend4j v0.1.2 blend4j v0.1.2 was released in December 2014. blend4j is a JVM partial reimplemenation of the Python library bioblend for interacting with Galaxy, CloudMan, and BioCloudCentral.
|Share your training resources and experience now||Share your experience now|
1 new Galaxy Training Network member was added in September:
- The Centre for Bioinformatics and Computational Biology, University of Pretoria The Centre offers training in general usage, analysing and comparing protein and nucleic acid sequences, annotating genomes, gene expression patterns, investigating molecular structures, and simulating metabolic pathways. Please welcome the first GTN member in Africa. (Now we just need Antartica.)
- From Robert M Flight: Southeast Center for Integrated Metabolomics (SECIM) has their Galaxy Project tools on GitHub. Can follow development, add to own.
- From Brian Nosek and many others: Center for Open Science gives grant to Titus Brown and team to connect Galaxy Project and the Open Science Framework. See the full announcement.
- From Globus: Love (or like!) Globus? Vote for us in the HPCWire Readers' Choice Awards (#16 - Best Data-Intensive System)
- From Björn Grüning: Brand new Galaxy Interactive Environment documentation: Enterprise Deployments, Docker on a different host...
- From Keywan Hassani-Pak: We encountered a big problem using bwa-eXpress in Galaxy for transcript abundance estimation!
- From Saket Choudhary: GalDrive: Pipeline for comparative identification of driver mutations using Galaxy framework
- From Keith Bradnam
Slides from a short talk:
[13 questions you may have about using Galaxy](http://buff.ly/1QhSwk1)
- From Markus Wolfien TRAPLINE: A standardized and automated pipeline for RNA sequencing data analysis and evaluation
- From GigaScience Supporting Galaxy BLAST+ wrappers, archived at the point of publication hosted in GigaDB dx.doi.org/10.5524/100149
- From Björn Grüning: New Galaxy Docker images! Enjoy all the new goodies!