March 2016 Galaxy News

Galaxy News

Welcome to the March 2016 Galactic News, a summary of what is going on in the Galaxy community.

If you have anything to include in the next News, please send it to [Galaxy Outreach](mailto:outreach AT galaxyproject DOT org).

GCC2016

2016 Galaxy Community Conference

GCC2016 will be held June 25-29 at Indiana University in Bloomington, Indiana, United States. This will be the 7th annual gathering of the Galaxy community, and we are expecting over 200 participants again this year. The 2016 Galaxy Community Conference includes 2 days of hackathons, 2 days of training, and a two day meeting featuring accepted presentations, keynotes, poster sessions, the new Visualization Showcase and Software Demo sessions, lightning talks, birds-of-a-feather meetups, and plenty of networking.

GCC2016 Early Registration

   Register Now   

Early registration for GCC2016 is now open. Registration costs depend on which events you register for, your career stage & affiliation, and when you register. Early bird registration ends May 20 and is up to 40% less than regular registration rates. Early bird registration starts at less than $45 / day for students and postdocs, and at $65 / day for other attendees from non-profits.

You can also sign up for conference housing during registration.

You are strongly encouraged to review the training and housing options before beginning the registration process.

GCC2016 Abstract Submission Deadlines!

   Submit an abstract   

The deadline for oral presentation abstracts is March 25, 2016. Abstracts will be reviewed and submitters will be notified of acceptance status no later than April 11, 2016.

Poster and Demo abstract submission closes May 20, 2016 (or earlier depending on poster space availability). Poster and demo submissions are reviewed on a rolling basis, and submitters will be notified of acceptance status no later than two weeks after the abstract is submitted. You may submit similar content for oral presentations, posters, and demos.

Topics should be of interest to those working in high-throughput data analysis and research. Presentations that are Galaxy-centric are encouraged, but not required. Please see the abstracts page for full details.

Scholarships: International application deadline is March 20

We are pleased to offer scholarships for the 2016 Galaxy Community Conference, being held in Bloomington, Indiana, United States, June 25-29. Scholarships are available to students and post-docs in historically under-represented groups, and to those from or based in Low and Lower-Middle Income Economies, as defined by the World Bank. If this describes you or one of your students then we hope to receive an application.

Scholarships cover registration and lodging during the GCC Meeting, and for any Training or Hackathon events the applicant chooses to attend. Scholarships do not cover travel or other expenses. The application deadline is May 1 for members of historically underrepresented groups and March 20 for those from Low and Lower-Middle Income Economies.

See the full announcement for details.

Sponsors


BioTeam

We are pleased to announce that BioTeam will again be a sponsor for this annual event. BioTeam is a professional services and products company at the intersection of science and information technology. BioTeam solves analytics, big data and scientific computing challenges for organizations by mapping the right technologies to their unique scientific needs. BioTeam has a well-established history of providing forward-thinking solutions to the Life Sciences.

The BioTeam Appliance Galaxy Edition is a push-button solution that let’s researchers get up and running quickly with Galaxy. The Galaxy Appliance comes preinstalled with a production instance of Galaxy, bioinformatics tools, and reference datasets. This powerful system is specifically configured for computationally intensive scientific workloads. Most importantly, the Galaxy Appliance is an open system so researchers can install whatever tools they need and use the server as their own high-performance informatics infrastructure along side Galaxy. BioTeam provides ongoing support for the Galaxy Appliance, enabling researchers to minimize their IT burden. The Galaxy Appliance is used by researchers around the world for metagenomic, ChIP-Seq, RNA-Seq analysis and more. For more information: http://bioteam.net/bioteam-appliance/galaxy-edition/

This is the fourth year in a row that BioTeam has been a GCC Sponsor.

We continue to seek other sponsors as well and offer a wide range of sponsorship plans. If your organization is interested in having a presence at GCC2016, please contact the GCC2016 Exec for more information.


New Papers

54 new papers referencing, using, extending, and implementing Galaxy were added to the Galaxy CiteULike Group in February, bringing the total number of papers to over 3000. Highlights from last month include:

The new papers were tagged with:

# Tag    # Tag    # Tag    # Tag
2 Cloud 3 Other - Shared 4 UseMain
1 HowTo 1 Project 3 Tools 9 UsePublic
5 IsGalaxy 5 RefPublic - UseCloud - Visualization
29 Methods - Reproducibility 3 UseLocal 11 Workbench

The Galaxy CiteULike group surpassed 3000 pubs in February. 49% of them use Galaxy in their methods.


Events

March 7-8: Online IUC Contribution Fest - RADSeq Tools and Workflows

remote Contribution Fest will be held 7th and 8th of March for developers to work on Galaxy RADSeq tools

A remote Contribution Fest will be held 7th and 8th of March for developers to work on Galaxy RADSeq tools.

RADSeq is a cheap sequencing technology that is used by many resource-limited groups who would benefit a lot from easy-to-use galaxy tools. Indeed there has been quite some interest in analyzing RADSeq with Galaxy. Currently there is a wrapper for stacks and little more to help with RAD specific analysis (though many other galaxy tools are useful with RADs - bwa, cap3, gatk, velvet, ...).

If you are interested in participating in the hackathon but not interested in actual tool development - we will assemble a list of smaller, manageable Python and JavaScript tasks to work on and certainly documentation is a chronically lacking for collections so we could use help there and no actual coding would be required.

We encourage ideas or advice about how to organize this so please let us know. A core group will be available on IRC all day and we will have google hangouts across those days to organize, answer questions, and report progress.

We will do our best to coordinate and make this hackathon a nice and productive experience and we would like to especially focus on working reasonable hours and discourage overnighters.

All forms of contribution are welcome!

GMOD Meeting, June 30 - July 1

2016 GMOD meeting

The 2016 GMOD meeting will be held immediately following GCC2016 and in the same venue as GCC2016. GMOD is a consortium of open-source software project, including Galaxy, that address common challenges with biological data. Other projects in the GMOD consortium include

  • Tripal: A web front end for Chado databases. Galaxy is working with the Tripal project to make Galaxy be Tripal's analysis engine.
  • JBrowse: A client-side genome browser and successor to the venerable http://gmod.org/wiki/GBrowse. JBrowse as a Galaxy Tool was presented by Helena Rasche at GCC2015. Ian Holmes, the JBrowse PI, has put JBrowse-Galaxy integration at "top of the list" for JBrowse's infrastructure upcoming infrastructure work.
  • MAKER: A genome annotation pipeline that integrates several gene annotation engines, and combines them to produce annotation that is better than any individual tool produces. A MAKER-Galaxy by Agriculture and Agri-Food Canada was presented at ISMB 2014.
  • InterMine and BioMart: These are both popular data sources that are integrated with Galaxy.

Early bird registration ends May 21. For those who would like to present a talk or poster, the meeting registration form includes a section for submitting the presentation title and abstract. Note: GCC2016 and GMOD have separate event registration, but are sharing housing - you won't even have to change rooms.

Galaxy in Google Summer of Code

Galaxy in GSoC 2016 as part of the Open Genome Informatics group

We are delighted to announce that Galaxy will be participating in Google Summer of Code this year, along with http://gmod.org/wiki/|GMOD and Reactome, as part of the Open Genome Informatics initiative.

The current list of project ideas is on this project ideas page. There is a bit more time to firm up and add new project ideas. It is also possible for students to suggest their own ideas and we will try hard to find them a mentor. If you add (or have) new ideas to the proposal, please let our GSoC coordinators Robin Haw, Scott Cain, and Marc Gillespie know.

Between now and 13 March, would-be student participants will start to discuss application ideas with this group. Please let Robin know if you have any questions about GSoC.


February 2016 GalaxyAdmins meetup are now available

February GalaxyAdmins Slides & Video

Slides and video from the February 2016 GalaxyAdmins meetup are now available.

Upcoming Events

There are many upcoming events in the next few months. See the Galaxy Events Google Calendar for details on other events of interest to the community.

IUC Contribution Fest - RADSeq Tools and Workflows Le cycle "Bioinformatique par la pratique" 2016 Introduction à l'analyse de données RNAseq sous Galaxy   Workshop on Galaxy: Practical Course
Date Topic/Event Venue/Location Contact
March 7-8 IUC Contribution Fest - RADSeq Tools and Workflows Around the World Online IUC
March 15 Initiation à l’utilisation de Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
March 16 Analyse primaire de données issues de séquenceurs nouvelle génération sous Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
March 17 Traitement bioinformatique des données RNA-Seq sous Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
March 29 Introduction à l'analyse de données RNAseq sous Galaxy
Galaxy User Group Grand Ouest (GUGGO), Rennes, France
Training offered by GTN Member
Yvan Le Bras
April 6 Approaches for the Integration of Visual and Computational Analysis of Biomedical Data North America BioIT World 2016, Boston, Massachusetts, United States Nils Gehlenborg
April 22-26 Workshop on Galaxy: Practical Course North America SolBio International Conference 2016, Riviera Maya, México Oswaldo Trelles
May 11-13 NGS & Cancer : Analyses RNA-Seq Europe Paris, France Cancéropôle Île-de-France
May 23-24 Analyse statistique de données RNA-Seq sous Galaxy -Recherche des régions d'intérêt différentiellement exprimées Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
May 30 Initiation à l’utilisation de Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
May 31 Analyse primaire de données issues de séquenceurs nouvelle génération sous Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
June 9-10 Informatics on High-throughput Sequencing Data North America Montreal, Quebec, Canada Training offered by GTN Member Francis Ouellette
June 13-17 Using Galaxy For Analysis of High Throughput Sequence Data North America Davis, California, United States Training offered by GTN Member UC Davis Bioinformatics Training Program
June 25-29 2016 Galaxy Community Conference (GCC2016) North America Indiana University, Bloomington, Indiana, United States Training offered by GTN Member Organizers
June 30 - July 1 GMOD Meeting North America Indiana University, Bloomington, Indiana, United States Scott Cain
Designates a training event offered by GTN Member Designates a training event offered by GTN member(s)

Who's Hiring

Please Help! Yes you!

The Galaxy is expanding! Please help it grow.

Got a Galaxy-related opening? Send it to outreach@galaxyproject.org and we'll put it in the Galaxy News feed and include it in next month's update.



New Public Galaxy Servers

Three new publicly accessible Galaxy servers were listed in February:

BISTRO

BISTRO Galaxy
  • Link:

  • Domain/Purpose:

    • Taxonomic studies of environmental microbial communities
  • Comments:

    • Publishes the PipeAlign workflow and supporting tools. These tools include ballast, RASCAL, LEON, and MACSIMS.
  • User Support:

  • Quotas:

    • Supports anonymous access and creation of user logins.
  • Sponsor(s):

MiModD NacreousMap

Baumeister Lab Galaxy featuring MiModD
  • Link:

  • Domain/Purpose:

    • This server exposes the NacreousMap mapping/plotting tool of MiModD for users of MiModD who do not want to install the required dependencies (R and rpy2) for graphical output from this tool on their local system. MiModD is a comprehensive software package for the identification and annotation of mutations in the genomes of model organisms from whole-genome sequencing (WGS) data.
  • Comments:

    • CloudMap users can replot data produced with the Hawaiian Variant Mapping tool using the NacreousMap engine to obtain optimized (much smaller files that display faster) plots.
    • To install the complete MiModD package for use as a command line tool or for integration into any local Galaxy follow the installation instructions in the MiModD User Guide.
  • User Support:

  • Quotas:

    • The quota for anonymous usage is 50MB, registered users have 250MB available.
    • You can analyze/plot data in VCF format or the "Per variant report" format generated by local runs of the MiModD NacreousMap tool. The latter has the advantage of being up to 20 times smaller than the corresponding VCF file.
  • Sponsor(s):

NIOO-KNAW PhyloProfiler

NIOO-KHAN Public Galaxy Server featuring PhyloPofiler

Releases

Galaxy v16.01

GalaxyProject

The Galaxy Committers team is pleased to announce the January 2016 (v16.01) release of Galaxy.

Galaxy administrators should also be aware of the

security announcements below.

Highlights

Interactive Tours

The interactive tours framework allows developers and deployers to build interactive tutorials for users superimposed on the actual Galaxy web front end. Unlike video tutorials, these will not become stale and are truly interactive (allowing users to actually navigate and interact with Galaxy). Galaxy 16.01 ships with two example tours and new ones can easily be added by creating a small YAML file describing the tour. Try the Galaxy UI tour on Main.

Wheels

Galaxy’s Python dependencies have traditionally been distributed as eggs using custom dependency management code to enable Galaxy to distribute binary dependencies (enabling quick downloads and minimal system requirements). With this release all of that infrastructure has been replaced with a modern Python infrastructure based on pip and wheels. Work done as part of this to enable binary dependencies on Linux has been included with the recently released pip 8.

Detailed documentation on these changes and their impact under a variety of Galaxy deployment scenarios can be found in the Galaxy Framework Dependencies section of the Admin documentation.

Nested Workflows

Workflows may now run other workflows as a single abstract step in the parent workflow. This allows for reusing or subworkflows in your analyses.

Github

New
% git clone -b master https://github.com/galaxyproject/galaxy.git

Update to latest stable release
% git checkout master && pull --ff-only origin master

Update to exact version
% git checkout v16.01
BitBucket

Upgrade
% hg pull
% hg update latest_16.01
See the Get Galaxy page for additional details regarding the source code locations.

Deprecation Notices

Barring a strong outcry from deployers, 16.01 will be the last release of Galaxy to support Python 2.6. For more information, see Galaxy Github Issue #1596.

## Security Announcements

Read/write arbitrary filesystem paths, arbitrary code execution

Multiple security vulnerabilities were recently discovered in Galaxy that allow malicious actors to read and write files on the Galaxy server. Additionally, Galaxy servers on which a rarely used feature has been enabled are vulnerable to an arbitrary code execution exploit.

This issue affects all known releases of Galaxy in at least the last 3 years. See the full announcement for details.

Tool Shed Security Vulnerability - Read/write arbitrary filesystem paths

Multiple security vulnerabilities were recently discovered in the Tool Shed that allow malicious actors to read and write files on the Tool Shed server outside of normal Tool Shed repository directories.

This issue affects all known releases of Galaxy in at least the last 3 years. See the full announcment for details.

galaxy-lib 16.4.0

galaxy-lib is a subset of the Galaxy core code base designed to be used as a library. This subset has minimal dependencies and should be Python 3 compatible. It's available from GitHub and PyPi.

CloudBridge 0.1.0

The Galaxy Team is proud to be part of the development team for a new cross-cloud library called CloudBridge. CloudBridge is a Python library providing a simple layer of abstraction over different cloud providers, reducing or eliminating the need to write conditional code for each cloud. The library is generally applicable to any domain wishing to run cloud-independent applications. There is already support for Amazon and OpenStack clouds with support for Google’s Compute Engine in development.

The first version of CloudBridge was released earlier this month and it comes with detailed user documentation: http://cloudbridge.readthedocs.org/. The source code is available on Github: https://github.com/gvlproject/cloudbridge.

Other Releases

Galaxy Docker Image 15.10

And, thanks to Björn Grüning, there is also now a Docker image for Galaxy 15.10 as well.

CloudMan

CloudMan 15.12

CloudMan is a cloud manager that orchestrates all of the steps required to provision a complete compute cluster environment on a cloud infrastructure; subsequently, it allows one to manage the cluster, all through a web browser.

This minor CloudMan 15.12 release includes:

  • Update Galaxy to the Galaxy 15.10 release (see above for all the juicy details with the improvements that brings along)
  • Preload the GIE IPython Docker container onto the image for faster startup: less than 30 seconds to a fully functional IPython notebook vs. several minutes before!
  • For those of you that ran into the problem of the size of your user data overfilling after having added and removed a bunch of worker instances - no worries, that has been fixed now and resource tags are no longer stored in the cluster config to prevent persistent_data.yaml from becoming too big.
  • The file system archive extraction has been made more resilient and synchronous.
  • For those that are building custom images, a more direct method for locating the Nginx configuration directory has been added to help with different versions of the operating system (thanks to Matthew Ralston)
  • A number of dependent library versions have been updated to their latest versions.

Starforge 0.1

Starforge

Starforge is a collection of scripts that supports the building of components for Galaxy. Specifically, with Starforge you can:

These things will be built in Docker. Additionally, wheels can be built in QEMU/KVM virtualized systems.

Documentation can be found at starforge.readthedocs.org.


Pulsar

Pulsar 0.6.1

Pulsar 0.6 was released in December. Pulsar is a Python server application that allows a Galaxy server to run jobs on remote systems (including Windows) without requiring a shared mounted file systems. Unlike traditional Galaxy job runners - input files, scripts, and config files may be transferred to the remote system, the job is executed, and the results are transferred back to the Galaxy server - eliminating the need for a shared file system.

The 0.6.x release includes these changes:

  • Pulsar now depends on the new galaxy-lib Python package instead of manually synchronizing Python files across Pulsar and Galaxy.
  • Numerous build and testing improvements.
  • Fixed a documentation bug in the code (thanks to @erasche). e8814ae
  • Remove galaxy.eggs stuff from Pulsar client (thanks to @natefoo). 00197f2
  • Add new logo to README (thanks to @martenson). abbba40
  • Implement an optional awknowledgement system on top of the message queue system (thanks to @natefoo). Pull Request 82 431088c
  • Documentation fixes thanks to @remimarenco. Pull Request 78, Pull Request 80
  • Fix project script bug introduced this cycle (thanks to @nsoranzo). 140a069
  • Fix config.py on Windows (thanks to @ssorgatem). Pull Request 84
  • Add a job manager for XSEDE jobs (thanks to @natefoo). 1017bc5
  • Fix pip dependency installation (thanks to @afgane) Pull Request 73

Planemo 0.21.1

Planemo is a set of command-line utilities to assist in building tools for the Galaxy project. Planemo 0.21.1 is the most recent release. See the release history.

BioBlend 0.7.0

BioBlend version 0.7.0 was released at the beginning of November. BioBlend is a python library for interacting with CloudMan and the Galaxy API. CloudMan offers an easy way to get a personal and completely functional instance of Galaxy in the cloud in just a few minutes, without any manual configuration.) From the release CHANGELOG.

blend4j v0.1.2 blend4j v0.1.2 was released in December 2014. blend4j is a JVM partial reimplemenation of the Python library bioblend for interacting with Galaxy, CloudMan, and BioCloudCentral.


Galaxy Community Hubs

Galaxy Training Network Galaxy Community Log Board Galaxy Deployment Catalog
Share your training resources and experience now Share your experience now



Galaxy ToolShed

ToolShed Contributions

See list of tools contributed in February.


Other News