May 2016 Galaxy News

Galaxy News

Welcome to the May 2016 Galactic News, a summary of what is going on in the Galaxy community.

If you have anything to include in the next News, please send it to Galaxy Outreach.

GCC2016

2016 Galaxy Community Conference

GCC2016 will be held June 25-29 at Indiana University in Bloomington, Indiana, United States. This will be the 7th annual gathering of the Galaxy community, and we are expecting over 200 participants again this year. The 2016 Galaxy Community Conference includes 2 days of hackathons, 2 days of training, and a two day meeting featuring accepted presentations, keynotes, poster sessions, the new Visualization Showcase and Software Demo sessions, lightning talks, birds-of-a-feather meetups, and plenty of networking.

Deadlines!

Poster & Demo Abstracts: May 20

   Submit an abstract   

The deadline for poster and computer demo abstracts is May 20, or when we run out of space, whichever comes first. Abstracts are reviewed on a rolling basis and submitters are notified of acceptance status no later than two weeks after submission. You may submit similar content for oral presentations, posters, and demos.

Topics should be of interest to those working in high-throughput data analysis and research. Presentations that are Galaxy-centric are encouraged, but not required. Please see the abstracts page for full details.

(And we are also still accepting late oral abstracts, that will be considered if we have cancellations.)

Early Registration: May 20

   Register Now   

Early registration for GCC2016 ends May 20. Registration costs depend on which events you register for, your career stage & affiliation, and when you register. Early bird registration ends May 20 and is up to 40% less than regular registration rates. Early bird registration starts at less than $45 / day for students and postdocs, and at $65 / day for other attendees from non-profits.

You can also sign up for conference housing during registration.

You are strongly encouraged to review the training and housing options before beginning the registration process.

Scholarships: May 1

Scholarships: Apply now

We are pleased to offer scholarships for the 2016 Galaxy Community Conference, being held in Bloomington, Indiana, United States, June 25-29. Scholarships are available to students and post-docs in historically under-represented groups, and to those from or based in Low and Lower-Middle Income Economies, as defined by the World Bank. If this describes you or one of your students then we hope to receive an application.

Scholarships cover registration and lodging during the GCC Meeting, and for any Training or Hackathon events the applicant chooses to attend. Scholarships do not cover travel or other expenses. The application deadline is May 1 for members of historically underrepresented groups.

See the full announcement for details.

Sponsors

We continue to seek other sponsors as well and offer a wide range of sponsorship plans. If your organization is interested in having a presence at GCC2016, please contact the GCC2016 Exec for more information.

GigaScience

GigaScience

Please welcome the journal GigaScience as a GCC Silver Sponsor for the 4th year in a row. GigaScience aims to revolutionize reproducibility of analyses, data dissemination, organization, understanding, and use.

All accepted oral presentations are eligible for consideration for publication in the journal GigaScience's Galaxy series. Published papers will receive a 15% discount in the article-processing charge if you flag GCC2016 on submission. As an open access and open-data journal focussing on reproducibility, GigaScience publishes all research objects (including data, software tools, workflows, VMs and containers) from 'big data' studies across the entire spectrum of life and biomedical sciences. GigaScience submissions utilize a novel format, where all of the supporting research objects are hosted and integrated into accepted papers using independently citable digital object identifiers from the journal's GigaGalaxy server and GigaDB database. See the Galaxy series page for examples of work coming from previous GCC meetings.

GenomeWeb

GenomeWeb

Please welcome GenomeWeb as a GCC Silver Sponsor for the third year in a row. GenomeWeb is an independent online news organization that provides in-depth coverage of the scientific and economic ecosystem spurred by high-throughput genome sequencing. We are the leading information source for scientists, executives, and clinicians who use and develop advanced life science tools.

EMC

EMC

Finally, we are delighted to have EMC again as a GCC sponsor for the 4th time. EMC is a Peta Sponsor for both GCC2016 Hacakthons.

EMC Emerging Technologies Division (ETD) is a global leader and trusted partner in Life Science storage solutions. We deliver powerful yet versatile solutions for healthcare and life science organizations that want to manage clinical and genomics data. ETD storage solutions are simple to install, manage and scale, at any size, across the R&D data lifecycle. As a leader and trusted partner at hundreds of Life Science organizations worldwide, ETD storage solutions provide the security, ease of management,high availability, and scalability needed to manage Life Science workflows today and in the future.


Upcoming Events


64th ASMS Conference on Mass Spectrometry and Allied Topics

Galaxy at ASMS 2016

Galaxy will have a strong presence at the 64th ASMS Conference on Mass Spectrometry and Allied Topics, being held June 5-9 in San Antonio, Texas, United States. There will be one workshop and one talk (both from the GalaxyP Project), and at least 7 posters on using Galaxy for proteomics.

If you are interested, register now as early registration closes April 30.

Using Galaxy for Analysis of RNA-Seq and ChIP-Seq Data

Using Galaxy for Analysis of RNA-Seq and ChIP-Seq Data

The UC Davis Bioinformatics Training Program, a GTN member, will be presenting the workshop Using Galaxy for Analysis of RNA-Seq and ChIP-Seq Data on June 13-17, at UC Davis in Davis, California, United States.

This workshop will include a rich collection of lectures and hands-on sessions, covering both theory and tools. We will explore the basics of high throughput sequencing technologies, focusing on Illumina data for hands-on exercises. Participants will explore software and protocols, create and modify workflows, and diagnose/treat problematic data, utilizing computing power of the Amazon Cloud.

Space is limited and this workshop is already more than 50% full.

May, June and July Events

There are a staggering 14 known Galaxy related events and presentations in May. These are spread over 4 countries on 3 continents. June and July are filling up too.

See the Galaxy Events Google Calendar for details on other events of interest to the community.

QFAB Workshops NGS & Cancer : Analyses RNA-Seq RNA-Seq and DNA-Seq Cancer Analyses Cycle d'aprentissage sous Galaxy
Date Topic/Event Venue/Location Contact
May 5 De novo genome assembly using Genomics Virtual Lab Workshop Australia University of Queensland, St Lucia, Australia Training offered by GTN Member QFAB Training, Xin-Yi Chua, Mike Thang
May 6 Variant detection using Galaxy Workshop Australia University of Queensland, St Lucia, Australia Training offered by GTN Member QFAB Training
May 10-14 EDGY—Export of data from Galaxy to Yabi, automated workflow transfer to command line tools North America The Biology of Genomes, Cold Spring Harbor Laboratory, New York, United States David Molik
May 11-13 NGS & Cancer : Analyses RNA-Seq Europe Paris, France Cancéropôle Île-de-France
May 12-13 RNA-Seq and DNA-Seq Cancer Analyses Europe Rotterdam, The Netherlands Training offered by GTN Member Youri Hoogstrate
May 17 Galaxy First Step Europe Part of Cycle d'aprentissage sous Galaxy, INRA, Auzeville, France Training offered by GTN Member Sarah Maman
May 18-19 Galaxy: Reads alignment and SNP calling Europe Part of Cycle d'aprentissage sous Galaxy, INRA, Auzeville, France Training offered by GTN Member Philippe Bardou
May 19-20 Galaxy : RNAseq alignment and transcripts assemblies Europe Part of Cycle d'aprentissage sous Galaxy, INRA, Auzeville, France Training offered by GTN Member Celine Noirot, Cédric Cabau
May 18-20 ELIXIR Technical Hackathon: Tools, Workflows and Workbenches
Full
Europe Institut Pasteur, Paris, France Organisers
May 23-24 Analyse statistique de données RNA-Seq sous Galaxy -Recherche des régions d'intérêt différentiellement exprimées Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
May 26 RNA-Seq Analysis Using Galaxy Australia Children’s Medical Research Institute, Westmead, NSW, Australia Katherine Champ
May 27 Variant Detection using Galaxy Australia Children’s Medical Research Institute, Westmead, NSW, Australia Katherine Champ
May 30 Initiation à l’utilisation de Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
May 31 Analyse primaire de données issues de séquenceurs nouvelle génération sous Galaxy Europe Part of Le cycle "Bioinformatique par la pratique" 2016, Jouy-en-Josas, France Formation Migale
June 5-9 64th ASMS Conference on Mass Spectrometry and Allied Topics North America San Antonio, Texas, United States Training offered by GTN Member Presenters
June 9-10 Informatics on High-throughput Sequencing Data North America Montreal, Quebec, Canada Training offered by GTN Member Francis Ouellette
June 13-17 Using Galaxy for Analysis of RNA-Seq and ChIP-Seq Data North America Davis, California, United States Training offered by GTN Member UC Davis Bioinformatics Training Program
June 16 GalaxyAdmins Web Meetup Around the World Online Evan Bollig, JJ Johnson, Hans-Rudolf Hotz, Dave Clements
June 25-29 2016 Galaxy Community Conference (GCC2016) North America Indiana University, Bloomington, Indiana, United States Training offered by GTN Member Organizers
June 30 - July 1 GMOD Meeting North America Indiana University, Bloomington, Indiana, United States Scott Cain
July 4-8 An introduction to Galaxy with the NeCTAR Genomics Virtual Laboratory Workshop Australia 2016 Winter School in Mathematical & Computational Biology, University of Queensland, Brisbane, Australia Igor Makunin, Derek Benson
July 6-7 BOSC Codefest 2016 North America Orlando, Florida, United States Brad Chapman
July 8-9 BOSC 2016 North America ISMB 2016, Orlando, Florida, United States
July 8-12 ISMB 2016 North America Orlando, Florida, United States
July 13-17 The Allied Genetics Conference 2016 (TAGC) North America Orlando, Florida, United States Dave Clements
Designates a training event offered by GTN Member Designates a training event offered by GTN member(s)

Past Events

April GalaxyAdmins Slides & Video

GalaxyAdmins

Slides and video from the April 2016 GalaxyAdmins meetup are now available. Ivar Grytten and Geir Kjetil Sandve from the University of Oslo discussed The Galaxy Portal: Accessing Galaxy from Mobile Devices (Slides) and John Chilton covered Tool Development Developments.

Conda Dependency Codefest Report

Conda Dependencies Codefest

A Conda Dependencies Dodefest was held on Monday April 4, and involved 8 participants. It was designed to be beginner friendly, which increased contribution from the community. 4 members of the galaxy community were added as contributors to the bioconda-recipe repository as a result of this hackathon. The main aim of the codefest was to get community members familiar with the Conda-Galaxy integration, and to remove tools from testing blacklist. See the full codefest report for details.


Galaxy on Jetstream Cloud

Want your own Galaxy server, for free? You can now easily create Galaxy servers on the new NSF Jetstream cloud. Each server comes preconfigured with hundreds of tools and commonly used reference datasets. It only takes a couple of minutes to start one. Once running, you can use it or change it up any way you like.

How do I get access?

You must be a US-based academic to access Jetstream cloud. Access is free but it is necessary to have an XSEDE account (go to https://www.xsede.org/ to sign up) and have an active resource allocation. Getting the resource allocation is matter of writing a summary of your research in less than 100 words and waiting ~24 hrs for the application to get approved. Go to http://jetstream-cloud.org/allocations.php → "Submit and manage allocation requests" to get started; choose Startup type of allocation.

How do I launch my own Galaxy server?

After you have your XSEDE account and an active allocation:

  1. Visit https://use.jetstream-cloud.org/
  2. Browse the available images and choose "Galaxy 16.01 Standalone"
  3. Follow the prompts on the screen to launch an instance
  4. In less than 5 minutes, you should have your own, fully configured Galaxy server

More documentation about the process can be found here.

New Papers

72 new papers referencing, using, extending, and implementing Galaxy were added to the Galaxy CiteULike Group in April.

Some April highlights:

The new papers were tagged with:

# Tag    # Tag    # Tag    # Tag
1 Cloud 3 Other - Shared 9 UseMain
1 HowTo - Project 6 Tools 17 UsePublic
4 IsGalaxy 5 RefPublic - UseCloud 2 Visualization
42 Methods 1 Reproducibility 7 UseLocal 14 Workbench

New Tutorials and Video

Diploid variant calling

There are two new comprehensive online tutorials from Anton Nekrutenko:

Diploid variant calling

Variant calling is a complex field that was significantly propelled by advances in DNA sequencing and efforts of large scientific consortia such as the 1000 Genomes. This tutorial summarizes basic ideas central to Genotype and Variant calling.

Reference based RNA seq

Much of Galaxy-related features described in this tutorial have been developed by Björn Grüning (@bgruening) and configured by Dave Bouvier (@davebx).

Reference based RNA seq

This tutorial is inspired by an exceptional RNAseq course at the Weill Cornell Medical College compiled by Friederike Dündar, Luce Skrabanek, and Paul Zumbo and by tutorials produced by Björn Grüning (@bgruening) for Freiburg Galaxy instance. Much of Galaxy-related features described in this section have been developed by Björn Grüning (@bgruening) and configured by Dave Bouvier (@davebx).

Dataset Collections

Dataset collections help analyzing multiple datasets in just a few clicks. This short video shows how. If you prefer reading to watching, then check out this tutorial.



Who's Hiring


Please Help! Yes you!

The Galaxy is expanding! Please help it grow.

Got a Galaxy-related opening? Send it to outreach@galaxyproject.org and we'll put it in the Galaxy News feed and include it in next month's update.



New Public Galaxy Servers

There are two new publicly accessible Galaxy servers:

MGEScan

MGEScan on Galaxy Workflow System

Koslicki Lab

Koslicki Lab Server

Galaxy Community Hubs

Galaxy Training Network Galaxy Community Log Board Galaxy Deployment Catalog
Share your training resources and experience now Share your experience now Describe your instance now


One new training resource was added in April:


Releases


Planemo 0.24.2

Planemo is a set of command-line utilities to assist in building tools for the Galaxy project. April releases features these updates:

  • Fix test summary report. Pull Request 429
  • Improve error reporting when running shed_test. ce8e1be
  • Improved code comments and tests for shed related functionality. 89674cb
  • Rev galaxy-lib dependency to 16.4.1 to fix wget usage in newer versions of wget. d76b489
  • Revert "check .shed.yml owner against credentials during shed creation", test was incorrect and preventing uploads. Pull Request 425, Issue 246

See the release history.

galaxy-lib 16.7.0

galaxy-lib is a subset of the Galaxy core code base designed to be used as a library. This subset has minimal dependencies and should be Python 3 compatible. It's available from GitHub and PyPi.


Pulsar

Pulsar 0.7.0

Pulsar 0.7 was released in April. Pulsar is a Python server application that allows a Galaxy server to run jobs on remote systems (including Windows) without requiring a shared mounted file systems. Unlike traditional Galaxy job runners - input files, scripts, and config files may be transferred to the remote system, the job is executed, and the results are transferred back to the Galaxy server - eliminating the need for a shared file system.

Earlier Releases

Galaxy v16.01

GalaxyProject

The January 2016 (v16.01) release of Galaxy features

  • Interactive Tours
  • Wheels
  • Nested Workflows

See the announcement for full details.

Galaxy Docker Image 16.01

And, thanks to Björn Grüning, there is also now a Docker image for Galaxy 16.01 as well.

CloudMan 16.03

CloudMan

We just released an update to Galaxy CloudMan on AWS. CloudMan offers an easy way to get a personal and completely functional instance of Galaxy in the cloud in just a few minutes, without any manual configuration or imposed quotas. Once running, you have complete control over Galaxy, including the ability to install new tools.

Most notable changes include:

  • Galaxy 16.01 release
  • A fine-grained control over auto-scaling options
  • Several fixes to cluster sharing and cloning

See the CHANGELOG for a more complete set of changes.

CloudBridge 0.1.0

The Galaxy Team is proud to be part of the development team for a new cross-cloud library called CloudBridge. CloudBridge is a Python library providing a simple layer of abstraction over different cloud providers, reducing or eliminating the need to write conditional code for each cloud. The library is generally applicable to any domain wishing to run cloud-independent applications. There is already support for Amazon and OpenStack clouds with support for Google’s Compute Engine in development.

The first version of CloudBridge was released earlier this month and it comes with detailed user documentation. The source code is available on Github.

Starforge 0.1

Starforge

Starforge is a collection of scripts that supports the building of components for Galaxy. Specifically, with Starforge you can:

These things will be built in Docker. Additionally, wheels can be built in QEMU/KVM virtualized systems.

Documentation can be found at starforge.readthedocs.org.

BioBlend 0.7.0

BioBlend version 0.7.0 was released at the beginning of November. BioBlend is a python library for interacting with CloudMan and the Galaxy API. CloudMan offers an easy way to get a personal and completely functional instance of Galaxy in the cloud in just a few minutes, without any manual configuration.) From the release CHANGELOG.

blend4j v0.1.2 blend4j v0.1.2 was released in December 2014. blend4j is a JVM partial reimplemenation of the Python library bioblend for interacting with Galaxy, CloudMan, and BioCloudCentral.



Galaxy ToolShed

ToolShed Contributions

Sorry. Ran out of time. Look for a double batch in the June News.