Tools

  • From tambeta:

    • gprofiler: g:Profiler performs statistical enrichment analysis to provide interpretation to user-provided gene lists, ordered gene lists and chromosomal regions. We study multiple sources of functional evidence, including Gene Ontology terms, biological pathways, regulatory motifs of transcription factors and microRNAs, human disease annotations and protein-protein interactions.
  • From koen:

    • gbsx: GBSX is a package to design inline barcodes for, and to demultiplex GBS and RAD experiments Genotyping by Sequencing is an emerging technology for cost effective variant discovery and genotyping. However, current analysis tools do not fulfill all experimental design and analysis needs. GBSX is a package of tools to first aid in experimental design, including choice of enzymes and barcode design. Secondly, it provides a first analysis step to demultiplex samples using in-line barcodes, providing fastq files that can easily be plugged into existing variant analysis pipelines.
  • From iuc:

    • resize_coordinate_window: Modifies the start and stop coordinates of GFF data, expanding the coordinate window by a specified size. Contains a tool that modifies the start and stop coordinates of GFF data such that the new start and stop position is based on a specified window size that is computed either from the existing start and stop coordinates or centered on the midpoint.
  • data_manager_bowtie_index_builder: Data Manager for building bowtie indexes Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome at a rate of over 25 million 35-bp reads per hour. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: typically about 2.2 GB for the human genome (2.9 GB for paired-end).

  • extract_genomic_dna: Contains a tool that extracts genomic DNA using coordinates from ASSEMBLED genomes and UNassembled genomes.

  • From bgruening:

  • From mkh:

    • ips5: InterProScan 5
  • From fgiacomoni:

    • hmdb_ms_search: Init and uploaded [W4M][LC-MS] HMDB database MS Search Package - Annotation - Returns annotation results (adducts and metabolites) from The Human Metabolome Database. Part of the W4M project: http://workflow4metabolomics.org / HMDB: http://www.hmdb.ca/. The wrapper script use the HMDB 'MS search' resource to annotate a list of m/z. The process returns outputs files (CSV and HTML formats) with links through metabocards.
  • From drosofff:

    • blast_to_scaffold: Generate DNA scaffold from blastn or tblastx alignments of Contigs This tool start from DNA contigs that aligned to a subject DNA sequence through blastn or tblastx. The contigs must be provided in fasta format. The blastn or tblastx output must be tabular, the 12 standard column plus column 13 with the length of the blastn or tblastx subject. The sequence used to BLAST (blastn or tblastx) the contigs must be provided to serve as a guide to the final assembly The final assembly is a DNA sequence. Nucleotides of the guide sequence which were not covered by contigs are in small letters in the output assembly.
  • From abretaud:

    • busco: Uploaded BUSCO assess genome and annotation completeness. Assessing genome assembly and annotation completeness with Benchmarking Universal Single-Copy Orthologs
  • From nml:

    • spades: Genome assembler for regular and single-cell projects
  • From jjohnson:

  • From galaxyp:

    • translate_bed_sequences: Perform 3 frame translation of BED file augmented with a sequence column This tool takes an extended (12 column) BED input file that has been processed by the Galaxy tool "Extract Genomic DNA" to add a 13th column with the transcript sequence. It generates a peptide fasta file with the 3-frame translations of the spliced sequence defined by each entry in the input BED file. It can also generate a bed file that defines the mapping of the translations to the reference genome. The tool "Map peptides to a bed file" can use this output to map peptides iidentified from a Proteomics search back to the reference genome.
    • map_peptides_to_bed: Map peptides to a reference genome using a bed file the locates the proein sequence in the reference genome. This is intended for use with the bed output of the translate_bed_sequences tool.

Dependency Definitions

  • From abretaud:

    • package_busco_1_1b1: Tool dependency definition that downloads and version 1.1b1 of BUSCO Assessing genome assembly and annotation completeness with Benchmarking Universal Single-Copy Orthologs
  • From jjohnson:

    • package_r_ada_2_0_3: Tool dependency definition that downloads and installs version 2.0.3 of the ada R package along with depenedency rpart ada performs discrete, real, and gentle boost under both exponential and logistic loss on a given data set. The package ada provides a straightforward, well-documented, and broad boosting routine for classification, ideally suited for small to moderate-sized data sets. https://cran.r-project.org/web/packages/ada/index.html