Jun 22, 2016

aRNApipe: A balanced, efficient and distributed pipeline for processing RNA-seq data in high performance computing environments

BioRxiv : the Preprint Server for Biology
Arnald AlonsoRichard M Myers

Abstract

The wide range of RNA-seq applications and their high computational needs require the development of pipelines orchestrating the entire workflow and optimizing usage of available computational resources. We present aRNApipe, a project-oriented pipeline for processing of RNA-seq data in high performance cluster environments. aRNApipe is highly modular and can be easily migrated to any high performance computing (HPC) environment. The current applications included in aRNApipe combine the essential RNA-seq primary analyses, including quality control metrics, transcript align-ment, count generation, transcript fusion identification, alternative splicing, and sequence variant calling. aRNApipe is project-oriented and dynamic so users can easily update analyses to include or exclude samples or enable additional processing modules. Workflow parameters are easily set using a single configuration file that provides centralized tracking of all analytical processes. Finally, aRNApipe incorporates interactive web reports for sample tracking and a tool for managing the ge-nome assemblies available to perform an analysis. Availability and documentation: https://github.com/HudsonAlpha/aRNAPipe; DOI: 10.5281/zenodo.202950

  • References
  • Citations

References

  • We're still populating references for this paper, please check back later.
  • References
  • Citations

Citations

  • This paper may not have been cited yet.

Mentioned in this Paper

Sequence Determinations, RNA
Environment
GYPC gene
Sequence Analysis, RNA
Tracking
Analysis
Hereditary Pancreatitis
C1orf56
Transcript
Metric

About this Paper

Related Feeds

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.

Alternative splicing

Alternative splicing a regulated gene expression process that allows a single genetic sequence to code for multiple proteins. Here is that latest research.