Massively parallel whole transcriptome sequencing, commonly referred to as RNA-Seq, is quickly becoming the technology of choice for gene expression profiling. However, due to the short read length delivered by current sequencing technologies, estimation of expression levels for alternatively spliced gene isoforms remains challenging. In this paper we present a novel expectation-maximization algorithm for inference of isoform- and gene-specific expression levels from RNA-Seq data. Our algorithm, referred to as IsoEM, is based on disambiguating information provided by the distribution of insert sizes generated during sequencing library preparation, and takes advantage of base quality scores, strand, and read pairing information when available. The open source Java implementation of IsoEM is freely available at http://dna.engr.uconn.edu/software/IsoEM/. Empirical experiments on both synthetic and real RNA-Seq datasets show that IsoEM has scalable running time and outperforms existing methods of isoform and gene expression level estimation. Simulation experiments confirm previous findings that, for a fixed sequencing cost, using reads longer than 25-36 bases does not necessarily lead to better accuracy for estimating expression levels ...
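The expectation-maximization idea mentioned in the abstract can be illustrated with a minimal, generic sketch. This is not the IsoEM implementation: the function name `em_isoform_frequencies`, the input layout, and the toy data are assumptions made for illustration only, and the per-read weights are treated as given inputs rather than being derived from insert sizes, base quality scores, strand, or read pairing as the abstract describes.

```python
def em_isoform_frequencies(read_weights, isoform_lengths, n_iter=200, tol=1e-9):
    """Generic EM sketch for isoform frequency estimation (hypothetical helper).

    read_weights: one dict per read, mapping each compatible isoform to a
        positive compatibility weight (assumed precomputed).
    isoform_lengths: dict mapping isoform name to its length in bases.
    Returns a dict mapping isoform name to estimated relative frequency.
    """
    isoforms = list(isoform_lengths)
    # Start from a uniform guess over all isoforms.
    freq = {j: 1.0 / len(isoforms) for j in isoforms}

    for _ in range(n_iter):
        # E-step: split each read fractionally among its compatible isoforms,
        # in proportion to weight * current frequency.
        counts = {j: 0.0 for j in isoforms}
        for weights in read_weights:
            total = sum(w * freq[j] for j, w in weights.items())
            if total == 0.0:
                continue  # read is incompatible with every expressed isoform
            for j, w in weights.items():
                counts[j] += w * freq[j] / total

        # M-step: convert expected read counts to length-normalized
        # frequencies that sum to 1.
        rates = {j: counts[j] / isoform_lengths[j] for j in isoforms}
        norm = sum(rates.values())
        new_freq = {j: rates[j] / norm for j in isoforms}

        # Stop once the estimates no longer change noticeably.
        delta = max(abs(new_freq[j] - freq[j]) for j in isoforms)
        freq = new_freq
        if delta < tol:
            break
    return freq


# Toy data (invented for illustration): two isoforms of equal length,
# 30 reads unique to A, 10 unique to B, and 60 reads compatible with both.
lengths = {"A": 1000, "B": 1000}
reads = [{"A": 1.0}] * 30 + [{"B": 1.0}] * 10 + [{"A": 1.0, "B": 1.0}] * 60
freq = em_isoform_frequencies(reads, lengths)
print(freq)  # approximately {'A': 0.75, 'B': 0.25}
```

In this toy case the ambiguous reads end up apportioned in the same 3:1 ratio as the uniquely assigned reads, which is the kind of disambiguation an EM scheme provides. A fuller model would replace the constant weights with probabilities informed by the sources of information listed in the abstract.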