corseq: fast and efficient identification of favoured codons from next generation sequencing reads

PeerJ
Salvatore Camiolo, Andrea Porceddu

Abstract

Transgene expression can be optimized by designing coding sequences that follow the synonymous codon usage of genes that are highly expressed in the host organism. Identifying these so-called "favoured codons" generally requires access to either the genome or the coding sequences, together with expression data. Here we describe corseq, a fast and reliable software tool for detecting favoured codons directly from RNAseq data without prior knowledge of the genomic sequence or gene annotation. The tool infers the codons that are preferentially used in highly expressed genes while estimating transcript abundance with a new k-mer-based approach. corseq is implemented in Python and runs under any operating system. The software requires the Biopython library (version 1.65 or later) and is available under the 'GNU General Public License version 3' at the project webpage https://sourceforge.net/projects/corseq/files. corseq represents a faster, easy-to-use alternative for the detection of favoured codons in non-model organisms.
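The general idea of a favoured codon can be illustrated with a minimal sketch. This is not corseq's algorithm: it assumes coding sequences have already been split into a highly expressed set and a lowly expressed set (corseq derives expression from reads with its own k-mer-based method, which is not reproduced here), and it simply flags codons whose within-amino-acid frequency is higher in the highly expressed set. The function names and the `min_delta` threshold are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_counts(cds_list):
    """Count in-frame codons across a list of coding sequences."""
    counts = Counter()
    for cds in cds_list:
        cds = cds.upper()
        for i in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[i:i + 3]
            if codon in CODON_TABLE:  # skips codons containing N, etc.
                counts[codon] += 1
    return counts


def relative_usage(counts):
    """Frequency of each codon among the codons for the same amino acid."""
    totals = Counter()
    for codon, n in counts.items():
        totals[CODON_TABLE[codon]] += n
    return {codon: n / totals[CODON_TABLE[codon]] for codon, n in counts.items()}


def favoured_codons(high_cds, low_cds, min_delta=0.0):
    """Codons used relatively more often in the highly expressed set.

    Assumption: the two sets were defined beforehand from expression data.
    """
    high = relative_usage(codon_counts(high_cds))
    low = relative_usage(codon_counts(low_cds))
    favoured = {}
    for codon, aa in CODON_TABLE.items():
        if aa in "*MW":  # stop codons and single-codon amino acids
            continue
        delta = high.get(codon, 0.0) - low.get(codon, 0.0)
        if delta > min_delta:
            favoured[codon] = round(delta, 3)
    return favoured
```

For example, `favoured_codons(["GCTGCTGCTGCC"], ["GCCGCCGCCGCT"])` compares two toy alanine-only sequences and reports GCT as the codon enriched in the first set. A real analysis would also need a statistical test and many more genes per set.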

