What can you do with 0.1x genome coverage? A case study based on a genome survey of the scuttle fly Megaselia scalaris (Phoridae)

BMC Genomics
David A Rasmussen, Mohamed A F Noor

Abstract

The declining cost of DNA sequencing is making genome sequencing a feasible option for more organisms, including many of interest to ecologists and evolutionary biologists. While obtaining high-depth, completely assembled genome sequences for most non-model organisms remains challenging, low-coverage genome survey sequences (GSS) can provide a wealth of biologically useful information at low cost. Here, using a random pyrosequencing approach, we sequence the genome of the scuttle fly Megaselia scalaris and evaluate the utility of our low-coverage GSS approach. Random pyrosequencing of the M. scalaris genome provided a depth of coverage (0.05x0.1x) much lower than typical GSS studies. We demonstrate that, even with extremely low-coverage sequencing, bioinformatics approaches can yield extensive information about functional and repetitive elements. We also use our GSS data to develop genomic resources such as a nearly complete mitochondrial genome sequence and microsatellite markers for M. scalaris. We conclude that low-coverage genome surveys are effective at generating useful information about organisms currently lacking genomic sequence data.

References

Sep 1, 1997·Nucleic Acids Research·S F AltschulD J Lipman
Nov 24, 1998·IEEE Engineering in Medicine and Biology Magazine : the Quarterly Magazine of the Engineering in Medicine & Biology Society·Donna M MuznyRichard A Gibbs
Apr 2, 1999·Nucleic Acids Research·J L Boore
Mar 27, 2001·Evolution; International Journal of Organic Evolution·M G Kidwell, D R Lisch
Oct 5, 2002·Science·Robert A HoltStephen L Hoffman
Sep 27, 2003·Science·Ewen F KirknessJ Craig Venter
Aug 12, 2005·Cytogenetic and Genome Research·C Biémont, C Vieira
Aug 12, 2005·Cytogenetic and Genome Research·J JurkaJ Walichiewicz
Apr 28, 2007·Bioinformatics·Robert KoflerTamas Lelley
May 10, 2007·PloS One·Ripan S MalhiDavid Glenn Smith
May 19, 2007·Science·Vishvanath NeneDavid W Severson
Jul 12, 2007·Annual Review of Entomology·R H L Disney
Nov 3, 2007·Genome Research·Phil Green
Nov 13, 2007·Nature·Drosophila 12 Genomes ConsortiumIain MacCallum
Sep 11, 2008·BMC Bioinformatics·Thomas D OttoWim M Degrave
Oct 25, 2008·Nucleic Acids Research·Susan TweedieFlyBase Consortium
Nov 26, 2008·Nucleic Acids Research·Daniel LawsonFrank H Collins
Mar 5, 2009·PloS One·Brant K PetersonMichael B Eisen
Jan 1, 2008·Molecular Ecology Resources·Matthew E Hudson

Citations

Nov 2, 2011·Chromosome Research : an International Journal on the Molecular, Supramolecular and Evolutionary Aspects of Chromosome Biology·France Dufresne, Nicholas Jeffery
Sep 3, 2010·The Journal of Heredity·Jen C Perry, Locke Rowe
Mar 2, 2012·Genome Génome / Conseil National De Recherches Canada·Natuo KômotoShuichiro Tomita
Dec 18, 2013·BMC Genomics·Zhimin GaoQingyan Shu
May 30, 2012·BMC Research Notes·Emese MegléczMichael G Gardner
May 19, 2010·International Journal of Molecular Sciences·Ryan C GarrickPaul Sunnucks
Oct 20, 2015·Journal of Experimental Zoology. Part A, Ecological Genetics and Physiology·Tadeusz MalewskiKatarzyna Kowalewska
May 14, 2011·Molecular Ecology Resources·E GuichouxR J Petit
Aug 28, 2015·Genome Biology and Evolution·Paul G WolfJoshua P Der
Aug 25, 2012·Journal of Biomedicine & Biotechnology·Brad S CoatesBlair D Siegfried
Jun 15, 2010·Current Biology : CB·Robin L Varney, Mohamed A F Noor
Dec 3, 2014·Genomics·Meganathan P RamakodiDavid A Ray
Sep 4, 2019·Molecular Ecology Resources·Solomon T C Chak, Dustin R Rubenstein
May 23, 2020·Journal of Medical Entomology·Luz Alejandra Castillo-AlanisMaría Elena Bravo-Gómez
Feb 6, 2017·Molecular Genetics and Genomics : MGG·Sanxu LiuJing Li
Apr 18, 2019·Anais Da Academia Brasileira De Ciências·Deise Schröder SarziValdir Marcos Stefenon
Nov 9, 2014·G3 : Genes - Genomes - Genetics·Kenneth B Hoehn, Mohamed A F Noor
Apr 1, 2018·Current Opinion in Insect Science·Brian M Wiegmann, Stephen Richards

Datasets Mentioned

BETA
SRA008268
SRA008342

Methods Mentioned

BETA
flow cytometry
454 sequencing
PCR

Related Concepts

Diptera
Drosophila
Sequence Determinations, DNA
Tetranucleotide Repeats
Computational Molecular Biology
Interspersed Repetitive Sequences
Genomics
Genome, Insect
Genome, Mitochondrial
Case-Control Studies

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Alzheimer's Disease: MS4A

Variants within the membrane-spanning 4-domains subfamily A (MS4A) gene cluster have recently been implicated in Alzheimer's disease in genome-wide association studies. Here is the latest research on Alzheimer's disease and MS4A.

Pediculosis pubis

Pediculosis pubis is a disease caused by a parasitic insect known as Pthirus pubis, which infests human pubic hair, as well as other areas with hair including eye lashes. Here is the latest research.

Rh Isoimmunization

Rh isoimmunization is a potentially preventable condition that occasionally is associated with significant perinatal morbidity or mortality. Discover the latest research on Rh Isoimmunization here.

Genetic Screens in iPSC-derived Brain Cells

Genetic screening is a critical tool that can be employed to define and understand gene function and interaction. This feed focuses on genetic screens conducted using induced pluripotent stem cell (iPSC)-derived brain cells. It also follows CRISPR-Cas9 approaches to generating genetic mutants as a means of understanding the effect of genetics on phenotype.

Enzyme Evolution

This feed focuses on molecular models of enzyme evolution and new approaches (such as adaptive laboratory evolution) to metabolic engineering of microorganisms. Here is the latest research.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Pharmacology of Proteinopathies

This feed focuses on the pharmacology of proteinopathies - diseases in which proteins abnormally aggregate (i.e. Alzheimer’s, Parkinson’s, etc.). Discover the latest research in this field with this feed.

Alignment-free Sequence Analysis Tools

Alignment-free sequence analyses have been applied to problems ranging from whole-genome phylogeny to the classification of protein families, identification of horizontally transferred genes, and detection of recombined sequences. Here is the latest research.