Citing a Data Repository: A Case Study of the Protein Data Bank

PloS One
Yi-Hung HuangChun-Nan Hsu

Abstract

The Protein Data Bank (PDB) is the worldwide repository of 3D structures of proteins, nucleic acids and complex assemblies. The PDB's large corpus of data (> 100,000 structures) and related citations provide a well-organized and extensive test set for developing and understanding data citation and access metrics. In this paper, we present a systematic investigation of how authors cite PDB as a data repository. We describe a novel metric based on information cascade constructed by exploring the citation network to measure influence between competing works and apply that to analyze different data citation practices to PDB. Based on this new metric, we found that the original publication of RCSB PDB in the year 2000 continues to attract most citations though many follow-up updates were published. None of these follow-up publications by members of the wwPDB organization can compete with the original publication in terms of citations and influence. Meanwhile, authors increasingly choose to use URLs of PDB in the text instead of citing PDB papers, leading to disruption of the growth of the literature citations. A comparison of data usage statistics and paper citations shows that PDB Web access is highly correlated with URL mentions i...Continue Reading

References

Dec 10, 1998·Nucleic Acids Research·A Bairoch, R Apweiler
Dec 11, 1999·Nucleic Acids Research·H M BermanP E Bourne
Dec 26, 2001·Nucleic Acids Research·Ron EdgarAlex E Lash
Dec 26, 2001·Nucleic Acids Research·John WestbrookHelen M Berman
Jan 10, 2003·Nucleic Acids Research·Brigitte BoeckmannMichel Schneider
Jan 10, 2003·Nucleic Acids Research·H BoutselakisW Vranken
Jan 10, 2003·Nucleic Acids Research·John WestbrookHelen M Berman
Nov 25, 2003·Nature Structural Biology·Helen BermanHaruki Nakamura
Dec 19, 2003·Nucleic Acids Research·Rolf ApweilerLai-Su L Yeh
Dec 19, 2003·Nucleic Acids Research·Alex BatemanSean R Eddy
Dec 19, 2003·Nucleic Acids Research·A GolovinK Henrick
Dec 19, 2003·Nucleic Acids Research·Philip E BourneHelen M Berman
Oct 29, 2004·Bioinformatics·John WestbrookHelen M Berman
Dec 21, 2004·Nucleic Acids Research·S VelankarK Henrick
Nov 9, 2005·Proceedings of the National Academy of Sciences of the United States of America·J E Hirsch
Dec 31, 2005·Nucleic Acids Research·Sam Griffiths-JonesAnton J Enright
Dec 31, 2005·Nucleic Acids Research·Andrei KouranovHelen M Berman
Dec 5, 2006·Nucleic Acids Research·Helen BermanJohn L Markley
Nov 28, 2007·Nucleic Acids Research·Robert D FinnAlex Bateman
Dec 13, 2007·Nucleic Acids Research·Kim HenrickHelen M Berman
Apr 24, 2008·Briefings in Bioinformatics·Daron M StandleyHaruki Nakamura
May 1, 2010·BMC Bioinformatics·Andreas PrlićJ Lynn Fink
May 3, 2012·The American Journal of Occupational Therapy : Official Publication of the American Occupational Therapy Association·Carla A ChaseMarian Arbesman
Jun 12, 2012·Database : the Journal of Biological Databases and Curation·Aurélie NévéolZhiyong Lu
Nov 8, 2012·PloS One·Jason PriemDario Taraborelli
Jun 5, 2013·PloS One·Şenay KafkasJohanna R McEntyre
Sep 30, 2014·Scientific Reports·Lovro ŠubeljMarko Bajec
Oct 31, 2014·Nature·Richard Van NoordenRegina Nuzzo

❮ Previous
Next ❯

Software Mentioned

Entrez
CODATA
BioLit

Related Concepts

Trending Feeds

COVID-19

Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Blastomycosis

Blastomycosis fungal infections spread through inhaling Blastomyces dermatitidis spores. Discover the latest research on blastomycosis fungal infections here.

Nuclear Pore Complex in ALS/FTD

Alterations in nucleocytoplasmic transport, controlled by the nuclear pore complex, may be involved in the pathomechanism underlying multiple neurodegenerative diseases including Amyotrophic Lateral Sclerosis and Frontotemporal Dementia. Here is the latest research on the nuclear pore complex in ALS and FTD.

Applications of Molecular Barcoding

The concept of molecular barcoding is that each original DNA or RNA molecule is attached to a unique sequence barcode. Sequence reads having different barcodes represent different original molecules, while sequence reads having the same barcode are results of PCR duplication from one original molecule. Discover the latest research on molecular barcoding here.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Evolution of Pluripotency

Pluripotency refers to the ability of a cell to develop into three primary germ cell layers of the embryo. This feed focuses on the mechanisms that underlie the evolution of pluripotency. Here is the latest research.

Position Effect Variegation

Position Effect Variagation occurs when a gene is inactivated due to its positioning near heterochromatic regions within a chromosome. Discover the latest research on Position Effect Variagation here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Microbicide

Microbicides are products that can be applied to vaginal or rectal mucosal surfaces with the goal of preventing, or at least significantly reducing, the transmission of sexually transmitted infections. Here is the latest research on microbicides.

Related Papers

Protein Science : a Publication of the Protein Society
Brian W Matthews
Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics
Travis MartinM E J Newman
Nucleic Acids Research
Kim HenrickHelen M Berman
© 2021 Meta ULC. All rights reserved