Exploration, visualization, and preprocessing of high-dimensional data

Methods in Molecular Biology
Zhijin Wu, Zhiqiang Wu

Abstract

Rapid advances in biotechnology have given rise to a variety of high-dimensional data. Many of these data, including DNA microarray data, mass spectrometry protein data, and high-throughput screening (HTS) assay data, are generated by complex experimental procedures that involve multiple steps such as sample extraction, purification and/or amplification, labeling, fragmentation, and detection. As a result, the quantity of interest is not observed directly, and a number of preprocessing procedures are necessary to convert the raw data into a biologically relevant format. This also makes exploratory data analysis and visualization essential steps for detecting possible defects, anomalies, or distortions in the data, testing underlying assumptions, and thus ensuring data quality. The characteristics of the data structure revealed in exploratory analysis often motivate the choice of preprocessing procedures that produce data suitable for downstream analysis. In this chapter we review common techniques for exploring and visualizing high-dimensional data and introduce the basic preprocessing procedures.

