DOI: 10.1101/213603 | Nov 3, 2017 | Paper

Interoperable and scalable data analysis with microservices: Applications in Metabolomics

BioRxiv : the Preprint Server for Biology
Payam Emami Khoonsari, Ola Spjuth

Abstract

Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed in parallel using the Kubernetes container orchestrator. The access point is a virtual research environment which can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and established workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry studies, one nuclear magnetic resonance spectroscopy study and one fluxomics study, showing that the method scales dynamically with increasing availability of computational resources. We achieved a complete integration of the major software suites resulting in the first turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, multivariate statistics, and metabolite identification. Microservices is a generic methodology th...
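The idea of connecting containerized tools into a workflow and running independent steps in parallel can be illustrated with a minimal Python sketch. This is not the authors' implementation: the step names, the image names (`example/...`), the dependency order, and the `run_step` stand-in are all assumptions made for illustration, and a thread pool takes the place of a container orchestrator such as Kubernetes.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    image: str              # hypothetical container image name
    depends_on: tuple = ()  # names of steps that must finish first


def schedule_waves(steps):
    """Group steps into waves; all steps within a wave can run in parallel."""
    remaining = {s.name: s for s in steps}
    done = set()
    waves = []
    while remaining:
        ready = sorted(
            name for name, s in remaining.items() if set(s.depends_on) <= done
        )
        if not ready:
            raise ValueError("cycle or missing dependency in workflow")
        waves.append(ready)
        done.update(ready)
        for name in ready:
            del remaining[name]
    return waves


def run_step(step):
    # Stand-in for launching the step's container; a real system would
    # hand this to an orchestrator instead of returning a string.
    return f"{step.name} finished ({step.image})"


def run_workflow(steps, max_workers=4):
    """Run the workflow wave by wave, parallelizing inside each wave."""
    by_name = {s.name: s for s in steps}
    log = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for wave in schedule_waves(steps):
            log.extend(pool.map(run_step, [by_name[n] for n in wave]))
    return log


# Assumed ordering: two preprocessing batches, then statistics, then identification.
WORKFLOW = [
    Step("preprocess_batch_1", "example/preprocess"),
    Step("preprocess_batch_2", "example/preprocess"),
    Step("statistics", "example/statistics",
         ("preprocess_batch_1", "preprocess_batch_2")),
    Step("identification", "example/identify", ("statistics",)),
]

if __name__ == "__main__":
    for line in run_workflow(WORKFLOW):
        print(line)
```

In this sketch the two preprocessing batches share no dependencies, so they fall into the same wave and run concurrently, which is the property that lets such a workflow use additional compute resources as they become available.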
