A hierarchical approach to removal of unwanted variation for large-scale metabolomics data
Details
Publication Year 2021-08-17, Volume 12, Issue #1, Page 4992
Journal Title
Nature Communications
Abstract
Liquid chromatography-mass spectrometry-based metabolomics studies are increasingly applied to large population cohorts, which run for several weeks or even years in data acquisition. This inevitably introduces unwanted intra- and inter-batch variations over time that can overshadow true biological signals and thus hinder potential biological discoveries. To date, normalisation approaches have struggled to mitigate the variability introduced by technical factors whilst preserving biological variance, especially for protracted acquisitions. Here, we propose a study design framework with an arrangement for embedding biological sample replicates to quantify variance within and between batches and a workflow that uses these replicates to remove unwanted variation in a hierarchical manner (hRUV). We use this design to produce a dataset of more than 1000 human plasma samples run over an extended period of time. We demonstrate significant improvement of hRUV over existing methods in preserving biological signals whilst removing unwanted variation for large scale metabolomics studies. Our tools not only provide a strategy for large scale data normalisation, but also provides guidance on the design strategy for large omics studies.
Publisher
NPG
WEHI Research Division(s)
Bioinformatics
PubMed ID
34404777
Open Access at Publisher's Site
https://doi.org/10.1038/s41467-021-25210-5
Rights Notice
Refer to copyright notice on published article.


Creation Date: 2021-09-17 11:04:09
Last Modified: 2021-09-17 11:11:52
An error has occurred. This application may no longer respond until reloaded. Reload 🗙