Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST

SUMMARY: Computational metabolomics workflows have revolutionized the untargeted metabolomics field. However, the organization and prioritization of metabolite features remains a laborious process. Organizing metabolomics data is often done through mass fragmentation-based spectral similarity grouping, resulting in feature sets that also represent an intuitive and scientifically meaningful first stage of analysis in untargeted metabolomics. Exploiting such feature sets, feature-set testing has emerged as an approach that is widely used in genomics and targeted metabolomics pathway enrichment analyses. It allows for formally combining groupings with statistical testing into more meaningful pathway enrichment conclusions. Here, we present msFeaST (mass spectral Feature Set Testing), a feature-set testing and visualization workflow for LC-MS/MS untargeted metabolomics data. Feature-set testing involves statistically assessing differential abundance patterns for groups of features across experimental conditions. We developed msFeaST to make use of spectral similarity-based feature groupings generated using k-medoids clustering, where the resulting clusters serve as a proxy for grouping structurally similar features with potential biosynthesis pathway relationships. Spectral clustering done in this way allows for feature group-wise statistical testing using the globaltest package, which provides high power to detect small concordant effects via joint modeling and reduced multiplicity adjustment penalties. Hence, msFeaST provides interactive integration of the semi-quantitative experimental information with mass-spectral structural similarity information, enhancing the prioritization of features and feature sets during exploratory data analysis. AVAILABILITY AND IMPLEMENTATION: The msFeaST workflow is freely available through https://github.com/kevinmildau/msFeaST and built to work on MacOS and Linux systems.

Saved in:
Bibliographic Details
Main Authors: Mildau, Kevin, Büschl, Christoph, Zanghellini, Jürgen, van der Hooft, Justin J.J.
Format: Article/Letter to editor biblioteca
Language:English
Subjects:Life Science,
Online Access:https://research.wur.nl/en/publications/combined-lc-msms-feature-grouping-statistical-prioritization-and-
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-wur-nl-wurpubs-635996
record_format koha
spelling dig-wur-nl-wurpubs-6359962025-01-14 Mildau, Kevin Büschl, Christoph Zanghellini, Jürgen van der Hooft, Justin J.J. Article/Letter to editor Bioinformatics (Oxford, England) 40 (2024) 10 ISSN: 1367-4803 Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST 2024 SUMMARY: Computational metabolomics workflows have revolutionized the untargeted metabolomics field. However, the organization and prioritization of metabolite features remains a laborious process. Organizing metabolomics data is often done through mass fragmentation-based spectral similarity grouping, resulting in feature sets that also represent an intuitive and scientifically meaningful first stage of analysis in untargeted metabolomics. Exploiting such feature sets, feature-set testing has emerged as an approach that is widely used in genomics and targeted metabolomics pathway enrichment analyses. It allows for formally combining groupings with statistical testing into more meaningful pathway enrichment conclusions. Here, we present msFeaST (mass spectral Feature Set Testing), a feature-set testing and visualization workflow for LC-MS/MS untargeted metabolomics data. Feature-set testing involves statistically assessing differential abundance patterns for groups of features across experimental conditions. We developed msFeaST to make use of spectral similarity-based feature groupings generated using k-medoids clustering, where the resulting clusters serve as a proxy for grouping structurally similar features with potential biosynthesis pathway relationships. Spectral clustering done in this way allows for feature group-wise statistical testing using the globaltest package, which provides high power to detect small concordant effects via joint modeling and reduced multiplicity adjustment penalties. Hence, msFeaST provides interactive integration of the semi-quantitative experimental information with mass-spectral structural similarity information, enhancing the prioritization of features and feature sets during exploratory data analysis. AVAILABILITY AND IMPLEMENTATION: The msFeaST workflow is freely available through https://github.com/kevinmildau/msFeaST and built to work on MacOS and Linux systems. en application/pdf https://research.wur.nl/en/publications/combined-lc-msms-feature-grouping-statistical-prioritization-and- 10.1093/bioinformatics/btae584 https://edepot.wur.nl/676248 Life Science https://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/ Wageningen University & Research
institution WUR NL
collection DSpace
country Países bajos
countrycode NL
component Bibliográfico
access En linea
databasecode dig-wur-nl
tag biblioteca
region Europa del Oeste
libraryname WUR Library Netherlands
language English
topic Life Science
Life Science
spellingShingle Life Science
Life Science
Mildau, Kevin
Büschl, Christoph
Zanghellini, Jürgen
van der Hooft, Justin J.J.
Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST
description SUMMARY: Computational metabolomics workflows have revolutionized the untargeted metabolomics field. However, the organization and prioritization of metabolite features remains a laborious process. Organizing metabolomics data is often done through mass fragmentation-based spectral similarity grouping, resulting in feature sets that also represent an intuitive and scientifically meaningful first stage of analysis in untargeted metabolomics. Exploiting such feature sets, feature-set testing has emerged as an approach that is widely used in genomics and targeted metabolomics pathway enrichment analyses. It allows for formally combining groupings with statistical testing into more meaningful pathway enrichment conclusions. Here, we present msFeaST (mass spectral Feature Set Testing), a feature-set testing and visualization workflow for LC-MS/MS untargeted metabolomics data. Feature-set testing involves statistically assessing differential abundance patterns for groups of features across experimental conditions. We developed msFeaST to make use of spectral similarity-based feature groupings generated using k-medoids clustering, where the resulting clusters serve as a proxy for grouping structurally similar features with potential biosynthesis pathway relationships. Spectral clustering done in this way allows for feature group-wise statistical testing using the globaltest package, which provides high power to detect small concordant effects via joint modeling and reduced multiplicity adjustment penalties. Hence, msFeaST provides interactive integration of the semi-quantitative experimental information with mass-spectral structural similarity information, enhancing the prioritization of features and feature sets during exploratory data analysis. AVAILABILITY AND IMPLEMENTATION: The msFeaST workflow is freely available through https://github.com/kevinmildau/msFeaST and built to work on MacOS and Linux systems.
format Article/Letter to editor
topic_facet Life Science
author Mildau, Kevin
Büschl, Christoph
Zanghellini, Jürgen
van der Hooft, Justin J.J.
author_facet Mildau, Kevin
Büschl, Christoph
Zanghellini, Jürgen
van der Hooft, Justin J.J.
author_sort Mildau, Kevin
title Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST
title_short Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST
title_full Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST
title_fullStr Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST
title_full_unstemmed Combined LC-MS/MS feature grouping, statistical prioritization, and interactive networking in msFeaST
title_sort combined lc-ms/ms feature grouping, statistical prioritization, and interactive networking in msfeast
url https://research.wur.nl/en/publications/combined-lc-msms-feature-grouping-statistical-prioritization-and-
work_keys_str_mv AT mildaukevin combinedlcmsmsfeaturegroupingstatisticalprioritizationandinteractivenetworkinginmsfeast
AT buschlchristoph combinedlcmsmsfeaturegroupingstatisticalprioritizationandinteractivenetworkinginmsfeast
AT zanghellinijurgen combinedlcmsmsfeaturegroupingstatisticalprioritizationandinteractivenetworkinginmsfeast
AT vanderhooftjustinjj combinedlcmsmsfeaturegroupingstatisticalprioritizationandinteractivenetworkinginmsfeast
_version_ 1822262588051816448