Improving genome-scale metabolic models of incomplete genomes with deep learning

Deciphering microbial metabolism is essential for understanding ecosystem functions. Genome-scale metabolic models (GSMMs) predict metabolic traits from genomic data, but constructing GSMMs for uncultured bacteria is challenging due to incomplete metagenome-assembled genomes, resulting in many gaps. We introduce the deep neural network guided imputation of reactomes (DNNGIOR), which uses AI to improve gap-filling by learning from the presence and absence of metabolic reactions across diverse bacterial genomes. Key factors for prediction accuracy are: (1) reaction frequency across all bacteria and (2) phylogenetic distance of the query to the training genomes. DNNGIOR predictions achieve an average F1 score of 0.85 for reactions present in over 30% of training genomes. DNNGIOR guided gap-filling was 14 times more accurate for draft reconstructions and 2–9 times for curated models than unweighted gap-filling.

Saved in:
Bibliographic Details
Main Authors: Boer, Meine D., Melkonian, Chrats, Zafeiropoulos, Haris, Haas, Andreas F., Garza, Daniel R., Dutilh, Bas E.
Format: Article/Letter to editor biblioteca
Language:English
Subjects:Biocomputational method, Computational Bioinformatics, Genomic analysis, Microbial genomics,
Online Access:https://research.wur.nl/en/publications/improving-genome-scale-metabolic-models-of-incomplete-genomes-wit
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-wur-nl-wurpubs-638296
record_format koha
spelling dig-wur-nl-wurpubs-6382962024-12-17 Boer, Meine D. Melkonian, Chrats Zafeiropoulos, Haris Haas, Andreas F. Garza, Daniel R. Dutilh, Bas E. Article/Letter to editor iScience 27 (2024) 12 ISSN: 2589-0042 Improving genome-scale metabolic models of incomplete genomes with deep learning 2024 Deciphering microbial metabolism is essential for understanding ecosystem functions. Genome-scale metabolic models (GSMMs) predict metabolic traits from genomic data, but constructing GSMMs for uncultured bacteria is challenging due to incomplete metagenome-assembled genomes, resulting in many gaps. We introduce the deep neural network guided imputation of reactomes (DNNGIOR), which uses AI to improve gap-filling by learning from the presence and absence of metabolic reactions across diverse bacterial genomes. Key factors for prediction accuracy are: (1) reaction frequency across all bacteria and (2) phylogenetic distance of the query to the training genomes. DNNGIOR predictions achieve an average F1 score of 0.85 for reactions present in over 30% of training genomes. DNNGIOR guided gap-filling was 14 times more accurate for draft reconstructions and 2–9 times for curated models than unweighted gap-filling. en application/pdf https://research.wur.nl/en/publications/improving-genome-scale-metabolic-models-of-incomplete-genomes-wit 10.1016/j.isci.2024.111349 https://edepot.wur.nl/680205 Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics https://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/ Wageningen University & Research
institution WUR NL
collection DSpace
country Países bajos
countrycode NL
component Bibliográfico
access En linea
databasecode dig-wur-nl
tag biblioteca
region Europa del Oeste
libraryname WUR Library Netherlands
language English
topic Biocomputational method
Computational Bioinformatics
Genomic analysis
Microbial genomics
Biocomputational method
Computational Bioinformatics
Genomic analysis
Microbial genomics
spellingShingle Biocomputational method
Computational Bioinformatics
Genomic analysis
Microbial genomics
Biocomputational method
Computational Bioinformatics
Genomic analysis
Microbial genomics
Boer, Meine D.
Melkonian, Chrats
Zafeiropoulos, Haris
Haas, Andreas F.
Garza, Daniel R.
Dutilh, Bas E.
Improving genome-scale metabolic models of incomplete genomes with deep learning
description Deciphering microbial metabolism is essential for understanding ecosystem functions. Genome-scale metabolic models (GSMMs) predict metabolic traits from genomic data, but constructing GSMMs for uncultured bacteria is challenging due to incomplete metagenome-assembled genomes, resulting in many gaps. We introduce the deep neural network guided imputation of reactomes (DNNGIOR), which uses AI to improve gap-filling by learning from the presence and absence of metabolic reactions across diverse bacterial genomes. Key factors for prediction accuracy are: (1) reaction frequency across all bacteria and (2) phylogenetic distance of the query to the training genomes. DNNGIOR predictions achieve an average F1 score of 0.85 for reactions present in over 30% of training genomes. DNNGIOR guided gap-filling was 14 times more accurate for draft reconstructions and 2–9 times for curated models than unweighted gap-filling.
format Article/Letter to editor
topic_facet Biocomputational method
Computational Bioinformatics
Genomic analysis
Microbial genomics
author Boer, Meine D.
Melkonian, Chrats
Zafeiropoulos, Haris
Haas, Andreas F.
Garza, Daniel R.
Dutilh, Bas E.
author_facet Boer, Meine D.
Melkonian, Chrats
Zafeiropoulos, Haris
Haas, Andreas F.
Garza, Daniel R.
Dutilh, Bas E.
author_sort Boer, Meine D.
title Improving genome-scale metabolic models of incomplete genomes with deep learning
title_short Improving genome-scale metabolic models of incomplete genomes with deep learning
title_full Improving genome-scale metabolic models of incomplete genomes with deep learning
title_fullStr Improving genome-scale metabolic models of incomplete genomes with deep learning
title_full_unstemmed Improving genome-scale metabolic models of incomplete genomes with deep learning
title_sort improving genome-scale metabolic models of incomplete genomes with deep learning
url https://research.wur.nl/en/publications/improving-genome-scale-metabolic-models-of-incomplete-genomes-wit
work_keys_str_mv AT boermeined improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning
AT melkonianchrats improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning
AT zafeiropoulosharis improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning
AT haasandreasf improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning
AT garzadanielr improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning
AT dutilhbase improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning
_version_ 1819140168500641792