Improving genome-scale metabolic models of incomplete genomes with deep learning
Deciphering microbial metabolism is essential for understanding ecosystem functions. Genome-scale metabolic models (GSMMs) predict metabolic traits from genomic data, but constructing GSMMs for uncultured bacteria is challenging due to incomplete metagenome-assembled genomes, resulting in many gaps. We introduce the deep neural network guided imputation of reactomes (DNNGIOR), which uses AI to improve gap-filling by learning from the presence and absence of metabolic reactions across diverse bacterial genomes. Key factors for prediction accuracy are: (1) reaction frequency across all bacteria and (2) phylogenetic distance of the query to the training genomes. DNNGIOR predictions achieve an average F1 score of 0.85 for reactions present in over 30% of training genomes. DNNGIOR guided gap-filling was 14 times more accurate for draft reconstructions and 2–9 times for curated models than unweighted gap-filling.
Main Authors: | , , , , , |
---|---|
Format: | Article/Letter to editor biblioteca |
Language: | English |
Subjects: | Biocomputational method, Computational Bioinformatics, Genomic analysis, Microbial genomics, |
Online Access: | https://research.wur.nl/en/publications/improving-genome-scale-metabolic-models-of-incomplete-genomes-wit |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
dig-wur-nl-wurpubs-638296 |
---|---|
record_format |
koha |
spelling |
dig-wur-nl-wurpubs-6382962024-12-17 Boer, Meine D. Melkonian, Chrats Zafeiropoulos, Haris Haas, Andreas F. Garza, Daniel R. Dutilh, Bas E. Article/Letter to editor iScience 27 (2024) 12 ISSN: 2589-0042 Improving genome-scale metabolic models of incomplete genomes with deep learning 2024 Deciphering microbial metabolism is essential for understanding ecosystem functions. Genome-scale metabolic models (GSMMs) predict metabolic traits from genomic data, but constructing GSMMs for uncultured bacteria is challenging due to incomplete metagenome-assembled genomes, resulting in many gaps. We introduce the deep neural network guided imputation of reactomes (DNNGIOR), which uses AI to improve gap-filling by learning from the presence and absence of metabolic reactions across diverse bacterial genomes. Key factors for prediction accuracy are: (1) reaction frequency across all bacteria and (2) phylogenetic distance of the query to the training genomes. DNNGIOR predictions achieve an average F1 score of 0.85 for reactions present in over 30% of training genomes. DNNGIOR guided gap-filling was 14 times more accurate for draft reconstructions and 2–9 times for curated models than unweighted gap-filling. en application/pdf https://research.wur.nl/en/publications/improving-genome-scale-metabolic-models-of-incomplete-genomes-wit 10.1016/j.isci.2024.111349 https://edepot.wur.nl/680205 Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics https://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/ Wageningen University & Research |
institution |
WUR NL |
collection |
DSpace |
country |
Países bajos |
countrycode |
NL |
component |
Bibliográfico |
access |
En linea |
databasecode |
dig-wur-nl |
tag |
biblioteca |
region |
Europa del Oeste |
libraryname |
WUR Library Netherlands |
language |
English |
topic |
Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics |
spellingShingle |
Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics Boer, Meine D. Melkonian, Chrats Zafeiropoulos, Haris Haas, Andreas F. Garza, Daniel R. Dutilh, Bas E. Improving genome-scale metabolic models of incomplete genomes with deep learning |
description |
Deciphering microbial metabolism is essential for understanding ecosystem functions. Genome-scale metabolic models (GSMMs) predict metabolic traits from genomic data, but constructing GSMMs for uncultured bacteria is challenging due to incomplete metagenome-assembled genomes, resulting in many gaps. We introduce the deep neural network guided imputation of reactomes (DNNGIOR), which uses AI to improve gap-filling by learning from the presence and absence of metabolic reactions across diverse bacterial genomes. Key factors for prediction accuracy are: (1) reaction frequency across all bacteria and (2) phylogenetic distance of the query to the training genomes. DNNGIOR predictions achieve an average F1 score of 0.85 for reactions present in over 30% of training genomes. DNNGIOR guided gap-filling was 14 times more accurate for draft reconstructions and 2–9 times for curated models than unweighted gap-filling. |
format |
Article/Letter to editor |
topic_facet |
Biocomputational method Computational Bioinformatics Genomic analysis Microbial genomics |
author |
Boer, Meine D. Melkonian, Chrats Zafeiropoulos, Haris Haas, Andreas F. Garza, Daniel R. Dutilh, Bas E. |
author_facet |
Boer, Meine D. Melkonian, Chrats Zafeiropoulos, Haris Haas, Andreas F. Garza, Daniel R. Dutilh, Bas E. |
author_sort |
Boer, Meine D. |
title |
Improving genome-scale metabolic models of incomplete genomes with deep learning |
title_short |
Improving genome-scale metabolic models of incomplete genomes with deep learning |
title_full |
Improving genome-scale metabolic models of incomplete genomes with deep learning |
title_fullStr |
Improving genome-scale metabolic models of incomplete genomes with deep learning |
title_full_unstemmed |
Improving genome-scale metabolic models of incomplete genomes with deep learning |
title_sort |
improving genome-scale metabolic models of incomplete genomes with deep learning |
url |
https://research.wur.nl/en/publications/improving-genome-scale-metabolic-models-of-incomplete-genomes-wit |
work_keys_str_mv |
AT boermeined improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning AT melkonianchrats improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning AT zafeiropoulosharis improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning AT haasandreasf improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning AT garzadanielr improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning AT dutilhbase improvinggenomescalemetabolicmodelsofincompletegenomeswithdeeplearning |
_version_ |
1819140168500641792 |