Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources

Although Arabidopsis thaliana is the best studied plant species, the biological role of one third of its proteins is still unknown. We developed a probabilistic protein function prediction method that integrates information from sequences, protein-protein interactions and gene expression. The method was applied to proteins from Arabidopsis thaliana. Evaluation of prediction performance showed that our method has improved performance compared to single source-based prediction approaches and two existing integration approaches. An innovative feature of our method is that enables transfer of functional information between proteins that are not directly associated with each other. We provide novel function predictions for 5,807 proteins. Recent experimental studies confirmed several of the predictions. We highlight these in detail for proteins predicted to be involved in flowering and floral organ development.

Saved in:
Bibliographic Details
Main Authors: Kourmpetis, Y.I.A., van Dijk, A.D.J., van Ham, R.C.H.J., ter Braak, C.J.F.
Format: Article/Letter to editor biblioteca
Language:English
Subjects:algorithm, biology, cell-death, family, flowering time, gene, generalized linear-models, networks, thaliana, transcription factor,
Online Access:https://research.wur.nl/en/publications/genome-wide-computational-function-prediction-of-arabidopsis-thal
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-wur-nl-wurpubs-398519
record_format koha
spelling dig-wur-nl-wurpubs-3985192024-09-23 Kourmpetis, Y.I.A. van Dijk, A.D.J. van Ham, R.C.H.J. ter Braak, C.J.F. Article/Letter to editor Plant Physiology 155 (2011) ISSN: 0032-0889 Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources 2011 Although Arabidopsis thaliana is the best studied plant species, the biological role of one third of its proteins is still unknown. We developed a probabilistic protein function prediction method that integrates information from sequences, protein-protein interactions and gene expression. The method was applied to proteins from Arabidopsis thaliana. Evaluation of prediction performance showed that our method has improved performance compared to single source-based prediction approaches and two existing integration approaches. An innovative feature of our method is that enables transfer of functional information between proteins that are not directly associated with each other. We provide novel function predictions for 5,807 proteins. Recent experimental studies confirmed several of the predictions. We highlight these in detail for proteins predicted to be involved in flowering and floral organ development. en application/pdf https://research.wur.nl/en/publications/genome-wide-computational-function-prediction-of-arabidopsis-thal 10.1104/pp.110.162164 https://edepot.wur.nl/157207 algorithm biology cell-death family flowering time gene generalized linear-models networks thaliana transcription factor Wageningen University & Research
institution WUR NL
collection DSpace
country Países bajos
countrycode NL
component Bibliográfico
access En linea
databasecode dig-wur-nl
tag biblioteca
region Europa del Oeste
libraryname WUR Library Netherlands
language English
topic algorithm
biology
cell-death
family
flowering time
gene
generalized linear-models
networks
thaliana
transcription factor
algorithm
biology
cell-death
family
flowering time
gene
generalized linear-models
networks
thaliana
transcription factor
spellingShingle algorithm
biology
cell-death
family
flowering time
gene
generalized linear-models
networks
thaliana
transcription factor
algorithm
biology
cell-death
family
flowering time
gene
generalized linear-models
networks
thaliana
transcription factor
Kourmpetis, Y.I.A.
van Dijk, A.D.J.
van Ham, R.C.H.J.
ter Braak, C.J.F.
Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources
description Although Arabidopsis thaliana is the best studied plant species, the biological role of one third of its proteins is still unknown. We developed a probabilistic protein function prediction method that integrates information from sequences, protein-protein interactions and gene expression. The method was applied to proteins from Arabidopsis thaliana. Evaluation of prediction performance showed that our method has improved performance compared to single source-based prediction approaches and two existing integration approaches. An innovative feature of our method is that enables transfer of functional information between proteins that are not directly associated with each other. We provide novel function predictions for 5,807 proteins. Recent experimental studies confirmed several of the predictions. We highlight these in detail for proteins predicted to be involved in flowering and floral organ development.
format Article/Letter to editor
topic_facet algorithm
biology
cell-death
family
flowering time
gene
generalized linear-models
networks
thaliana
transcription factor
author Kourmpetis, Y.I.A.
van Dijk, A.D.J.
van Ham, R.C.H.J.
ter Braak, C.J.F.
author_facet Kourmpetis, Y.I.A.
van Dijk, A.D.J.
van Ham, R.C.H.J.
ter Braak, C.J.F.
author_sort Kourmpetis, Y.I.A.
title Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources
title_short Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources
title_full Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources
title_fullStr Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources
title_full_unstemmed Genome-wide computational function prediction of Arabidopsis thaliana proteins by integration of multiple data sources
title_sort genome-wide computational function prediction of arabidopsis thaliana proteins by integration of multiple data sources
url https://research.wur.nl/en/publications/genome-wide-computational-function-prediction-of-arabidopsis-thal
work_keys_str_mv AT kourmpetisyia genomewidecomputationalfunctionpredictionofarabidopsisthalianaproteinsbyintegrationofmultipledatasources
AT vandijkadj genomewidecomputationalfunctionpredictionofarabidopsisthalianaproteinsbyintegrationofmultipledatasources
AT vanhamrchj genomewidecomputationalfunctionpredictionofarabidopsisthalianaproteinsbyintegrationofmultipledatasources
AT terbraakcjf genomewidecomputationalfunctionpredictionofarabidopsisthalianaproteinsbyintegrationofmultipledatasources
_version_ 1813027947361075200