Relative power and sample size analysis on gene expression profiling data

Background - With the increasing number of expression profiling technologies, researchers today are confronted with choosing the technology that has sufficient power with minimal sample size, in order to reduce cost and time. These depend on data variability, partly determined by sample type, preparation and processing. Objective measures that help experimental design, given own pilot data, are thus fundamental. Results - Relative power and sample size analysis were performed on two distinct data sets. The first set consisted of Affymetrix array data derived from a nutrigenomics experiment in which weak, intermediate and strong PPARa agonists were administered to wild-type and PPARa-null mice. Our analysis confirms the hierarchy of PPARa-activating compounds previously reported and the general idea that larger effect sizes positively contribute to the average power of the experiment. A simulation experiment was performed that mimicked the effect sizes seen in the first data set. The relative power was predicted but the estimates were slightly conservative. The second, more challenging, data set describes a microarray platform comparison study using hippocampal dC-doublecortin-like kinase transgenic mice that were compared to wild-type mice, which was combined with results from Solexa/Illumina deep sequencing runs. As expected, the choice of technology greatly influences the performance of the experiment. Solexa/Illumina deep sequencing has the highest overall power followed by the microarray platforms Agilent and Affymetrix. Interestingly, Solexa/Illumina deep sequencing displays comparable power across all intensity ranges, in contrast with microarray platforms that have decreased power in the low intensity range due to background noise. This means that deep sequencing technology is especially more powerful in detecting differences in the low intensity range, compared to microarray platforms. Conclusion - Power and sample size analysis based on pilot data give valuable information on the performance of the experiment and can thereby guide further decisions on experimental design. Solexa/Illumina deep sequencing is the technology of choice if interest lies in genes expressed in the low-intensity range. Researchers can get guidance on experimental design using our approach on their own pilot data implemented as a BioConductor package, SSPA http://bioconductor.org/packages/release/bioc/html/SSPA.html

Saved in:
Bibliographic Details
Main Authors: van Iterson, M., 't Hoen, P.A.C., Pedotti, P., Hooiveld, G.J.E.J., den Dunnen, J.T., van Ommen, G.J.B., Boer, J.M., Menezes, R.X.
Format: Article/Letter to editor biblioteca
Language:English
Subjects:control maqc project, dna microarray, false discovery rate, microarray data,
Online Access:https://research.wur.nl/en/publications/relative-power-and-sample-size-analysis-on-gene-expression-profil
Tags: Add Tag
No Tags, Be the first to tag this record!
id dig-wur-nl-wurpubs-385203
record_format koha
spelling dig-wur-nl-wurpubs-3852032024-08-06 van Iterson, M. 't Hoen, P.A.C. Pedotti, P. Hooiveld, G.J.E.J. den Dunnen, J.T. van Ommen, G.J.B. Boer, J.M. Menezes, R.X. Article/Letter to editor BMC Genomics 10 (2009) ISSN: 1471-2164 Relative power and sample size analysis on gene expression profiling data 2009 Background - With the increasing number of expression profiling technologies, researchers today are confronted with choosing the technology that has sufficient power with minimal sample size, in order to reduce cost and time. These depend on data variability, partly determined by sample type, preparation and processing. Objective measures that help experimental design, given own pilot data, are thus fundamental. Results - Relative power and sample size analysis were performed on two distinct data sets. The first set consisted of Affymetrix array data derived from a nutrigenomics experiment in which weak, intermediate and strong PPARa agonists were administered to wild-type and PPARa-null mice. Our analysis confirms the hierarchy of PPARa-activating compounds previously reported and the general idea that larger effect sizes positively contribute to the average power of the experiment. A simulation experiment was performed that mimicked the effect sizes seen in the first data set. The relative power was predicted but the estimates were slightly conservative. The second, more challenging, data set describes a microarray platform comparison study using hippocampal dC-doublecortin-like kinase transgenic mice that were compared to wild-type mice, which was combined with results from Solexa/Illumina deep sequencing runs. As expected, the choice of technology greatly influences the performance of the experiment. Solexa/Illumina deep sequencing has the highest overall power followed by the microarray platforms Agilent and Affymetrix. Interestingly, Solexa/Illumina deep sequencing displays comparable power across all intensity ranges, in contrast with microarray platforms that have decreased power in the low intensity range due to background noise. This means that deep sequencing technology is especially more powerful in detecting differences in the low intensity range, compared to microarray platforms. Conclusion - Power and sample size analysis based on pilot data give valuable information on the performance of the experiment and can thereby guide further decisions on experimental design. Solexa/Illumina deep sequencing is the technology of choice if interest lies in genes expressed in the low-intensity range. Researchers can get guidance on experimental design using our approach on their own pilot data implemented as a BioConductor package, SSPA http://bioconductor.org/packages/release/bioc/html/SSPA.html en application/pdf https://research.wur.nl/en/publications/relative-power-and-sample-size-analysis-on-gene-expression-profil 10.1186/1471-2164-10-439 https://edepot.wur.nl/16239 control maqc project dna microarray false discovery rate microarray data Wageningen University & Research
institution WUR NL
collection DSpace
country Países bajos
countrycode NL
component Bibliográfico
access En linea
databasecode dig-wur-nl
tag biblioteca
region Europa del Oeste
libraryname WUR Library Netherlands
language English
topic control maqc project
dna microarray
false discovery rate
microarray data
control maqc project
dna microarray
false discovery rate
microarray data
spellingShingle control maqc project
dna microarray
false discovery rate
microarray data
control maqc project
dna microarray
false discovery rate
microarray data
van Iterson, M.
't Hoen, P.A.C.
Pedotti, P.
Hooiveld, G.J.E.J.
den Dunnen, J.T.
van Ommen, G.J.B.
Boer, J.M.
Menezes, R.X.
Relative power and sample size analysis on gene expression profiling data
description Background - With the increasing number of expression profiling technologies, researchers today are confronted with choosing the technology that has sufficient power with minimal sample size, in order to reduce cost and time. These depend on data variability, partly determined by sample type, preparation and processing. Objective measures that help experimental design, given own pilot data, are thus fundamental. Results - Relative power and sample size analysis were performed on two distinct data sets. The first set consisted of Affymetrix array data derived from a nutrigenomics experiment in which weak, intermediate and strong PPARa agonists were administered to wild-type and PPARa-null mice. Our analysis confirms the hierarchy of PPARa-activating compounds previously reported and the general idea that larger effect sizes positively contribute to the average power of the experiment. A simulation experiment was performed that mimicked the effect sizes seen in the first data set. The relative power was predicted but the estimates were slightly conservative. The second, more challenging, data set describes a microarray platform comparison study using hippocampal dC-doublecortin-like kinase transgenic mice that were compared to wild-type mice, which was combined with results from Solexa/Illumina deep sequencing runs. As expected, the choice of technology greatly influences the performance of the experiment. Solexa/Illumina deep sequencing has the highest overall power followed by the microarray platforms Agilent and Affymetrix. Interestingly, Solexa/Illumina deep sequencing displays comparable power across all intensity ranges, in contrast with microarray platforms that have decreased power in the low intensity range due to background noise. This means that deep sequencing technology is especially more powerful in detecting differences in the low intensity range, compared to microarray platforms. Conclusion - Power and sample size analysis based on pilot data give valuable information on the performance of the experiment and can thereby guide further decisions on experimental design. Solexa/Illumina deep sequencing is the technology of choice if interest lies in genes expressed in the low-intensity range. Researchers can get guidance on experimental design using our approach on their own pilot data implemented as a BioConductor package, SSPA http://bioconductor.org/packages/release/bioc/html/SSPA.html
format Article/Letter to editor
topic_facet control maqc project
dna microarray
false discovery rate
microarray data
author van Iterson, M.
't Hoen, P.A.C.
Pedotti, P.
Hooiveld, G.J.E.J.
den Dunnen, J.T.
van Ommen, G.J.B.
Boer, J.M.
Menezes, R.X.
author_facet van Iterson, M.
't Hoen, P.A.C.
Pedotti, P.
Hooiveld, G.J.E.J.
den Dunnen, J.T.
van Ommen, G.J.B.
Boer, J.M.
Menezes, R.X.
author_sort van Iterson, M.
title Relative power and sample size analysis on gene expression profiling data
title_short Relative power and sample size analysis on gene expression profiling data
title_full Relative power and sample size analysis on gene expression profiling data
title_fullStr Relative power and sample size analysis on gene expression profiling data
title_full_unstemmed Relative power and sample size analysis on gene expression profiling data
title_sort relative power and sample size analysis on gene expression profiling data
url https://research.wur.nl/en/publications/relative-power-and-sample-size-analysis-on-gene-expression-profil
work_keys_str_mv AT vanitersonm relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT thoenpac relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT pedottip relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT hooiveldgjej relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT dendunnenjt relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT vanommengjb relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT boerjm relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
AT menezesrx relativepowerandsamplesizeanalysisongeneexpressionprofilingdata
_version_ 1813205415772553216