Title: | Power and Sample Size Calculation for eQTL Analysis |
---|---|
Description: | Power and sample size calculation for eQTL analysis based on ANOVA or simple linear regression. It can also calculate power/sample size for testing the association of a SNP to a continuous type phenotype. |
Authors: | Xianjun Dong [aut, ctb], Tzuu-Wang Chang [aut, ctb], Scott T. Weiss [aut, ctb], Weiliang Qiu [aut, cre] |
Maintainer: | Weiliang Qiu <[email protected]> |
License: | GPL (>=2) |
Version: | 0.1.3 |
Built: | 2024-11-09 02:35:40 UTC |
Source: | https://github.com/sterding/powereqtl |
Calculation of minimum detectable effect size () for eQTL analysis that tests if a SNP is associated to a gene probe by using un-balanced one-way ANOVA.
minEffectEQTL.ANOVA(MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, mypower = 0.8, verbose = TRUE)
minEffectEQTL.ANOVA(MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, mypower = 0.8, verbose = TRUE)
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
myntotal |
integer. Number of subjects. |
mypower |
Desired power for the eQTL analysis. |
verbose |
logic. indicating if intermediate results should be output. |
The assumption of the ANOVA approach is that the association of a SNP to a gene probe is tested by using un-balanced one-way ANOVA (e.g. Lonsdale et al. 2013). According to SAS online document https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_power_a0000000982.htm, the power calculation formula is
where is the number of groups of subjects,
is the total number
of subjects,
is the
-th percentile of central F distribution with degrees of freedoms
and
, and
is the non-central F distribution
with degrees of freedoms
and
and non-central parameter (ncp)
. The ncp
is equal to
where is the mean gene expression level
for the
-th group of subjects,
is the weight for the
-th group of subjects,
is the variance of the random errors in ANOVA (assuming each group has equal variance), and
is the weighted mean gene expression level
The weights are the sample proportions for the 3 groups of subjects. Hence,
.
We assume that ,
where
,
, and
are the
mean gene expression level for mutation homozygotes, heterozygotes,
and wild-type homozygotes, respectively.
Denote as the minor allele frequency (MAF) of a SNP. Under Hardy-Weinberg equilibrium, we have genotype frequencies:
,
,
and
, where
,
, and
are genotype
for mutation homozygotes, heterozygotes, and wild-type homozygotes, respectively,
. Then ncp can be simplified as
minimum detectable effect size .
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Lonsdale J and Thomas J, et al. The Genotype-Tissue Expression (GTEx) project. Nature Genetics, 45:580-585, 2013.
powerEQTL.ANOVA, powerEQTL.ANOVA2, ssEQTL.ANOVA, ssEQTL.ANOVA2
minEffectEQTL.ANOVA( MAF = 0.1, typeI = 0.05, nTests = 200000, myntotal = 234, mypower = 0.8, verbose = TRUE)
minEffectEQTL.ANOVA( MAF = 0.1, typeI = 0.05, nTests = 200000, myntotal = 234, mypower = 0.8, verbose = TRUE)
Calculation of minimum detectable minor allele frequency (MAF) for eQTL analysis that tests if a SNP is associated to a gene probe by using un-balanced one-way ANOVA.
minMAFeQTL.ANOVA(effsize, typeI = 0.05, nTests = 200000, myntotal = 200, mypower = 0.8, verbose = TRUE)
minMAFeQTL.ANOVA(effsize, typeI = 0.05, nTests = 200000, myntotal = 200, mypower = 0.8, verbose = TRUE)
effsize |
Effect size |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
myntotal |
integer. Number of subjects. |
mypower |
Desired power for the eQTL analysis. |
verbose |
logic. indicating if intermediate results should be output. |
The assumption of the ANOVA approach is that the association of a SNP to a gene probe is tested by using un-balanced one-way ANOVA (e.g. Lonsdale et al. 2013). According to SAS online document https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_power_a0000000982.htm, the power calculation formula is
where is the number of groups of subjects,
is the total number
of subjects,
is the
-th percentile of central F distribution with degrees of freedoms
and
, and
is the non-central F distribution
with degrees of freedoms
and
and non-central parameter (ncp)
. The ncp
is equal to
where is the mean gene expression level
for the
-th group of subjects,
is the weight for the
-th group of subjects,
is the variance of the random errors in ANOVA (assuming each group has equal variance), and
is the weighted mean gene expression level
The weights are the sample proportions for the 3 groups of subjects. Hence,
.
We assume that ,
where
,
, and
are the
mean gene expression level for mutation homozygotes, heterozygotes,
and wild-type homozygotes, respectively.
Denote as the minor allele frequency (MAF) of a SNP. Under Hardy-Weinberg equilibrium, we have genotype frequencies:
,
,
and
, where
,
, and
are genotype
for mutation homozygotes, heterozygotes, and wild-type homozygotes, respectively,
. Then ncp can be simplified as
minimum detectable MAF.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Lonsdale J and Thomas J, et al. The Genotype-Tissue Expression (GTEx) project. Nature Genetics, 45:580-585, 2013.
powerEQTL.ANOVA, powerEQTL.ANOVA2, ssEQTL.ANOVA, ssEQTL.ANOVA2
minMAFeQTL.ANOVA(effsize = 1, typeI = 0.05, nTests = 200000, myntotal = 234, mypower = 0.8, verbose = TRUE)
minMAFeQTL.ANOVA(effsize = 1, typeI = 0.05, nTests = 200000, myntotal = 234, mypower = 0.8, verbose = TRUE)
Minimum detectable minor allele frequency (MAF) calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using simple linear regression.
minMAFeQTL.SLR(slope, typeI = 0.05, nTests = 200000, myntotal = 200, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
minMAFeQTL.SLR(slope, typeI = 0.05, nTests = 200000, myntotal = 200, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
slope |
Slope of the simple linear regression. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
myntotal |
integer. Number of subjects. |
mypower |
Desired power for the eQTL analysis. |
mystddev |
Standard deviation of the random error term |
verbose |
logic. indicating if intermediate results should be output. |
To test if a SNP is associated with a gene probe, we use the simple linear regression
where is the gene expression level of the
-th subject,
is the genotype of the
-th subject, and
is the random error term. Additive coding for genotype is used. To test if the SNP is associated with the gene probe, we test the null hypothesis
.
Denote as the minor allele frequency (MAF) of the SNP. Under Hardy-Weinberg equilibrium, we can calculate the variance of genotype of the SNP:
,
where
is the variance of the
predictor (i.e. the SNP)
.
We then can use Dupont and Plummer's (1998) power/sample size calculation formula to calculate the minimum detectable slope, adjusting for multiple testing.
The estimated minimum detectable MAF.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Dupont, W.D. and Plummer, W.D.. Power and Sample Size Calculations for Studies Involving Linear Regression. Controlled Clinical Trials. 1998;19:589-601.
minMAFeQTL.SLR(slope = 0.1299513, typeI = 0.05, nTests = 200000, myntotal = 176, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
minMAFeQTL.SLR(slope = 0.1299513, typeI = 0.05, nTests = 200000, myntotal = 176, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
Minimum detectable slope calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using simple linear regression.
minSlopeEQTL.SLR( MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
minSlopeEQTL.SLR( MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
myntotal |
integer. Number of subjects. |
mypower |
Desired power for the eQTL analysis. |
mystddev |
Standard deviation of the random error term |
verbose |
logic. indicating if intermediate results should be output. |
To test if a SNP is associated with a gene probe, we use the simple linear regression
where is the gene expression level of the
-th subject,
is the genotype of the
-th subject, and
is the random error term. Additive coding for genotype is used. To test if the SNP is associated with the gene probe, we test the null hypothesis
.
Denote as the minor allele frequency (MAF) of the SNP. Under Hardy-Weinberg equilibrium, we can calculate the variance of genotype of the SNP:
,
where
is the variance of the
predictor (i.e. the SNP)
.
We then can use Dupont and Plummer's (1998) power/sample size calculation formula to calculate the minimum detectable slope, adjusting for multiple testing.
The estimated minimum detectable slope.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Dupont, W.D. and Plummer, W.D.. Power and Sample Size Calculations for Studies Involving Linear Regression. Controlled Clinical Trials. 1998;19:589-601.
minSlopeEQTL.SLR( MAF = 0.1, typeI = 0.05, nTests = 2e+05, myntotal = 176, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
minSlopeEQTL.SLR( MAF = 0.1, typeI = 0.05, nTests = 2e+05, myntotal = 176, mypower = 0.8, mystddev = 0.13, verbose = TRUE)
Power calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using un-balanced one-way ANOVA.
powerEQTL.ANOVA(MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, mystddev = 0.13, deltaVec = c(0.13, 0.13), verbose = TRUE)
powerEQTL.ANOVA(MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, mystddev = 0.13, deltaVec = c(0.13, 0.13), verbose = TRUE)
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
myntotal |
integer. Number of subjects. |
mystddev |
Standard deviation of gene expression levels in one group of subjects. Assume all 3 groups of subjects (mutation homozygote, heterozygote, wild-type homozygote) have the same standard deviation of gene expression levels. |
deltaVec |
A vector having 2 elements. The first element is equal to
|
verbose |
logic. indicating if intermediate results should be output. |
The assumption of the ANOVA approach is that the association of a SNP to a gene probe is tested by using un-balanced one-way ANOVA (e.g. Lonsdale et al. 2013). According to SAS online document https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_power_a0000000982.htm, the power calculation formula is
where is the number of groups of subjects,
is the total number
of subjects,
is the
-th percentile of central F distribution with degrees of freedoms
and
, and
is the non-central F distribution
with degrees of freedoms
and
and non-central parameter (ncp)
. The ncp
is equal to
where is the mean gene expression level
for the
-th group of subjects,
is the weight for the
-th group of subjects,
is the variance of the random errors in ANOVA (assuming each group has equal variance), and
is the weighted mean gene expression level
The weights are the sample proportions for the 3 groups of subjects. Hence,
.
power of the test after Bonferroni correction for multiple testing.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Lonsdale J and Thomas J, et al. The Genotype-Tissue Expression (GTEx) project. Nature Genetics, 45:580-585, 2013.
minEffectEQTL.ANOVA, powerEQTL.ANOVA2, ssEQTL.ANOVA, ssEQTL.ANOVA2
powerEQTL.ANOVA( MAF = 0.1, typeI = 0.05, nTests = 200000, myntotal = 234, mystddev = 0.13, deltaVec = c(0.13, 0.13))
powerEQTL.ANOVA( MAF = 0.1, typeI = 0.05, nTests = 200000, myntotal = 234, mystddev = 0.13, deltaVec = c(0.13, 0.13))
Power calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using un-balanced one-way ANOVA (assuming Hardy-Weinberg equilibrium).
powerEQTL.ANOVA2(effsize, MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, verbose = TRUE)
powerEQTL.ANOVA2(effsize, MAF, typeI = 0.05, nTests = 2e+05, myntotal = 200, verbose = TRUE)
effsize |
effect size |
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
myntotal |
integer. Number of subjects. |
verbose |
logic. indicating if intermediate results should be output. |
The assumption of the ANOVA approach is that the association of a SNP to a gene probe is tested by using un-balanced one-way ANOVA (e.g. Lonsdale et al. 2013). According to SAS online document https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_power_a0000000982.htm, the power calculation formula is
where is the number of groups of subjects,
is the total number
of subjects,
is the
-th percentile of central F distribution with degrees of freedoms
and
, and
is the non-central F distribution
with degrees of freedoms
and
and non-central parameter (ncp)
. The ncp
is equal to
where is the mean gene expression level
for the
-th group of subjects,
is the weight for the
-th group of subjects,
is the variance of the random errors in ANOVA (assuming each group has equal variance), and
is the weighted mean gene expression level
The weights are the sample proportions for the 3 groups of subjects. Hence,
.
We assume that ,
where
,
, and
are the
mean gene expression level for mutation homozygotes, heterozygotes,
and wild-type homozygotes, respectively.
Denote as the minor allele frequency (MAF) of a SNP. Under Hardy-Weinberg equilibrium, we have genotype frequencies:
,
,
and
, where
,
, and
are genotype
for mutation homozygotes, heterozygotes, and wild-type homozygotes, respectively,
. Then ncp can be simplified as
power of the test after Bonferroni correction for multiple testing.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Lonsdale J and Thomas J, et al. The Genotype-Tissue Expression (GTEx) project. Nature Genetics, 45:580-585, 2013.
minEffectEQTL.ANOVA, powerEQTL.ANOVA, ssEQTL.ANOVA, ssEQTL.ANOVA2
powerEQTL.ANOVA2(effsize = 1, MAF = 0.1, typeI = 0.05, nTests = 2e+05, myntotal = 234, verbose = TRUE)
powerEQTL.ANOVA2(effsize = 1, MAF = 0.1, typeI = 0.05, nTests = 2e+05, myntotal = 234, verbose = TRUE)
Power calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using simple linear regression.
powerEQTL.SLR( MAF, typeI = 0.05, nTests = 2e+05, slope = 0.13, myntotal = 200, mystddev = 0.13, verbose = TRUE)
powerEQTL.SLR( MAF, typeI = 0.05, nTests = 2e+05, slope = 0.13, myntotal = 200, mystddev = 0.13, verbose = TRUE)
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
slope |
Slope
where |
myntotal |
integer. Number of subjects. |
mystddev |
Standard deviation of the random error term |
verbose |
logic. indicating if intermediate results should be output. |
To test if a SNP is associated with a gene probe, we use the simple linear regression
where is the gene expression level of the
-th subject,
is the genotype of the
-th subject, and
is the random error term. Additive coding for genotype is used. To test if the SNP is associated with the gene probe, we test the null hypothesis
.
Denote as the minor allele frequency (MAF) of the SNP. Under Hardy-Weinberg equilibrium, we can calculate the variance of genotype of the SNP:
,
where
is the variance of the
predictor (i.e. the SNP)
.
We then can use Dupont and Plummer's (1998) power/sample size calculation formula to calculate the minimum detectable slope, adjusting for multiple testing.
power of the test after Bonferroni correction for multiple testing.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Dupont, W.D. and Plummer, W.D.. Power and Sample Size Calculations for Studies Involving Linear Regression. Controlled Clinical Trials. 1998;19:589-601.
powerEQTL.SLR( MAF = 0.1, typeI = 0.05, nTests = 2e+05, slope = 0.13, myntotal = 176, mystddev = 0.13, verbose = TRUE)
powerEQTL.SLR( MAF = 0.1, typeI = 0.05, nTests = 2e+05, slope = 0.13, myntotal = 176, mystddev = 0.13, verbose = TRUE)
Sample size calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using un-balanced one-way ANOVA.
ssEQTL.ANOVA( MAF, typeI = 0.05, nTests = 2e+05, mypower = 0.8, mystddev = 0.13, deltaVec = c(0.13, 0.13))
ssEQTL.ANOVA( MAF, typeI = 0.05, nTests = 2e+05, mypower = 0.8, mystddev = 0.13, deltaVec = c(0.13, 0.13))
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
mypower |
Desired power for the eQTL analysis. |
mystddev |
Standard deviation of gene expression levels in one group of subjects. Assume all 3 groups of subjects (mutation homozygote, heterozygote, wild-type homozygote) have the same standard deviation of gene expression levels. |
deltaVec |
A vector having 2 elements. The first element is equal to
|
The assumption of the ANOVA approach is that the association of a SNP to a gene probe is tested by using un-balanced one-way ANOVA (e.g. Lonsdale et al. 2013). According to SAS online document https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_power_a0000000982.htm, the power calculation formula is
where is the number of groups of subjects,
is the total number
of subjects,
is the
-th percentile of central F distribution with degrees of freedoms
and
, and
is the non-central F distribution
with degrees of freedoms
and
and non-central parameter (ncp)
. The ncp
is equal to
where is the mean gene expression level
for the
-th group of subjects,
is the weight for the
-th group of subjects,
is the variance of the random errors in ANOVA (assuming each group has equal variance), and
is the weighted mean gene expression level
The weights are the sample proportions for the 3 groups of subjects. Hence,
.
sample size required for the eQTL analysis to achieve the desired power.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Lonsdale J and Thomas J, et al. The Genotype-Tissue Expression (GTEx) project. Nature Genetics, 45:580-585, 2013.
minEffectEQTL.ANOVA, powerEQTL.ANOVA, powerEQTL.ANOVA2, ssEQTL.ANOVA2
ssEQTL.ANOVA(MAF = 0.1, typeI = 0.05, nTests = 200000, mypower = 0.8, mystddev = 0.13, deltaVec = c(0.13, 0.13))
ssEQTL.ANOVA(MAF = 0.1, typeI = 0.05, nTests = 200000, mypower = 0.8, mystddev = 0.13, deltaVec = c(0.13, 0.13))
Sample size calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using un-balanced one-way ANOVA.
ssEQTL.ANOVA2( effsize, MAF, typeI = 0.05, nTests = 2e+05, mypower = 0.8 )
ssEQTL.ANOVA2( effsize, MAF, typeI = 0.05, nTests = 2e+05, mypower = 0.8 )
effsize |
effect size |
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
mypower |
Desired power for the eQTL analysis. |
The assumption of the ANOVA approach is that the association of a SNP to a gene probe is tested by using un-balanced one-way ANOVA (e.g. Lonsdale et al. 2013). According to SAS online document https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_power_a0000000982.htm, the power calculation formula is
where is the number of groups of subjects,
is the total number
of subjects,
is the
-th percentile of central F distribution with degrees of freedoms
and
, and
is the non-central F distribution
with degrees of freedoms
and
and non-central parameter (ncp)
. The ncp
is equal to
where is the mean gene expression level
for the
-th group of subjects,
is the weight for the
-th group of subjects,
is the variance of the random errors in ANOVA (assuming each group has equal variance), and
is the weighted mean gene expression level
The weights are the sample proportions for the 3 groups of subjects. Hence,
.
We assume that ,
where
,
, and
are the
mean gene expression level for mutation homozygotes, heterozygotes,
and wild-type homozygotes, respectively.
Denote as the minor allele frequency (MAF) of a SNP. Under Hardy-Weinberg equilibrium, we have genotype frequencies:
,
,
and
, where
,
, and
are genotype
for mutation homozygotes, heterozygotes, and wild-type homozygotes, respectively,
. Then ncp can be simplified as
sample size required for the eQTL analysis to achieve the desired power.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Lonsdale J and Thomas J, et al. The Genotype-Tissue Expression (GTEx) project. Nature Genetics, 45:580-585, 2013.
minEffectEQTL.ANOVA, powerEQTL.ANOVA, powerEQTL.ANOVA2, ssEQTL.ANOVA
ssEQTL.ANOVA2( effsize = 1, MAF = 0.1, typeI = 0.05, nTests = 2e+05, mypower = 0.8 )
ssEQTL.ANOVA2( effsize = 1, MAF = 0.1, typeI = 0.05, nTests = 2e+05, mypower = 0.8 )
Sample size calculation for eQTL analysis that tests if a SNP is associated to a gene probe by using simple linear regression.
ssEQTL.SLR( MAF, typeI = 0.05, nTests = 2e+05, slope = 0.13, mypower = 0.8, mystddev = 0.13, n.lower = 2.01, n.upper = 1e+30, verbose = TRUE)
ssEQTL.SLR( MAF, typeI = 0.05, nTests = 2e+05, slope = 0.13, mypower = 0.8, mystddev = 0.13, n.lower = 2.01, n.upper = 1e+30, verbose = TRUE)
MAF |
Minor allele frequency. |
typeI |
Type I error rate for testing if a SNP is associated to a gene probe. |
nTests |
integer. Number of tests in eQTL analysis. |
slope |
Slope
where |
mypower |
Desired power for the eQTL analysis. |
mystddev |
Standard deviation of the random error term |
n.lower |
integer. Lower bound of the total number of subjects. |
n.upper |
integer. Upper bound of the total number of subjects. |
verbose |
logic. indicating if intermediate results should be output. |
To test if a SNP is associated with a gene probe, we use the simple linear regression
where is the gene expression level of the
-th subject,
is the genotype of the
-th subject, and
is the random error term. Additive coding for genotype is used. To test if the SNP is associated with the gene probe, we test the null hypothesis
.
Denote as the minor allele frequency (MAF) of the SNP. Under Hardy-Weinberg equilibrium, we can calculate the variance of genotype of the SNP:
,
where
is the variance of the
predictor (i.e. the SNP)
.
We then can use Dupont and Plummer's (1998) power/sample size calculation formula to calculate the minimum detectable slope, adjusting for multiple testing.
sample size required for the eQTL analysis to achieve the desired power.
Xianjun Dong <[email protected]>, Tzuu-Wang Chang <[email protected]>, Scott T. Weiss <[email protected]>, Weiliang Qiu <[email protected]>
Dupont, W.D. and Plummer, W.D.. Power and Sample Size Calculations for Studies Involving Linear Regression. Controlled Clinical Trials. 1998;19:589-601.
powerEQTL.SLR, minSlopeEQTL.SLR
ssEQTL.SLR( MAF = 0.1, typeI = 0.05, nTests = 2e+05, slope = 0.13, mypower = 0.8, mystddev = 0.13, n.lower = 2.01, n.upper = 1e+30, verbose = TRUE)
ssEQTL.SLR( MAF = 0.1, typeI = 0.05, nTests = 2e+05, slope = 0.13, mypower = 0.8, mystddev = 0.13, n.lower = 2.01, n.upper = 1e+30, verbose = TRUE)