SSiz Prediction

Content Disclaimer
Copyright @2020.
All Rights Reserved.

StatsToDo: Sample Size for Prediction Statistics

Links : Home Index (Subjects) Contact StatsToDo

Explanations and References Sample Size Table Javascript Program

Input Data

Sample Size (TPR or TNR) in 1 Group: The data is in 3 columns
    - Each row a separate study
    - Col 1 = Type I Error (α)
    - Col 2 = Power (1 - β)
    - Col 3 = Proportion (TPR of TNR)

Sample Size Comparing 2 Groups: The data is in 4 columns.
    - Each row a separate study
    - Col 1 = Probability of Type I error (α)
    - Col 2 = Power (1-β)
    - Col 3 and 4 = the 2 TPRs or 2 TNRs to be compared

R Codes

Program 1: Sample size for Single Group to Estimate TPR or TNR

# Pgm1: Single geoup TPR or TNR > 0.5
#Note: Rate must be >0.5
myDat = ("
Alpha Power Rate
0.05  0.8   0.55
0.01  0.8   0.60
0.01  0.95  0.85
         ") 
myDat
df <- read.table(textConnection(myDat),header=TRUE)  # conversion to data frame    
df      # display input data (true positive, false positive, false negative, and true negative)

SSiz <- vector()     # sample size
for(i in 1:nrow(df))
{
  za = qnorm(df$Alpha[i])
  zb = qnorm(1 - df$Power[i])
  pn = df$Rate[i]
  p0 = 0.5;
  top = za * sqrt(p0*(1-p0)) + zb * sqrt(pn*(1-pn));
  bot = pn - p0;
  SSiz <- append(SSiz, ceiling((top*top) / (bot*bot)))
}
SSiz
df$SSiz <- SSiz
df    # display data frame including calculated sample size

The results are

> df    # display data frame including calculated sample size
  Alpha Power Rate SSiz
1  0.05  0.80 0.55  617
2  0.01  0.80 0.60  249
3  0.01  0.95 0.85   26

Program 2: Sample size for comparing rates (TPR, FPR, FNR, TNR) in two groups
# Pgm2: SSiz comparing two rates myDat = (" Alpha Power Rate1 Rate2 0.05 0.8 0.8 0.7 0.01 0.8 0.8 0.7 0.05 0.9 0.8 0.7 0.01 0.9 0.8 0.7 ") myDat df <- read.table(textConnection(myDat),header=TRUE) # conversion to data frame df # display True Positive Rateate and False Positive Rate SSizU <- vector() # Sample Size per group for unpaired comparison SSizPMin <- vector() # Minimum Sample Size (pairs) for paired comparison SSizPMax <- vector() # Maximum Sample Size (pairs) for paired comparison for(i in 1:nrow(df)) { za = qnorm(df$Alpha[i]) zb = qnorm(1 - df$Power[i]) r1 = df$Rate1[i] r2 = df$Rate2[i] Phat = (r1 + r2) / 2.0 q1 = 1.0 - r1 q2 = 1.0 - r2 Qhat = (q1 + q2) / 2.0 a = za * sqrt(2.0 * Phat * Qhat) + zb * sqrt(r1 * q1 + r2 * q2) a = a * a dif = abs(r1-r2); SSizU <- append(SSizU, ceiling(a * (1 + sqrt(1 + 4 * dif / a))^2 / (4 * dif * dif))) phimax = r1 * (1 - r2) + r2 * (1 - r1); SSizPMin <- append(SSizPMin, ceiling((za * sqrt(dif) + zb * sqrt(dif - dif * dif))^2 / (dif * dif))) SSizPMax <- append(SSizPMax, ceiling((za * sqrt(phimax) + zb * sqrt(phimax - dif * dif))^2 / (dif * dif))) } df$SSizU <- SSizU df$SSizPMin <- SSizPMin df$SSizPMax <- SSizPMax df # display data frame plus sample sizes for unpairec xomparison, plus min and max of paired comparison
The results are
> df # display data frame plus sample sizes for unpairec xomparison, plus min and max of paired comparison Alpha Power Rate1 Rate2 SSizU SSizPMin SSizPMax 1 0.05 0.8 0.8 0.7 251 60 233 2 0.01 0.8 0.8 0.7 395 98 379 3 0.05 0.9 0.8 0.7 339 82 322 4 0.01 0.9 0.8 0.7 506 126 491

StatsToDo: Sample Size for Prediction Statistics

Introduction

Sample size for a Single Group

Sample Size for comparing rates from two groups

References