2 Counts

Content Disclaimer
Copyright @2020.
All Rights Reserved.

StatsToDo: Compare Two Poisson Distributed Count Rates

Links : Home Index (Subjects) Contact StatsToDo

Explanations Javascript Ptogram

Data

Data Entry

The data is a table with 4 columns separated by spaces or tabs.

Col 1 is the count in group 1 (k1)
Col 2 is the sample size in group 1 (n1)
Col 3 is the count in group 2 (k2)
Col 4 is the sample size in group 2 (n2)

R Codes

The R codes are divided into the following sections

Subroutines that creates and access a vector of log(factorial) numbers, as these are repeatedly used in C and E Tests
The input data and setting up of log(factorial) vectors as these are common to all 3 algorithms
Three (3) algorithms. The whitehead algorithm, the C Test, and the E Test
Results

Section 1; Vector of log(factorial) numbers

# log(factorial) vector for repeated calculations of factorial and binomial coefficients
logFactVector = vector()

MakeLogFactVector <- function(n) # make log(factorial vector
{
  x = 0
  for(i in 1:n)
  {
    x = x + log(i)
    logFactVector <<- append(logFactVector,x)
  }
}

LogFact <- function(i)
{
  if(i==0) i=1
  return (logFactVector[i])
}

# binomial coefficient
BinomCoeff <- function (n,k)
{
  if(k==0 | k==n) return (1)
  return (exp(LogFact(n) - LogFact(k) - LogFact(n-k)))
}

Section 2: Input data and initial setup of log(factorial) vector
# input data myDat = (" k1 n1 k2 n2 13 10 8 10 10 30 10 50 12 100 4 110 ") myDat df <- read.table(textConnection(myDat),header=TRUE) # conversion to data frame df # display input data (count(k) and sample size (n) for groups 1 and 2) # Create log(factorial) vector maxN = max(df) MakeLogFactVector(maxN)
Section 3: three programs of comparing two counts using the same data and supportive functions
Program 1: Whitehead's algorithm
# Pgm 1 Whitehead's algorithm Pw <- vector() # probability of Type I Error (Whitehead) one tail for(i in 1:nrow(df)) { k1 = df$k1[i] n1 = df$n1[i] k2 = df$k2[i] n2 = df$n2[i] n = n1 + n2; zi = (n1*k2 - n2*k1)/n vi = n1*n2*(k1+k2)/(n*n); z = abs(zi / sqrt(vi)) Pw <- append(Pw,1 - pnorm(z)) # Whitehead alpha one tail } df$Pw <- Pw # add Whitehead's algorithm results to data frame
Program 2: C Test
# Program 2: C Test Pc <- vector() # probability of Type I Error (C Test) one tail for(i in 1:nrow(df)) { k1 = df$k1[i] n1 = df$n1[i] k2 = df$k2[i] n2 = df$n2[i] x = 1 # for ratio = 1 if(k1/n1!=k2/n2) { x1=0 x2=0 k = k1 + k2 p = 0; if(k1>k2) { p = (n1/n2)/(1.0 + (n1/n2)) } else { k1 = k2 p = (n2/n1) / (1.0 + (n2/n1)) } for(j in 0:1) { x = 0; if(j==0) { for(ii in k1:k) x = x + BinomCoeff(k,ii) * p^ii * (1-p)^(k-ii) # choose is binomial coefficient } else { for(ii in 0:k) x = x + BinomCoeff(k,ii) * p^ii * (1-p)^(k-ii) } if(j==0) { x1 = x } else { x2 = x } } if(x1<x2) { x = x1; } else { x = x2; } if(x>1.0)x = 1.0; } Pc <- append(Pc,x / 2) } df$Pc <- Pc # add C Test results to data frame
Program 3: E Test
#Program 3: E Test # Result vectors Pe <- vector() # probability of Type I Error (E Test) one tail for(i in 1:nrow(df)) { k1 = df$k1[i] n1 = df$n1[i] k2 = df$k2[i] n2 = df$n2[i] diff = 0 x = 1 if(k1 / n1 != k2 / n2) { Vk = k1 / (n1*n1) + k2 / (n2*n2) Tk1k2 = abs((k1 / n1 - k2 / n2 - diff) / sqrt(Vk)) Lambda2k = (k1+k2) / (n1+n2) - diff * n1 / (n1 + n2) SumT = 0 SumF = 1 x1 = 0 while (x1<1000 && SumF>1e-10) { x2 = 0 SumF = 0 f = 1; while (x2<1000 & f>1e-10) { Vx = x1 / (n1*n1) + x2 / (n2*n2) if(abs(x1 / n1 - x2 / n2)<=diff) { Tx1x2 = 0 } else { Tx1x2 = abs(x1 / n1 - x2 / n2 - diff) / sqrt(Vx); } if (Tx1x2>=Tk1k2) { v1 = exp(-n1 * (Lambda2k + diff)) v2 = (n1 * (Lambda2k + diff))^x1 v3 = exp(-n2 * Lambda2k) * (n2 * Lambda2k)^x2 f = exp(log((v1*v2*v3)) - (LogFact(round(x1)) + LogFact(round(x2)))) SumF = SumF + f } x2 = x2 + 1; } SumT = SumT + SumF x1 = x1 + 1 } x = SumT / 2 } Pe <- append(Pe,x) } df$Pe <- Pe # add E Test results to data frame
Final display of data and results
df # display input data and the results (Type I Error, one tail), Pw=Whitehead, Pc=C Test, Pe= E Test
The results are:
> df # display input data (count(k) and sample size (n) for groups 1 and 2) k1 n1 k2 n2 1 13 10 8 10 2 20 30 10 50 3 12 100 4 110 > k1 n1 k2 n2 Pw Pc Pe 1 13 10 8 10 0.13761676 0.09582758 0.14336618 2 10 20 10 50 0.01694743 0.49414271 0.04229829 3 12 100 4 110 0.01415499 0.01249091 0.01611372 >

StatsToDo: Compare Two Poisson Distributed Count Rates

Data Input

More Complex Models

References