Prediction

Copyright @2020.
All Rights Reserved.

StatsToDo: Prediction and Tests

Prediction Statistics :
The data is in 4 columns.
    - Each row a separate study
    - Col 1 = Number of True Positives (TP), Test Positive and Outcome Positive
    - Col 2 = Number of False Positives (FP), Test Positive and Outcome Negative
    - Col 3 = Number of False Negatives (FN), Test Negative and Outcome Positive
    - Col 4 = Number of True Negatives (TN), Test Negative and Outcome Negative
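
The rates derived from these four counts can be written out as below. This is a sketch of the standard definitions, with the standard error taken as the binomial approximation used by the StandardError function in Program 1; the symbols p and n are shorthand introduced here for a rate and its denominator.

```latex
\begin{aligned}
\mathrm{TPR} &= \frac{TP}{TP + FN}, &
\mathrm{FPR} &= \frac{FP}{FP + TN}, \\[4pt]
\mathrm{SE}(p) &= \sqrt{\frac{p\,(1 - p)}{n}},
& n &= TP + FN \ \text{for TPR}, \quad n = FP + TN \ \text{for FPR}.
\end{aligned}
```

For the first example study (TP = 12, FP = 3, FN = 18, TN = 27), TPR = 12 / 30 = 0.400 with SE = 0.089, and FPR = 3 / 30 = 0.100 with SE = 0.055.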

Likelihood Ratio :
The data is in 2 columns.
    - Each row a separate study
    - Col 1 is True Positive Rate (TPR, Sensitivity)
    - Col 2 is False Positive Rate (FPR, 1-specificity, 1-TNR)
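
The two likelihood ratios computed from these columns follow the usual definitions, sketched here with the same abbreviations as the column list:

```latex
\begin{aligned}
LR^{+} &= \frac{\mathrm{TPR}}{\mathrm{FPR}}, &
LR^{-} &= \frac{1 - \mathrm{TPR}}{1 - \mathrm{FPR}}.
\end{aligned}
```

With TPR = 0.4 and FPR = 0.1, this gives LR+ = 0.4 / 0.1 = 4.000 and LR- = 0.6 / 0.9 = 0.667.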

Post-test Probability :
The data is in 2 columns.
    - Each row a separate study
    - Col 1 is Pre-test Probability
    - Col 2 is Likelihood Ratio
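
The conversion goes through odds: the pre-test probability is turned into odds, multiplied by the likelihood ratio, and turned back into a probability. The symbols below are shorthand introduced for this sketch.

```latex
\begin{aligned}
\text{odds}_{\text{pre}}  &= \frac{p_{\text{pre}}}{1 - p_{\text{pre}}}, \\[4pt]
\text{odds}_{\text{post}} &= \text{odds}_{\text{pre}} \times LR, \\[4pt]
p_{\text{post}} &= \frac{\text{odds}_{\text{post}}}{1 + \text{odds}_{\text{post}}}.
\end{aligned}
```

For example, a pre-test probability of 0.5 and a likelihood ratio of 1.5 give pre-test odds of 1, post-test odds of 1.5, and a post-test probability of 1.5 / 2.5 = 0.600.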

R Codes

This panel shows the three programs in R code.

Program 1: Prediction

# Pgm 1: Prediction
# common subroutines

StandardError <- function(n,p)  # Standard Error of a probability p, given sample size n
{
  return (sprintf("%1.3f", sqrt(p * (1.0 - p) / n)))   # Standard Error
}
# Main Program
myDat = ("
TPos  FPos  FNeg  TNeg
12       3    18    27
39      26    91   104
         ") 
df <- read.table(textConnection(myDat),header=TRUE)  # conversion to data frame    
df      # display input data (true positive, false positive, false negative, and true negative)

TPR <- vector()     # true positive rate
TPRSE <- vector()   # SE of true positive rate
FPR <- vector()     # false positive rate
FPRSE <- vector()   # SE of false positive rate
# FNR (false negative rate) = 1 - TPR and has the same SE value
# TNR (true negative rate) = 1 - FPR and has the same SE value
LRPos <- vector()   # likelihood ratio test positive
LRNeg <- vector()   # likelihood ratio test negative

for(i in 1:nrow(df))
{
  tp = df$TPos[i]  # true positive
  fp = df$FPos[i]  # false positive
  fn = df$FNeg[i]  # false negative
  tn = df$TNeg[i]  # true negative
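  outPos = tp + fn    # number of cases with outcome positive
  outNeg = fp + tn    # number of cases with outcome negative
  tpr = tp / outPos   # true positive rate = TP / (TP + FN)
  TPR <- append(TPR,sprintf("%1.3f",tpr))
  TPRSE <- append(TPRSE,StandardError(outPos,tpr))
  fpr = fp / outNeg   # false positive rate = FP / (FP + TN)
  FPR <- append(FPR,sprintf("%1.3f",fpr))
  FPRSE <- append(FPRSE,StandardError(outNeg,fpr))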
  LRPos <- append(LRPos, sprintf("%1.3f",tpr / fpr))
  LRNeg <- append(LRNeg, sprintf("%1.3f",(1-tpr) / (1-fpr)))
}

df$TPR <- TPR
df$TPRSE <- TPRSE
df$FPR <- FPR
df$FPRSE <- FPRSE
df$LRPos <- LRPos
df$LRNeg <- LRNeg
df      # display input data + true and false positive rates, their Standard Errors, and Likelihood Ratios

The outputs are

> df      # display input data (true positive, false positive, false negative, and true negative)
  TPos FPos FNeg TNeg
1   12    3   18   27
2   39   26   91  104
> df      # display input data + true and false positive rates, their Standard Errors, and Likelihood Ratios
  TPos FPos FNeg TNeg   TPR TPRSE   FPR FPRSE LRPos LRNeg
1   12    3   18   27 0.400 0.089 0.100 0.055 4.000 0.667
2   39   26   91  104 0.300 0.040 0.200 0.035 1.500 0.875

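
The same calculation can also be written without a loop. The following is a minimal vectorized sketch, not the page's own program: predictionStats is a hypothetical helper name, the results are kept numeric and rounded only for display, and TPR is divided by the outcome-positive count (TP + FN) while FPR is divided by the outcome-negative count (FP + TN).

```r
# Vectorized sketch of Program 1 (assumption: predictionStats is a
# hypothetical helper name, not part of the original page).
predictionStats <- function(df) {
  outPos <- df$TPos + df$FNeg          # cases with a positive outcome
  outNeg <- df$FPos + df$TNeg          # cases with a negative outcome
  tpr <- df$TPos / outPos              # true positive rate (sensitivity)
  fpr <- df$FPos / outNeg              # false positive rate (1 - specificity)
  data.frame(
    TPR   = tpr,
    TPRSE = sqrt(tpr * (1 - tpr) / outPos),   # binomial standard error
    FPR   = fpr,
    FPRSE = sqrt(fpr * (1 - fpr) / outNeg),
    LRPos = tpr / fpr,                        # likelihood ratio, test positive
    LRNeg = (1 - tpr) / (1 - fpr)             # likelihood ratio, test negative
  )
}

# Same two studies as in Program 1
counts <- data.frame(TPos = c(12, 39), FPos = c(3, 26),
                     FNeg = c(18, 91), TNeg = c(27, 104))
res <- predictionStats(counts)
print(round(cbind(counts, res), 3))
```

Keeping the columns numeric, rather than formatting them with sprintf, lets the results be reused in later calculations; rounded to three decimals they agree with the values listed in the output above.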
Program 2: Likelihood Ratio

# Pgm 2: Likelihood Ratio from true and false positive rates (TPR, FPR)
myDat = ("
TPR   FPR
0.4   0.1
0.3   0.2
")
myDat   # display the raw input text
df <- read.table(textConnection(myDat),header=TRUE)  # conversion to data frame
df      # display True Positive Rate and False Positive Rate

LRPos <- vector()   # likelihood ratio test positive
LRNeg <- vector()   # likelihood ratio test negative
for(i in 1:nrow(df))
{
  LRPos <- append(LRPos, sprintf("%1.3f",df$TPR[i] / df$FPR[i]))
  LRNeg <- append(LRNeg, sprintf("%1.3f",(1-df$TPR[i]) / (1-df$FPR[i])))
}
df$LRPos <- LRPos
df$LRNeg <- LRNeg
df      # display True Positive Rate and False Positive Rate + Likelihood Ratio (pos and neg)

The outputs are

> df      # display True Positive Rate and False Positive Rate
  TPR FPR
1 0.4 0.1
2 0.3 0.2
> df      # display True Positive Rate and False Positive Rate + Likelihood Ratio (pos and neg)
  TPR FPR LRPos LRNeg
1 0.4 0.1 4.000 0.667
2 0.3 0.2 1.500 0.875

Program 3: Post-test Probability from Pre-test Probability and Likelihood Ratio

# Pgm 3: Post-test Probability from Pre-test Probability and Likelihood Ratio
myDat = ("
PreProb  LR
0.5     1.5
0.5     0.88
0.14    1.5
0.14    0.88
")
df <- read.table(textConnection(myDat),header=TRUE)  # conversion to data frame
df      # display pre test probability and Likelihood Ratio

PostProb <- vector()   # Post test probability
for(i in 1:nrow(df))
{
  odds = df$PreProb[i] / (1.0 - df$PreProb[i]) * df$LR[i]
  PostProb <- append(PostProb, sprintf("%1.3f",odds / (1.0 + odds)))
}
df$PostProb <- PostProb
df      # display input data + post test probability

The outputs are

> df      # display pre test probability and Likelihood Ratio
  PreProb   LR
1    0.50 1.50
2    0.50 0.88
3    0.14 1.50
4    0.14 0.88
> df      # display input data + post test probability
  PreProb   LR PostProb
1    0.50 1.50    0.600
2    0.50 0.88    0.468
3    0.14 1.50    0.196
4    0.14 0.88    0.125
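
The odds conversion in Program 3 can also be wrapped in a small reusable function. This is a sketch under stated assumptions rather than the page's own code: postTestProb is a hypothetical helper name, and the input check (pre-test probability in [0, 1), non-negative likelihood ratio) is an added guard, since a pre-test probability of exactly 1 would give infinite odds.

```r
# Sketch of Program 3 as a vectorized function (assumption: postTestProb
# is a hypothetical helper name, not part of the original page).
postTestProb <- function(preProb, lr) {
  # Added guard: probability 1 would produce infinite odds and a NaN result
  stopifnot(all(preProb >= 0 & preProb < 1), all(lr >= 0))
  preOdds  <- preProb / (1 - preProb)   # probability -> odds
  postOdds <- preOdds * lr              # apply the likelihood ratio
  postOdds / (1 + postOdds)             # odds -> probability
}

# Same four cases as in Program 3
preProb <- c(0.5, 0.5, 0.14, 0.14)
lr      <- c(1.5, 0.88, 1.5, 0.88)
post <- postTestProb(preProb, lr)
print(round(post, 3))
```

Rounded to three decimals, the results agree with the PostProb column above (0.600, 0.468, 0.196, 0.125).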