FALSE POSITIVES

Name:

FALSE POSITIVES (LET) Type:

Let Subcommand Purpose:

Compute the proportion of false positives between two binary variables. Description:

	Variable 2
Variable 1	Success	Failure	Row Total

Success	N₁₁	N₁₂	N₁₁ + N₁₂
Failure	N₂₁	N₂₂	N₂₁ + N₂₂

Column Total	N₁₁ + N₂₁	N₁₂ + N₂₂	N

The parameters N₁₁, N₁₂, N₂₁, and N₂₂ denote the counts for each category.

Success and failure can denote any binary response. Dataplot expects "success" to be coded as "1" and "failure" to be coded as "0". Some typical examples would be:

Variable 1 denotes whether or not a patient has a disease (1 denotes disease is present, 0 denotes disease not present). Variable 2 denotes the result of a test to detect the disease (1 denotes a positive result and 0 denotes a negative result).
Variable 1 denotes whether an object is present or not (1 denotes present, 0 denotes absent). Variable 2 denotes a detection device (1 denotes object detected and 0 denotes object not detected).

In these examples, the "ground truth" is typically given as variable 1 while some estimator of the ground truth is given as variable 2.

The proportion of false positives is then N₂₁/N (i.e., the number of cases where variable 1 is a failure and variable 2 is a success). In the context of the first examples above, the test detected the disease when it was in fact not present.

Syntax:

Examples:

Note:

The two variables must have the same number of elements. Note:

Raw data - in this case, the variables contain 0's and 1's.
If the data is not coded as 0's and 1's, Dataplot will check for the number of distinct values. If there are two distinct values, the minimum value is converted to 0's and the maximum value is converted to 1's. If there is a single distinct value, it is converted to 0's if it is less than 0.5 and to 1's if it is greater than or equal to 0.5. If there are more than two distinct values, an error is returned.
Summary data - if there are two observations, the data is assummed to be the 2x2 summary table. That is,

Note:

In this case, if variable one denotes "ground truth" and variable two denotes the estimate of ground truth, then we define:

A correct value is the case where the estimated category is the same as the ground truth category.
For this case, we do not distinguish between "true positives" and "true negatives" as we do for the 2x2 case.
A false positive is the case where the estimated category is too large.
A false negative is the case where the estimated category is too small.

Note:

HELP STATISTICS

Default:

None Synonyms:

None Related Commands:

TRUE POSITIVES	= Compute the proportion of true positives.
TRUE NEGATIVES	= Compute the proportion of true negatives.
FALSE NEGATIVES	= Compute the proportion of false negatives.
POSITIVE PREDICTIVE VALUE	= Compute the positive predictive value.
NEGATIVE PREDICTIVE VALUE	= Compute the negative predictive value.
TEST SENSITIVITY	= Compute the test sensitivity.
TEST SPECIFICITY	= Compute the test specificity.
ODDS RATIO	= Compute the bias corrected log(odds ratio).
ODDS RATIO STANDARD ERROR	= Compute the standard error of the bias corrected log(odds ratio).
RELATIVE RISK	= Compute the relative risk.
STATISTICS PLOT	= Generate a statistic versus subset plot.
BOOTSTRAP PLOT	= Generate a bootstrap plot.
TABULATE	= Perform a tabulation for a specified statistic.

Reference:

Statistical Methods for Rates and Proportions

Applications:

Categorical Data Analysis Implementation Date:

2007/04 Program:

 
let n = 1
.
let p = 0.2
let y1 = binomial rand numb for i = 1 1 100
let p = 0.1
let y2 = binomial rand numb for i = 1 1 100
.
let p = 0.4
let y1 = binomial rand numb for i = 101 1 200
let p = 0.08
let y2 = binomial rand numb for i = 101 1 200
.
let p = 0.15
let y1 = binomial rand numb for i = 201 1 300
let p = 0.18
let y2 = binomial rand numb for i = 201 1 300
.
let p = 0.6
let y1 = binomial rand numb for i = 301 1 400
let p = 0.45
let y2 = binomial rand numb for i = 301 1 400
.
let p = 0.3
let y1 = binomial rand numb for i = 401 1 500
let p = 0.1
let y2 = binomial rand numb for i = 401 1 500
.
let x = sequence 1 100 1 5
.
let a = false positives y1 y2 subset x = 1
tabulate false positives y1 y2 x
.
label case asis
xlimits 1 5
major xtic mark number 5
minor xtic mark number 0
xtic mark offset 0.5 0.5
ytic mark offset 0.05 0.05
y1label Proportion of False Positives
x1label Group ID
character x blank
line blank solid
.
false positives plot y1 y2 x