Disease Screening

Disease Screening

An Application of Conditional Probability

	D	D*	Total
T	Disease Present, Screens +	Disease Absent, Screens +	Screens +
T*	Disease Present, Screens -	Disease Absent, Screens -	Screens -
Total	Disease Present	Disease Absent	Total

There are four possible combinations of Disease and Test:

DandT ~ Disease Present and Screen Shows Positive
DandT* ~ Disease Present and Screen Shows Negative

D*andT ~ Disease Absent and Screen Shows Positive
D*andT* ~ Disease Absent and Screen Shows Negative

Pr{D|T} ~ Probability that a subject has the disease, given a positive test
Pr{D*|T} ~ Probability that a subject lacks the disease, given a positive test

Pr{D*|T*} ~ Probability that a subject lacks the disease, given a negative test
Pr{D|T*} ~ Probability that a subject has the disease, given a negative test

What is Pr{D|T}, the probability that disease is present given a positive screen?

Pr{D|T} =
Pr{DandT}/Pr{T} =
Pr{T|D}*Pr{D}/Pr{T} =
Pr{T|D}*Pr{D}/(Pr{TandD} + Pr{TandD*}) =
Pr{T|D}*Pr{D}/(Pr{T|D}*Pr{D} + Pr{T|D*}*Pr{D*})

That is,

Pr{D|T} = ( Pr{T|D}*Pr{D} ) / ( Pr{T|D}*Pr{D} + Pr{T|D*}*Pr{D*} )

So we need the prevalence of disease (Pr{D}), Pr{T|D} = 1 - Pr{T*|D}, where Pr{T*|D} is the false-negative rate, and Pr{T|D*}, the false positive rate.

Suppose that we are given that Pr{T|D*} = "False Positive" = .05, Pr{T*|D} = "False Negative" = .01 and Pr{D} = .001. Then Pr{T|D*} = "False Positive" = .05, so then Pr{T*|D*} = .95 and Pr{T*|D} = "False Negative" = .01, so then Pr{T|D} = .99

Plugging in what we know:

Pr{D|T} =(.99)*(.001)/(.99*(.001) + .05*(.999)) = .00099/(.00099+.04995) = .019 (<2%). So in this case, approximately 2% of the positive tests actually indicate disease, which leaves the other 98% with a false finding.

Let's repeat the calculation, but with Pr{T|D*} = "False Positive" = .005, Pr{T*|D} = "False Negative" = .005 and Pr{D} = .001. Then Pr{T|D*} = "False Positive" = .05, so then Pr{T*|D*} = 1 - Pr{T|D*} = 1 - .005 = .995 and then Pr{T|D} = 1 - Pr{T*|D} = 1 - .005 = .995.

Pr{D|T} =(.995)*(.001)/(.995*(.001) + .005*(.999)) = .00099/(.00099+.04995) = .16611 (16%). So in this case, approximately 16% of the positive tests actually indicate disease, which leaves the other 84% with a false finding.

What is Pr{D*|T*}, the probability that disease is absent given a negative screen?

Pr{D*|T*} =
Pr{D*andT*}/Pr{T*} =
Pr{T*|D*}*Pr{D*}/Pr{T*} =
Pr{T*|D*}*Pr{D*}/(Pr{T*andD*} + Pr{T*andD)) =
Pr{T*|D*}*Pr{D*}/(Pr{T*|D*}*Pr{D*} + Pr{T*|D}*Pr{D})

That is,

Pr{D*|T*} = Pr{T*|D*}*Pr{D*}/(Pr{T*|D*}*Pr{D*} + Pr{T*|D}*Pr{D})

We need the prevalence of disease (Pr{D}), Pr{T*|D*} = 1 - Pr{T|D*}, where Pr{T|D*} is the false-positive rate, and Pr{T*|D}, the false negative rate. Recall that Pr{D*) = 1 - Pr{D}.

Plugging in what we know:

Pr{D*|T*} = (.95)*(.999)/( (.95)*(.999) + (.01)*(.001)) = .9999+. So in this case, approximately 99.99% of the negative tests actually indicate absence of disease.

Pr{D*|T*} = .995*.999/(.995*.999 + .005*.001) = .9999+. So in this case, approximately 99.99% of the negative tests actually indicate absence of disease.

What this application suggests is that the likely problem with a diagnostic test is the proper interpretation of positive tests.