Z-factor

From Wikipedia, the free encyclopedia

This article or section is in need of attention from an expert on the subject.

WikiProject Statistics may be able to help recruit one.

If a more appropriate WikiProject or portal exists, please adjust this template accordingly.

In statistics, the Z-factor is a measure of the quality or power of a high-throughput screening (HTS) assay. It is not the same as the z-score.^[1]

In an HTS campaign, assayists often compare a large number (hundreds of thousands to tens of millions) of single measurements of unknown samples to well established positive and negative control samples. The purpose is to determine which, if any, of the single measurements are significantly different from the negative control. The analyst must consider the distribution of measurements from the positive control, negative control, and the other single measurements in order to determine the probability that each measurement may have occurred by chance. These distributions cannot been determined until after the campaign is completed, and by their nature, HTS projects are expensive in time and resources. So prior to starting a campaign, much work is done to assess the quality of an assay on a smaller scale, and predict if the assay would be useful in a high-throughput setting. The Z-factor predicts if useful data could be expected if the assay were scaled up to millions of samples.^[2]

You need four parameters to calculate the Z-factor: the mean ( $μ$ ) and standard deviation ( $σ$ ) of both the positive (p) and negative (n) controls ( $μ p$ , $σ p$ , $μ n$ , $σ n$ , respectively). Given these, Zhang and colleagues define Z-factor as follows:

$Zfactor = 1 - {3 \times (\sigma_p + \sigma_n) \over | \mu_p - \mu_n |}$

An alternative but equivalent definition of Z-factor is calculated from the Sum of Standard Deviations (SSD) divided by the range of the assay (R):

$S S D = σ p + σ n$
$R = | μ p - μ n |$
$Zfactor = 1 - 3 \times {SSD \over R}$

The following interpretations for the Z-factor were taken from Zhang, et. al. 1999:

Z-factor	Interpretation
1.0	Ideal. This is approached when you have a huge dynamic range with tiny standard deviations. Z-factors can never actually equal 1.0 and can certainly never be greater than 1.0.
between 0.5 and 1.0	An excellent assay.
between 0 and 0.5	A marginal assay.
less than 0	The signal from the positive and negative controls overlap, making the assay essentially useless for screening purposes.

[edit] See also

[edit] Notes

^ "Note that the term 'z' as used here has absolutely nothing to do with the use of z to describe how far a value is from the mean as the z ratio, which is the number of standard deviations away from the mean. All statistics books use z in this context, which has nothing to do with the Z-factor used to assess a screening assay." (Zhang, 1999)
^ Point-of-view: The constant in the equation, 3, is set for the limits of the two reference controls to 3 standard deviations. It may need to be changed if you plan on collecting replicate measurements from your unknown samples, instead of single measurements.

[edit] References

Zhang JH, Chung TD, Oldenburg KR, "A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays." J Biomol Screen. 1999;4(2):67-73.