odds_ratio

(usually abbreviated “OR”) is one of three main ways to quantify how strongly the presence or absence of property A is associated with the presence or absence of property B in a given population. If each individual in a population either does or does not have a property “A”, (e.g. “high blood pressure”), and also either does or does not have a property “B” (e.g. “moderate alcohol consumption”) where both properties are appropriately defined, then a ratio can be formed which quantitatively describes the association between the presence/absence of “A” (high blood pressure) and the presence/absence of “B” (moderate alcohol consumption) for individuals in the population. This ratio is the odds ratio (OR) and can be computed following these steps:

For a given individual that has “B” compute the odds that the same individual has “A” For a given individual that does not have “B” compute the odds that the same individual has “A” Divide the odds from step 1 by the odds from step 2 to obtain the odds ratio (OR). The term “individual” in this usage does not have to refer to a human being, as a statistical population can measure any set of entities, whether living or inanimate.

If the OR is greater than 1, then having “A” is considered to be “associated” with having “B” in the sense that the having of “B” raises (relative to not-having “B”) the odds of having “A”. Note that this is not enough to establish that B is a contributing cause of “A”: it could be that the association is due to a third property, “C”, which is a contributing cause of both “A” and “B”.

The two other major ways of quantifying association are the risk ratio (“RR”) and the absolute risk reduction (“ARR”). In clinical studies and many other settings, the parameter of greatest interest is often actually the RR, which is determined in a way that is similar to the one just described for the OR, except using probabilities instead of odds. Frequently, however, the available data only allows the computation of the OR; notably, this is so in the case of case-control studies, as explained below. On the other hand, if one of the properties (say, A) is sufficiently rare (the “rare disease assumption”), then the OR of having A given that the individual has B is a good approximation to the corresponding RR (the specification “A given B” is needed because, while the OR treats the two properties symmetrically, the RR and other measures do not).

In a more technical language, the OR is a measure of effect size, describing the strength of association or non-independence between two binary data values. It is used as a descriptive statistic, and plays an important role in logistic regression.

odds_ratio.txt · Last modified: 2014/12/12 10:47 (external edit)