Someone asked me today how to compute an adjusted odds ratio. He has a case-control study where the cases are cancer patients. He also has data on various Single Nucleotide Polymorphisms (SNPs). These would be coded as 0-1 depending on whether the SNP was absent or present. He also has demographic information, such as age, sex, smoking status, and so forth.

To compute an adjusted odds ratio, simply fit a logistic regression model. The outcome variable is the 0-1 variable which represents case or control status. The independent variables include a particular SNP variable, as well as all the demographic variables. The odds ratio that you get for the SNP variable shows the effect of that SNP on cancer status, after adjusting for all the demographic variables.
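
Here is a rough sketch in Python of what "adjusting" does. Everything in it is made up for illustration: the counts, the single adjustment variable (smoking), and the helper names `solve` and `fit_logistic`. It fits the logistic model by plain Newton-Raphson so that it runs without any extra packages. For a real analysis you would use a statistical package, which would also give you confidence intervals.

```python
import math

def solve(A, b):
    """Solve the linear system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [A[i][:] + [b[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x

def fit_logistic(X, y, iterations=25):
    """Fit a logistic regression by Newton-Raphson.

    Each row of X must start with a 1 for the intercept.
    Returns the list of coefficients (log odds ratios).
    """
    p = len(X[0])
    beta = [0.0] * p
    for _ in range(iterations):
        grad = [0.0] * p
        hess = [[0.0] * p for _ in range(p)]
        for row, yi in zip(X, y):
            eta = sum(b * x for b, x in zip(beta, row))
            mu = 1.0 / (1.0 + math.exp(-eta))   # predicted probability of being a case
            w = mu * (1.0 - mu)
            for j in range(p):
                grad[j] += (yi - mu) * row[j]
                for k in range(p):
                    hess[j][k] += w * row[j] * row[k]
        step = solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
    return beta

# Hypothetical counts: (snp, smoker, number of cases, number of controls)
counts = [
    (1, 0, 10, 20),
    (0, 0, 10, 40),
    (1, 1, 40, 10),
    (0, 1, 20, 10),
]

# Expand the counts into one record per subject.
snp, smoker, case = [], [], []
for s, k, n_cases, n_controls in counts:
    for status, n in ((1, n_cases), (0, n_controls)):
        snp += [s] * n
        smoker += [k] * n
        case += [status] * n

# Crude model: SNP only. Adjusted model: SNP plus smoking status.
crude = fit_logistic([[1, s] for s in snp], case)
adjusted = fit_logistic([[1, s, k] for s, k in zip(snp, smoker)], case)

# Exponentiating the SNP coefficient gives the odds ratio.
crude_or = math.exp(crude[1])
adjusted_or = math.exp(adjusted[1])
print(f"crude OR for SNP:    {crude_or:.2f}")     # 2.78
print(f"adjusted OR for SNP: {adjusted_or:.2f}")  # 2.00
```

In this made-up data, smokers are more likely to carry the SNP and more likely to be cases, so the crude odds ratio (2.78) overstates the SNP effect. The adjusted odds ratio (2.00) matches the odds ratio you would compute separately within smokers and within non-smokers.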

Make sure you have enough data before you do these adjustments. The rule of thumb is that you need 10 to 15 observations in the less common outcome group (here, either the number of cases or the number of controls, whichever is smaller) for each independent variable in the model. So if you have 60 cases and 120 controls, then you can fit a logistic model with about 4 to 6 independent variables, because 60/15 = 4 and 60/10 = 6.
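
The arithmetic behind that rule of thumb is simple enough to sketch in a few lines of Python. The function name `max_predictors` and its arguments are my own invention for this illustration, and the result is only a rough guide, not a hard limit.

```python
def max_predictors(n_cases, n_controls, low=10, high=15):
    """Rough range of independent variables a logistic model can support.

    Uses the rule of thumb of `low` to `high` observations per variable,
    counted in the smaller of the two outcome groups.
    """
    limiting = min(n_cases, n_controls)
    return limiting // high, limiting // low

print(max_predictors(60, 120))  # (4, 6)
```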

For further details and a simple example, check out

You can find an earlier version of this page on my original website.