One upcoming regression method in Colossus is matched case-control
logistic regression. The implementation is ongoing, but the theory is
presented in the following section.
General Theory
Suppose we have matched case-control data and divide our data into
each matched set. Each set has
cases and
records. We denote the relative risk for individual
in the set by
.
We can calculate the probability of case exposures conditional on all
exposures in the set by taking the ratio of the product of relative
risks in the cases to the sum of the product of relative risks for every
way of selecting
individuals from the
at risk.
Using the methods presented in Gail et al. (1981) we can calculate
the combination of all
ways to select
items with a more manageable recursive formula
.
We can then directly solve for the first and second derivatives and
their recursive formula.
Finally these expressions for
can be substituted into the equations for the contribution of
Log-Likelihood and it’s derivatives from each matched set. The model is
then optimized via the same methods as the other regression models.