[StATS]: A simple example of a mixed linear regression model (October 18, 2006).
I want to illustrate how to run a simple mixed linear regression model in SPSS. I will use some data on the plasma protein levels of turtles at baseline, after fasting 10 days, and after fasting 20 days. The data is available at
If you import the data directly, it appears like this:
The above format is a “short and wide” format with multiple
measurements as additional columns and only one row per turtle. If you
want to change this to a “tall and thin” format with all the plasma
protein measurements in a single column and three rows per turtle, you
need to use the Restructure Data Wizard
. Select Data | Restructure
from the menu.
The first option transforms “short and wide” data sets to “tall and thin” data sets. The second option does the reverse. The third option performs a simple transposition (rows become columns and columns become rows).
If we had multiple repeated measures (e.g., plasma protein levels and
urine specific gravity), then we would let SPSS know that the repeated
measures should represent two or more distinct columns in the “tall
and thin” data set. Instead, our goal is to place the Fed
,
Fasted10
, and Fasted20
variables all in a single column, so we
tell SPSS that we only want to restructure one variable.
There will be three rows in the new data set for every row in the
original data set, so we need to identify a variable to tell us which
rows are associated with the first subject, which with the second, and
so forth. The three variables that will consolidate into a single
column in the “tall and thin” data set are listed in the
VARIABLES TO BE TRANSPOSED
field, and we will assign the name
PlasmaProtein
to this variable. We also have to warn SPSS that Sex
is a fixed (non-repeating variable).
The index variable assigns a value of 1 to a row if the original
variable was Fed
, 2 if it was Fasted10
, and 3 if it was
Fasted20
.
There are some additional options here, but unless you are working on
a complex problem, it’s generally best to click on the FINISH
button as soon as it appears. Here is what the data looks like.
I created a Time
variable with values 0, 10, and 20 instead of the
TimeIndex
which has values 1, 2, and 3. Then I plotted the
data.
The green lines represent male turtles and the red lines represent female turtles. Notice that all eight turtles show a more or less consistent decline and that the female turtles tend to have low values at the start and the end compared to male turtles. A mixed linear regression model can quantify the size of the decline and the relative separation between males and females.
Select Analyze | Mixed Models | Linear
from the menu.
The first dialog box (above) allows you to designate a variable which indicates which rows in the data set belong to the same subject.
This dialog box appears next and asks you specify a dependent (outcome variable) as well as factors and covariates. Use factors for categorical predictor variables and covariates for continuous predictor variables.
If you have more than one factor/covariate, you should click on the
Fixed
button and inform SPSS what types of interactions, if any, you
wish to examine. In this model, the Sex*Time
interaction tests
whether females have a steeper or flatter slope than the males.
You also need to specify the random effects portion of the model. If
you do nothing else, be sure to include Subject
in the
Combinations
field at the bottom of this dialog box and check the
Include Intercept option. If you do nothing in addition, then you are
fitting a random intercepts model. If you include a term for Time
,
then you are fitting a random slopes model.
Finally, be sure that the Parameter estimates
option under the
Statistics
button is checked.
To run a mixed linear regression model select
Analyze | Mixed Models | Linear
from the menu.
This table shows the degrees of freedom associated with various predictor variables.
The various Information Criteria variables allow you to compare different models (e.g., a random intercepts and a random slopes model).
The above table shows tests for all of the predictor variables and interactions. Note that the Sex*Time interaction is not statistically significant.
The Estimates of Fixed Effects table shows the quantitative estimates and their confidence intervals. The intercept represents the estimated baseline plasma protein in the reference category (male). The estimate for Sex, -4.1, tells you that the estimated baseline plasma protein is lower in females, but the effect is borderline, at best, because the 95% confidence interval includes the value of zero. The effect of time is large–each day of fasting is associated with a 0.17 unit decline in plasma protein. Even after allowing for sampling error, we would still estimate this slope to represent a decline of at least 0.11 units per day. The interaction term is small showing that the difference in the decline between male and female turtles is of little practical consequence.
The covariance parameters provide you with an estimate of between and within subject variation. These numbers are very useful for planning future research studies.
This page was written by Steve Simon while working at Children’s Mercy Hospital. Although I do not hold the copyright for this material, I am reproducing it here as a service, as it is no longer available on the Children’s Mercy Hospital website. Need more information? I have a page with general help resources. You can also browse for pages similar to this one at Category: Mixed linear model.
model](../category/MixedModels.html). for pages similar to this one at [Category: Mixed linear with general help resources. You can also browse Children’s Mercy Hospital website. Need more information? I have a page reproducing it here as a service, as it is no longer available on the Hospital. Although I do not hold the copyright for this material, I am This page was written by Steve Simon while working at Children’s Mercy