David A. Kenny
Moderator Variables: Introduction
Categorical Moderator and Causal Variables
Categorical Moderator and a
Continuous Causal Variable
Continuous Moderator and a
Categorical Causal Variable
Continuous Moderator and Causal
Variables
Moderator Variables: Introduction
Basic Definitions
A moderator
analysis is an exercise of external validity in that the question is how
universal is the causal effect.
A key part of
moderation is the measurement of X
to Y causal relationship for
different values of M. We refer to the effect of X on Y for a give value of M
as the simple effect X on Y.
Causal Assumptions
Timing of Measurement
Moderator and Causal Variable
Relationship
Measurement of Moderation
Y = d + aX + bM + cXM
+ E
The interaction of X and M or coefficient c
measures the moderation effect. Note that path a measures the simple
effect of X when M equals zero. As will be seen, the test of moderation is
not always operationalized by the product term XM, but it often is.
Alternative Interpretations of Moderator
Effects
Another worry is the actual moderator may not be the moderator but some other variable with which the moderator correlates. For instance, if we find that gender is a moderator, the real moderator might be height, masculinity-femininity, expectations of others, or income. Unless the moderator is a manipulated variable, we do not know whether it is a true moderator or just a proxy moderator.
Level of Measurement of the Variables
Categorical
Moderator and Causal Variables
When both X and M are dichotomous
(i.e., each have two levels), we have a 2 x 2 design. So for instance,
psychotherapy (therapy versus no therapy), might be more effective for women
than for men. We denote the four cells as X1M1, X1M2, X2M1,and X2M2. To estimate the above
regression equation, we need to dummy code the moderator and the causal
variable. So for instance, if we use codes of zero and one, then we have the
following interpretations of the coefficients in the above multiple regression
equation:
a – the effect of X when M is zero
b – the
effect of M when X is zero
c – how
much the effect of X changes as M goes from 0 to 1
The focus on c because it captures the moderator
effect. If c is positive, then it indicates that the effect of X on Y increases as M goes
from 0 to 1. If c is negative, then
it indicates that the effect of X on
Y decreases as M goes from 0 to 1.
If effect coding (one value of X and M
is 1 and the other value is –1) is used, the interpretation of the
coefficients is as follows:
a – the effect of X averaged across M
b – the
effect of M averaged across X
c – half
of how much the effect of X changes
as M goes from -1 to 1
Which particular coding method that is used
is largely a matter of personal preference. The important thing is to know what coding
system is used and interpret coefficients accordingly. While coding affects the
coefficients, it does not affect the inferential statistic for the test of the
interaction (but it does affect the tests of main effects), the multiple
correlation, the predicted values, and the residuals.
Regardless which coding system is used, there
are four means because the design is 2 x 2.
If effect coding were used, the means would equal where i is the intercept in the regression
equation:
Cell Coding Predicted Mean
X1M1 X = -1; M = -1
i – a – b + c
X2M1 X = 1; M = -1 i +
a – b – c
X1M2 X = -1; M = 1 i – a + b – c
X2M2 X = 1; M = 1 i + a + b + c
There might be an interest in the effect of
the causal variable or X for each of
the levels of the moderator or the simple effects of X. To estimate the simple effects, a different regression equation
is run and in each we recode the moderator so that a given level is set to zero
(Aiken & West, 1991). If we want to test the effect of X when the M = 1, the
equation is run but M is not used
but M׳ = M – 1.
If X
or M have more than two levels, then
multiple dummy variables are needed (the number of levels less one), and
moderation is tested by a set of product variables.
If there are covariates (variables that cause
Y and measured prior to Y), they can be entered into the
equation. If the covariates are themselves considered to be moderators, then
they would be allowed to interact with X.
Effect Size Measurement of Moderator
Effects and Power Analysis
Because the design
is 2 X 2, the estimate of the moderator effects can be
viewed as a difference between two means (X1M1 and X2M2 vs. X2M1 and
X1M2). Using these two means a d can be computed and a power analysis
can be undertaken.
Categorical
Moderator and Continuous Causal Variables
An example of this case, M might be race, X might be a personnel test, and Y might be some job performance score. Generally, it is assumed
that the effect of X on Y is linear. It is also assumed (but it
can be tested, see below) that the moderation is linear. That is, as M varies, the linear effect of X on Y might vary. Thus, the linear relationship increases or decreases
as M increases.
It is almost always preferable
to measure the linear effect by using a regression coefficient and not a
correlation coefficient.
More Complex Specification
Y = d + a1X + a2X2 + bM + c1XM + c2X2M
+ E
Nonlinear moderation can be
tested by determining if c2
is different from zero.
Baron and Kenny (1986, page 1175)
discuss alternative specifications of moderation. For instance, the moderator
might act as a threshold variable and there would be no effect of the causal
variable when the moderator is low, but at a certain value of the moderator the
effect emerges.
Power
Simple Effects
Continuous
Moderator and Categorical Causal Variable
One example might be that the socio-economic
status moderates the effect of some intervention. One key issue is to center
the variable of socio-economic status; i.e., make sure that zero is a
meaningful value.
We may want to determine the effect of X for various levels of the moderator, M. One idea is to determine the effect
of X for different values of M. In principal, these values would be
chosen using some sort conceptual rationale. For instance, if IQ were the
moderator, we might use 140 (genius level) and 100 (average level) to compute
the effects of X on Y.
More commonly, the values are one standard deviation above the mean of M and one standard deviation below the
mean of M.
Continuous Moderator and Causal Variables
Repeated Measures Multilevel Modeling Multilevel Modeling Mediated Moderation and Moderated
Mediation Aguinis, H. (2004). Moderated regression.
Aiken, L. S., &
West, S. G. (1991). Multiple regression:
Testing and interpreting interactions. Baron, R. M., &
Kenny, D. A. (1986). The moderator-mediator variable distinction in social
psychological research: Conceptual, strategic and statistical considerations. Journal of Personality and Social
Psychology, 51, 1173-1182. Judd, C. M., &
Kenny, D. A. (2010). Data analysis. In S. Fiske, D.
Gilbert, G. Lindsay (Eds.), The handbook of social psychology, in
press. Judd, C. M., Kenny,
D. A., & McClelland, G. H. (2001). Estimating and testing mediation and
moderation in within-participant designs. Psychological Methods, 6, 115-134. Kenny, D. A., & Judd, C. M. (1984). Estimating the nonlinear and interactive effects of latent variables. Psychological Bulletin, 96, 201-210. Kraemer, H. C., Stice, E., Kazdin, A., Offord, D., & Kupfer, D.
(2001). How do risk factors work together? Moderators, mediators, independent,
overlapping, and proxy risk factors. American
Journal of Psychiatry, 158, 848-856. Kraemer H. C.,
Wilson G. T., Fairburn C. G., & Agras W. S.
(2002). Mediators and moderators of treatment effects in randomized clinical
trials. Archives of General Psychiatry, 59, 877-883. Ping, R. A. (1996). Latent variable interaction and quadratic effect estimation: A two-step technique using structural equation analysis. Psychological Bulletin, 119, 166-175.