David A. Kenny
August 8, 2011
Recently revised.
Page viewed about 125
times a day.
Please send suggestions
for revision.
Moderator Variables: Introduction
Categorical Moderator and Causal Variables
Categorical Moderator and a Continuous
Causal Variable
Continuous Moderator and a
Categorical Causal Variable
Continuous Moderator and Causal
Variable
My Mediation page
DataToText
macro ModText
that performs a moderation analysis and writes text to describe the results
Andrew
Hayes macro ModProbe
Moderator Variables: Introduction
Basic Definitions
A moderation
analysis is an exercise of external validity in that the question is how
universal is the causal effect.
A key part of
moderation is the measurement of X
to Y causal relationship for
different values of M. We refer to the effect of X on Y for a given value of M
as the simple effect X on Y.
Deciding which
variable is the moderator depends in large part on the researcher's interest. For the earlier example in which gender
moderates the effect of psychotherapy, if one was a gender researcher, one
might say that psychotherapy moderates the effect of gender.
Causal Assumptions
Timing of Measurement
Moderator and Causal Variable
Relationship
Measurement of Moderation
Y = i + aX
+ bM + cXM + E
The interaction of X and M or coefficient c
measures the moderation effect. Note that path a measures the simple
effect of X, sometimes
called the main effect of X, when M equals zero. As will be seen, the test of moderation is
not always operationalized by the product term XM.
Alternative Interpretations of Moderator
Effects
Another worry is the actual moderator may not be the moderator but some other variable with which the moderator correlates. For instance, if we find that gender is a moderator, the real moderator might be height, masculinity-femininity, expectations of others, or income. Unless the moderator is a manipulated variable, we do not know whether it is a true moderator or just a proxy moderator.
Level of Measurement of the Variables
The remainder of
the page is organized around the levels of measurement of the moderator and the
causal variable. The causal variable, X,
can either be categorical (typically a dichotomy) or a continuous variable. So
for instance, X might be
psychotherapy versus no psychotherapy (a dichotomy) or it might be the amount
of psychotherapy (none, one month, two months, or six months; a continuous
variable). Much in the same way, the moderator or M can be either categorical (e.g., gender) or continuous (e.g.,
age).
Categorical Moderator and Causal Variables
When both X and M are dichotomous
(i.e., each have two levels), we have a 2 x 2 design. So for instance,
psychotherapy (therapy versus no therapy), might be more effective for women
than for men. We denote the four cells as X1M1, X1M2, X2M1,and X2M2. To estimate the above
regression equation, we need to dummy code the moderator and the causal
variable. So for instance, if we use codes of zero and one, then we have the
following interpretations of the coefficients in the above multiple regression
equation (Equation 1):
a the effect of X when M is zero
b the effect of M when X is zero
c how much the effect of X changes as M goes from 0 to 1
The focus on c because it captures the moderator effect. If c is positive, then it indicates that the effect of X on Y increases as M goes
from 0 to 1. If c is negative, then
it indicates that the effect of X on
Y decreases as M goes from 0 to 1.
If effect coding (one value of X and M
is 1 and the other value is 1) is used, the interpretation of the coefficients
is as follows:
a the effect of X averaged across M
b the effect of M averaged across X
c half of how much the effect of X changes as M goes from -1 to 1
Which particular coding method that is used
is largely a matter of personal preference.
The important thing is to know what coding system is used and interpret
coefficients accordingly. Although
coding affects the coefficients, it does not affect the inferential statistic
for the test of the interaction (but it does affect the tests of main effects),
the multiple correlation, the predicted values, and the residuals.
Regardless which coding system is used, there
are four means because the design is 2 x 2.
If effect coding were used, the means would equal (where i is the intercept in the regression
equation):
Cell Coding Predicted Mean
X1M1 X = -1; M = -1
i a b + c
X2M1 X = 1; M = -1 i + a b c
There might be an interest in the effect of
the causal variable or X for each of
the levels of the moderator or the simple effects of X. To estimate the simple effects, a different regression equation
is run and in each we recode the moderator so that a given level is set to zero
(Aiken & West, 1991). If we want to test the effect of X when the M = 1, the
equation is run but M is not used
but M׳ = M 1.
If X
or M have more than two levels, then
multiple dummy variables are needed (the number of levels less one), and
moderation is tested by a set of product variables.
If there are covariates (variables that cause
Y and measured prior to Y), they can be entered into the
equation. If the covariates are themselves considered to be moderators, then
they would be allowed to interact with X.
Effect Size Measurement of Moderator
Effects and Power Analysis
Categorical Moderator and Continuous Causal Variable
An example of this case, M is race, X is a personnel test, and Y
is some job performance score. Generally, it is assumed that the effect of X on Y is linear. It is also assumed (but it can be tested, see below)
that the moderation is linear. That is, as M
varies, the linear effect of X on Y might vary. Thus, the linear relationship
increases or decreases as M
increases.
It is almost always preferable
to measure the linear effect by using a regression coefficient and not a
correlation coefficient.
More Complex Specification
Y = d + a1X + b1M + b2M2 + c1XM
+ c2XM2 + E
Nonlinear moderation can be
tested by determining if c2
is different from zero. (Note that M2 effects can only be
estimated if M takes on at least 3
values.) The effect of X in Equation 2
is a1 + (c1 + c2M)M
which would be interpreted as follows:
If c1 were positive and c2 positive, then the effect
of X on Y would be increasing as M
increases, and this increase is increasing as M increases, accelerating.
If c1 were positive and c2 negative, then the effect
of X on Y would be increasing as M
increases, but this increase is declining as M increases,
de-accelerating.
If c1 were negative and c2 positive, then the effect
of X on Y would be decreasing as M
increases, but this decrease is declining as M increases, de-accelerating.
If c1 were negative and c2 negative, then the effect
of X on Y would be decreasing as M
increases, but this decrease is increasing as M increases, accelerating.
Baron and Kenny (1986, page
1175) discuss alternative specifications of moderation. For instance, the
moderator might act as a threshold variable and there would be no effect of the
causal variable when the moderator is low, but at a certain value of the
moderator the effect emerges.
Effect Size and Power
If f2
is known, one can conduct a power analysis using a power analysis
program. For instance, if f2 is assumed to be 0.025, one needs a sample size of 316 to have 80 percent power.
Power for tests of moderation is very low when
one or both of the variables are continuous (McClelland & Judd, 1993). Likely, the much greater interest in
mediation over moderation is due to the low power in tests of moderation.
Simple Effects
The second method is to re-estimate separate
regression equation but transform M
by subtracting 2 or M' = M 2. For this new equation, the effect of X refers to the case in which M is
2. This second method should result in
the same answer as the first.
The third method requires that M take on a few values. Separate regression equations would be
estimated for each value of M. This method does not assume homogeneity of
error variances and so it would likely produce estimates different from the
previous two.
Continuous Moderator and Categorical Causal Variable
An example is that the socio-economic status
moderates the effect of some intervention. One key issue is to center the
variable of socio-economic status; i.e., make sure that zero is a meaningful
value for the moderator.
We may want to determine the effect of X for various levels of the moderator, M, i.e., simple effects. In principal,
the values of M would be chosen
using some sort conceptual rationale. For instance, if IQ were the moderator,
we might use 140 (genius level) and 100 (average level) to compute the effects
of X on Y. More commonly, the values
are one standard deviation above the mean of M and one standard deviation below the mean of M.
Continuous Moderator and Causal Variable
If a product term is used, one must assume that both X and M are measured without error, an often dubious assumption. Latent variables are discussed below.
Centering of both X and M is
necessary if neither have zero as a meaningful value. To interpret the
results and determine simple effects, the effect of X at various levels of M would be measured. Ideally,
the levels of M would be theoretically motivated. If not possible,
one might use M at the mean and at plus and minus one standard deviation
from the mean.
Power of tests of moderation with two
continuous variables is particularly low (McClelland & Judd, 1993).
Latent Variables

Multilevel Modeling
If X is measured at level 1, one can determine a generic moderator, that is, measure the extent to which there is
variation in the X-Y relationship. Evidence of generic moderation would
be obtained if there was variation in the X-Y slopes.
Meta-analysis
Mediated Moderation and Moderated
Mediation
Papers
by Muller, Judd, and Yzerbyt (2005) and Edwards and Lambert (2007) discuss the
relationship between mediated moderation and moderated mediation. They also
present examples of each. Also Preacher, Rucker, and Hayes have developed
a macro for estimating moderated mediation (click
here).
Mixture Modeling
Aiken, L. S., & West, S. G. (1991). Multiple regression: Testing and interpreting
interactions.
Baron,
R. M., & Kenny, D. A. (1986). The moderator-mediator variable distinction
in social psychological research: Conceptual, strategic and statistical
considerations. Journal of Personality
and Social Psychology, 51, 1173-1182.
Cohen, J. (1988). Statistical power analysis for the behavioral sciences.
Edwards, J. R., & Lambert L. S. (2007). Methods for integrating
moderation and mediation: A general analytical framework using moderated path
analysis. Psychological Methods, 12, 1-22.
Frazier,
P. A., Tix, A. P. & Barron, K. E. (2004). Testing moderator and mediator
effects in counseling psychology research. Journal of Counseling Psychology,
51, 115-134.
Hamaker, E. L., Grasman, R. P. P. P., & Kamphuis, J.
H. (2010). Regime-switching models to study psychological processes. In P. C.
M. Molenaar & K. M. Newell (Eds.), Individual pathways of change:
Statistical models for analyzing learning and development, 155-168.
Hayes, A. F., & Matthes, J. (2009). Computational procedures
for probing interactions in OLS and logistic regression: SPSS and SAS
implementations. Behavior Research Methods, 41, 924-936.
Judd, C. M., & Kenny, D. A. (2010). Data analysis. In D. Gilbert, S. T. Fiske, G. Lindzey (Eds.), The handbook of social psychology (5th ed.,
Vol. 1, pp. 115-139), New York: Wiley.
Judd,
C. M., Kenny, D. A., & McClelland, G. H. (2001). Estimating and testing
mediation and moderation in within-participant designs. Psychological Methods, 6, 115-134.
Kenny,
D. A., & Judd, C. M. (1984). Estimating the nonlinear and interactive
effects of latent variables. Psychological Bulletin, 96, 201-210.
Klein,
A. G., & Moosbrugger, H. (2000). Maximum likelihood estimation of latent
interaction effects with the LMS method. Psychometrika, 65, 457-474.
Kraemer, H. C., Stice, E., Kazdin, A., Offord, D., &
Kupfer, D. (2001). How do risk factors work together? Moderators, mediators,
independent, overlapping, and proxy risk factors. American Journal of Psychiatry, 158, 848-856.
Kraemer
H. C., Wilson G. T., Fairburn C. G., & Agras W. S. (2002). Mediators and
moderators of treatment effects in randomized clinical trials. Archives of
General Psychiatry, 59, 877-883.
Marsh, H. W., Hau, K. T., Wen, Z., Nagengast,
B., & Morin, A. J. S. (2011). Moderation. In Little, T. D. (Ed.), Oxford handbook of quantitative methods.
New York: Oxford University Press.
Marsh, H. W., Wen, Z. L.,
& Hau, K. T. (2004). Structural equation models of latent
interactions: Evaluation of alternative estimation strategies and indicator
construction. Psychological Methods,
9, 275-300.
McClelland, G. H., & Judd, C. M. (1993).
Statistical difficulties of detecting interactions and moderator effects. Psychological
Bulletin, 114, 376-390.
Muller,
D., Judd, C. M., & Yzerbyt, V. Y. (2005). When moderation is mediated and
mediation is moderated. Journal of Personality and Social Psychology, 89, 852-863.