David A. Kenny
July 28, 2004

 

THIS PAGE NEEDS TO BE REVISED AND SHOULD NOT BE CONSIDERED YET AS FULLY COMPLETED. SUGGESTIONS WOULD BE APPRECIATED. THE PAGE PRESUMES SOME UNDERSTANDING OF MULTIPLE REGRESSION.

 

Moderator Variables: Introduction
Categorical Moderator and Causal Variables
Categorical Moderator and a Continuous Causal Variable
Continuous Moderator and a Categorical Causal Variable
Continuous Moderator and Causal Variables

Other Issues
Bibliography

 

Moderator Variables: Introduction

Considered is the case in which a variable M is presumed to change the X to Y causal relationship. So for instance, a certain form of psychotherapy may reduce depression more for men than for women, and so we would say that gender (M) moderates the causal effect of psychotherapy (X) on depression (Y). Although classically, moderation implies a weakening of a causal effect, a moderator can amplify or even reverse that effect. Complete moderation would occur in the case in which the causal effect of X on Y would go to zero when M took on a particular value. The reader might consult Kraemer et al. (2001; 2002) for a related but somewhat different approach to defining and testing of moderators.

Causal Assumptions

A key question is whether X a randomized variable. Much of what follows is based on this presumption. Uncertainties arise when X is not randomized. If X is not manipulated, then the direction of causation must be assumed. It is possible that the moderator effect can reverse if the direction of causation is flipped (presuming that Y causes X instead of vice versa).

Timing of Measurement

Ideally the moderator should be measured prior to variable X being measured. So if X is manipulated, then M should be measured prior to X being manipulated. Of course, if M is a variable that does not change (e.g., race), the timing of its measurement is less problematic.

Level of Measurement of the Variables

This page is largely organized around the levels of measurement of the moderator and the causal variable. The causal variable, X, can either be categorical (typically a dichotomy) or a continuous variable. So for instance, it might be psychotherapy versus no psychotherapy (a dichotomy) or it might be the amount of psychotherapy (none, one month, two months, or six months; a continuous variable). Much in the same way, the moderator or M can be either categorical (e.g., gender) or continuous (e.g., age).

Moderator and Causal Variable Relationship

If X is a manipulated variable, in principal, there should be no relationship between X and M. The two variable are said to be independent. If X is not randomized, it might be correlated with M. Unlike mediation, there is no need for X and M to be correlated and then correlation has no special interpretation.

Statistical Measurement of Moderation

Generally, moderator effects are indicated by the interaction of X and M in explaining Y. So the following regression equation is estimated:

Y = d + aX + bM + cXM + E

The interaction of X and M measures the moderation effect. As will be seen, the test of moderation is not always operationalized by the product term XM, but it often is.

Categorical Moderator and Causal Variables

This is the prototypical case. When both variables are dichotomous, we have a 2 x 2 design. So for instance, psychotherapy (therapy versus no therapy), might be more effective for women than for men. To estimate the above regression equation, we need to dummy code the moderator and the causal variable. So for instance, if we use codes of zero and one, then we have the following interpretations of the coefficients in the above multiple regression equation:

a – the effect of X when M is zero

b – the effect of M when X is zero

c – how the effect of X varies as M varies

I focus on c because it captures the moderator effect. If c is positive, then it indicates that the effect of X on Y increases as M goes from 0 to 1. If c is negative, then it indicates that the effect of X on Y decreases as M goes from 0 to 1.

If effect coding (one value of X and M is 1 and the other value is –1), the interpretation of the coefficients is as follows:

a – the effect of X averaged across M

b – the effect of M averaged across X

c – half of how much the effect of X changes as M changes

While coding affects the coefficients, it does not affect the inferential statistic for the test of the interaction, the multiple correlation, the predicted values, and the residuals.

It is generally not advised to trim out of the multiple regression equation main effects if the interaction is present in the equation.

There might be an interest in the effect of the causal variable or X for each of the levels of the moderator, something called the simple effects of X. To estimate the simple effects, a different regression equation is run and in each we recode the moderator so that a given level is set to zero. So, if we want to test the effect of X when the moderator or M = 1, the equation is run but M is not used but M׳ = M – 1.

If X or M have more than two levels, then multiple dummy variables are needed (the number of levels less one), and moderation is tested by a set of product variables.

If there are covariates (variables that cause Y and measured prior to Y), they can be entered into the equation and of course they themselves can be moderators.

Because the key results in this case are a set of means (or adjusted means), one can test whether there are differences between them using post hoc tests.

Categorical Moderator and Continuous Causal Variables

An example of this case, M might be race, X might be a personnel test, and Y might be some job performance score. Generally, it is assumed that the effect of X on Y is linear. It is also assumed (but it can be tested, see below) that the moderation is linear. That is, as M varies, the linear effect of X on Y might vary. Thus, the linear relationship increases or decreases as M increases.

It is almost always preferable to measure the linear effect by using a regression coefficient and not a correlation coefficient.

More Complex Specification

The typical way to estimate nonlinear moderation would be to estimate the following equation:

Y = d + a1X + a2X2 + bM + c1XM + c2X2M + E

Nonlinear mediation can be tested by determining if c2 is different from zero.

Baron and Kenny (1986, page 1175) discuss alternative specifications of moderation. For instance, the moderator might act as a threshold variable and there would be no effect of the causal variable when the moderator is low, but at a certain value of the moderator the effect emerges.

Power

Aguinas (2004) has shown that the power of this test can be very low, typically below 50%. One needs very large sample sizes over 200 to have reasonable power to detetect moderator effects when one of the variables is continuous.

Simple Effects

There are two ways to determine simple effects. The first and relatively simpler way is to estimate the simple effects within the regression equation. The second way is to estimate separate regression equations for each level of the moderator. The latter strategy is preferable if there are differences in error variance for the different levels of the moderator.

Continuous Moderator and Categorical Causal Variable

One example might be that the socio-economic status moderates the effect of some intervention. One key issue is to center the variable of socio-economic status; i.e., make sure that zero is a meaningful value.

We may want to determine the effect of X for various levels of the moderator, M. One idea is to determine the effect of X for different values of M. Ideally, these values would be chosen logically. Alternatively, you might pick values at the mean of M, one standard deviation above the mean of M, and one standard deviation below the mean of M.

Continuous Moderator and Causal Variables

This is the most complicated case. One key issue has to do with the type of moderation that is assumed.

One must assume that both X and M are measured without error, an often dubious assumption. Aiken and West (1991) describe methods that do allow researchers to have measurement error but such procedures are rather complicated.

Centering of both X and M is necessary if neither have zero as a meaningful value. We would then measure the effect of X at various levels of M. Ideally, the levels of M would be logically chosen. If not possible, one might use M at the mean and at plus and minus one standard deviation from the mean.

Other Issues

Two additional issues that are discussed here are repeated measures and multilevel modeling.

Repeated Measures

All of the above discussion presumes that the design is between participants. In some cases, the design is repeated measures. Judd, Kenny, and McClelland (2001) describe moderator analyses in this case. In essence moderation is indicating by computing a difference score and seeing if the moderator predicts that difference.

Multilevel Modeling

In multilevel modeling, there are two levels. For instance there might be students in classrooms. Sometimes, there are level 1 moderators; these being moderators that vary within the classroom. More typically there are level 2 moderators; these being moderators that vary between classrooms. One can also determine a generic moderator, that is, measure the extent to which there is variation in the X-Y relationship. .

Bibliography

Aguinis, H. (2004). Moderated regression. New York: Guilford.

Aiken, L. S., & West, S. G. (1991). Multiple regression: Testing and interpreting interactions. Newbury Park, CA: Sage.

Baron, R. M., & Kenny, D. A. (1986). The moderator-mediator variable distinction in social psychological research: Conceptual, strategic and statistical considerations. Journal of Personality and Social Psychology, 51, 1173-1182.

Judd, C. M., Kenny, D. A., & McClelland, G. H. (2001). Estimating and testing mediation and moderation in within-participant designs. Psychological Methods, 6, 115-134.

Kraemer, H. C., Stice E., Kazdin A., Offord D., & Kupfer D. (2001). How do risk factors work together? Moderators, mediators, independent, overlapping, and proxy risk factors. American Journal of Psychiatry, 158, 848-856.

Kraemer H. C., Wilson G. T., Fairburn C. G., & Agras W. S. (2002). Mediators and moderators of treatment effects in randomized clinical trials. Arch Gen Psychiatry, 59, 877-883.