Measuring Causality

Measuring Causal Effects

  • There are three unknown parameters that can be estimated to help measure causal effects:

    • ATEATE: Average treatment effect
    • ATTATT: Average treatment effect for the treatment group
    • ATUATU: Average treatment effect for the control group
  • The parameter ATEATE is dependent on ATTATT and ATUATU
  • If ATEATE is positive and large, then the treatment has a large, positive effect on average (compared to the control group)
  • If ATEATE is negative and large, then the treatment has a large, negative effect on average (compared to the control group)
  • If ATEATE is small or 00, then the treatment doesn't really have any effect on average (compared to the control group)

Defining Average Treatment Effects

  • The formula for ATEATE is defined as the following:
ATE=E[δ]=E[Y1Y0]\bold{ATE} = E[\delta] = E[Y^{1} - Y^{0}] =y01+y11+...+yn1ny00+y10+...+yn0n= \frac{y_{0}^{1} + y_{1}^{1} + ... + y_{n}^{1}}{n} - \frac{y_{0}^{0} + y_{1}^{0} + ... + y_{n}^{0}}{n}
  • Here, each yiy_{i} represents one known observation or one unknown observation (either from the control or treatment group)
  • We can use SDOSDO to estimate the population parameter ATEATE
  • And, SDOSDO represents the simple difference in means

    • Again, ATEATE is a theoretical, unknown population parameter
    • Whereas, SDOSDO is an actual, known estimator or statistic
  • In practice, SDOSDO is a naive estimation of ATEATE

    • There are more sophisticated methods for estimating ATEATE
    • Such as, subclassification, matching, etc.

Estimating the Unknown ATEATE Parameter

  • The following formula for SDOSDO is defined as the following:
ATE^=SDO=E[Yi1ti=1]E[Yi0ti=0]\hat{ATE} = SDO = E[Y_{i}^{1} | t_{i} = 1] - E[Y_{i}^{0} | t_{i} = 0] =y01+y11+...+yj1+ym1my00+y10+...+yk0+yp0p= \frac{y_{0}^{1} + y_{1}^{1} + ... + y_{j}^{1} + y_{m}^{1}}{m} - \frac{y_{0}^{0} + y_{1}^{0} + ... + y_{k}^{0} + y_{p}^{0}}{p}
  • Here, each yjy_{j} represents a known observation from the treatment group of mm total observations
  • And, each yky_{k} represents a known observation from the control group of pp total observations
  • Notice, the unknown observations are excluded for each yjy_{j} and yky_{k}
  • So, we are making more assumptions by using this as an estimator

Motivating Existing Biases within SDOSDO

  • Again, the SDOSDO estimator makes a few assumptions about ATEATE and includes a few biases as a result
  • Specifically, it includes the following biases:

    • Selection bias
    • Heterogeneous treatment effect bias
  • Mathematically, these biases can be depicted in the formula:
SDO=ATESDO = ATE +E[Y0ti=1]E[Y0ti=0]Selection bias+ \underbrace{E[Y^{0} | t_{i} = 1] - E[Y^{0} | t_{i} = 0]}_{\text{Selection bias}} +(1γ)(ATTATU)Heterogeneous bias+ \underbrace{(1-\gamma)(ATT-ATU)}_{\text{Heterogeneous bias}}
  • Here, the above notation represents the following:

    • γ\gamma is the number of observations in the treatment goup
    • (1γ)(1-\gamma) is the number of observations in the control group

Describing the Biases within ATEATE

  • Selection bias is the inherent difference between the two groups if both received the treatment
  • The heterogenous treatment effect bias is another form of bias
  • Typically, we'll assume that treatment effects are constant
  • Which, will cause ATU=ATTATU = ATT
  • Thus, it will make SDO=ATE+selection biasSDO = ATE + \text{selection bias}
  • These biases can be mitigated through methods like:

    • Subclassification
    • Matching
    • Etc.

Defining Average Treatment Effects for Treatments

  • The formula for ATTATT is defined as the following:

    • Here, mm is the number of observations in the treatment group
ATT=E[δti=1]=E[Yi1ti=1]E[Yi0ti=1]\bold{ATT} = E[\delta | t_{i} = 1] = E[Y_{i}^{1} | t_{i} = 1] - E[Y_{i}^{0} | t_{i} = 1] δ01+δ11+...+δm1m\approx \frac{\delta_{0}^{1} + \delta_{1}^{1} + ... + \delta_{m}^{1}}{m}

Defining Average Treatment Effects for Controls

  • The formula for ATUATU is defined as the following:

    • Here, pp is the number of observations in the control group
ATU=E[δti=0]=E[Yi1ti=0]E[Yi0ti=0]\bold{ATU} = E[\delta | t_{i} = 0] = E[Y_{i}^{1} | t_{i} = 0] - E[Y_{i}^{0} | t_{i} = 0] δ00+δ10+...+δp0p\approx \frac{\delta_{0}^{0} + \delta_{1}^{0} + ... + \delta_{p}^{0}}{p}

Verifying the Assumption of Independence

  • To draw conclusions about causality, we must verify there isn't any bias between the control and treatment groups
  • To do this, we must check that the following assumption is satisfied:
E[Y0t=0]=E[Y0t=1]E[Y^{0} | t=0] = E[Y^{0} | t=1]
  • Physical randomization can satisfy this assumption of independence (i.e. independent assignment of observations to groups)

    • Physical randomization refers to assigning an observation to a treatment or control group
    • The assignment is random if the groups are as balanced as if they were assigned by flipping a coin

Describing the Assumption of Independence

  • We can assume SDO=ATESDO = ATE when a treatment tt is assigned to patients independent of their potential outcomes Y0Y^{0} or Y1Y^{1}
  • In summary, independence implies that the observations in both the treatment and control groups have the same potential outcome on average in the population
  • Independence can be ensured using the following techniques:

    • Physical randomization
    • Conditional independence
  • In reality, achieving independence is difficult and usually unlikely

    • Hence, we must use additional methods like subclassification, matching, etc.

Enforcing Independence with Physical Randomization

  • In other words, all biases are eliminated by randomly assigning observations to a treatment group and control group
  • So, randomization of the treatment assignment would eliminate both selection bias and heterogeneous treatment effect bias

    • Thus, SDOSDO no longer suffers from selection bias
  • Mathematically, randomization of the treatment assignment ensures:
E[Y0t=1]E[Y0t=0]=0E[Y^{0} | t = 1] - E[Y^{0} | t = 0] = 0
  • Again, randomization of the treatment assignment also eliminates any heterogeneous treatment effect bias
  • Since, randomization of the treatment assignment also ensures:
E[Y1t=1]E[Y1t=0]=0E[Y^{1} | t = 1] - E[Y^{1} | t = 0] = 0

More Details about Association and Causation

  • Association refers to a statistical relationship between two variables
  • Causation refers to determing that an exposure of one variable produces an effect on a different variable
  • Association becomes causation if a variable YY in the treatment group doesn't behave any differently in the control group
  • So, association becomes causation when the following is true:
E[Y0T=0]=E[Y0T=1]E[Y_{0} | T=0] = E[Y_{0} | T=1]
  • Here, YtY_{t} refers to a variable being measured within the study
  • And, TT refers to a binary variable representing whether the observation received treatment or not

References

Previous
Next

What is Causality?

Evaluating Assumptions