Measuring Causal Effects
-
There are three unknown parameters that can be estimated to help measure causal effects:
- : Average treatment effect
- : Average treatment effect for the treatment group
- : Average treatment effect for the control group
- The parameter is dependent on and
- If is positive and large, then the treatment has a large, positive effect on average (compared to the control group)
- If is negative and large, then the treatment has a large, negative effect on average (compared to the control group)
- If is small or , then the treatment doesn't really have any effect on average (compared to the control group)
Defining Average Treatment Effects
- The formula for is defined as the following:
- Here, each represents one known observation or one unknown observation (either from the control or treatment group)
- We can use to estimate the population parameter
-
And, represents the simple difference in means
- Again, is a theoretical, unknown population parameter
- Whereas, is an actual, known estimator or statistic
-
In practice, is a naive estimation of
- There are more sophisticated methods for estimating
- Such as, subclassification, matching, etc.
Estimating the Unknown Parameter
- The following formula for is defined as the following:
- Here, each represents a known observation from the treatment group of total observations
- And, each represents a known observation from the control group of total observations
- Notice, the unknown observations are excluded for each and
- So, we are making more assumptions by using this as an estimator
Motivating Existing Biases within
- Again, the estimator makes a few assumptions about and includes a few biases as a result
-
Specifically, it includes the following biases:
- Selection bias
- Heterogeneous treatment effect bias
- Mathematically, these biases can be depicted in the formula:
-
Here, the above notation represents the following:
- is the number of observations in the treatment goup
- is the number of observations in the control group
Describing the Biases within
- Selection bias is the inherent difference between the two groups if both received the treatment
- The heterogenous treatment effect bias is another form of bias
- Typically, we'll assume that treatment effects are constant
- Which, will cause
- Thus, it will make
-
These biases can be mitigated through methods like:
- Subclassification
- Matching
- Etc.
Defining Average Treatment Effects for Treatments
-
The formula for is defined as the following:
- Here, is the number of observations in the treatment group
Defining Average Treatment Effects for Controls
-
The formula for is defined as the following:
- Here, is the number of observations in the control group
Verifying the Assumption of Independence
- To draw conclusions about causality, we must verify there isn't any bias between the control and treatment groups
- To do this, we must check that the following assumption is satisfied:
-
Physical randomization can satisfy this assumption of independence (i.e. independent assignment of observations to groups)
- Physical randomization refers to assigning an observation to a treatment or control group
- The assignment is random if the groups are as balanced as if they were assigned by flipping a coin
Describing the Assumption of Independence
- We can assume when a treatment is assigned to patients independent of their potential outcomes or
- In summary, independence implies that the observations in both the treatment and control groups have the same potential outcome on average in the population
-
Independence can be ensured using the following techniques:
- Physical randomization
- Conditional independence
-
In reality, achieving independence is difficult and usually unlikely
- Hence, we must use additional methods like subclassification, matching, etc.
Enforcing Independence with Physical Randomization
- In other words, all biases are eliminated by randomly assigning observations to a treatment group and control group
-
So, randomization of the treatment assignment would eliminate both selection bias and heterogeneous treatment effect bias
- Thus, no longer suffers from selection bias
- Mathematically, randomization of the treatment assignment ensures:
- Again, randomization of the treatment assignment also eliminates any heterogeneous treatment effect bias
- Since, randomization of the treatment assignment also ensures:
More Details about Association and Causation
- Association refers to a statistical relationship between two variables
- Causation refers to determing that an exposure of one variable produces an effect on a different variable
- Association becomes causation if a variable in the treatment group doesn't behave any differently in the control group
- So, association becomes causation when the following is true:
- Here, refers to a variable being measured within the study
- And, refers to a binary variable representing whether the observation received treatment or not