Describing Discrete Distributions
- A discrete distribution is a statistical distribution that shows the probabilities of outcomes as finite values
- A discrete distribution is a function that maps some finite value from the sample space to the probability space
- In other words, a discrete distribution represents the probabilities of discrete events across some pre-determined space
Bernoulli Distribution
- A bernoulli distribution is a discrete probability distribution of a random variable that takes the value 1 with a probability and the value 0 with probability
- We can define a bernoulli random variable as the following:
- The range of a bernoulli distribution is
- A bernoulli distribution represents the probability of observing a success
- In other words, a bernoulli distribution is represented by any two-element space
- There is only one parameter that makes up the bernoulli distribution
- Specifically, the parameter (or ) represents the probability of observing a 1
- In other words, or
-
The following are examples of random variables that are represented by a bernoulli distribution:
- A random variable that maps values to either heads or tails
- A random variable that maps values to either rain or shine
- A random variable that maps values to either democrat or republican
- Bernoulli distributions are especially interesting because logistic regression assumes the observations of the response variable is a benoulli random variable
Binomial Distribution
- A binomial distribution is a discrete probability distribution of a random variable that represents the number of successes in a sample of size
- We can define a binomial random variable as the following:
- A binomial distribution represents a sequence of bernoulli trials (or a bernoulli process)
- The range of a binomial distribution is
- A binomial distribution models the number of successes in a sample size (drawn with replacement from a population)
- Therefore, there are two parameters that make up a binomial distribution
- Specifically, the parameter represents the size of the sample and the parameter represents the probability of observing a success in the sample
- Binomial distributions are especially interesting because logistic regression assumes the response variable represents a binomial random variable
Poisson Distribution
- A poisson distribution is a discrete probability distribution of a random variable that represents the number of successes occurring in a fixed interval of time
- We can define a poisson random variable as the following:
- A poisson distribution expresses the probability of a given number of events occurring in a fixed interval of time or space if these events occur with a known constant rate (and independently of the time since the last event)
- There is only one parameter that makes up a poisson distribution
- Specifically, the parameter represents the expected number of occurrences (and doesn't need to be an integer)
- Poisson distributions are especially interesting because they are used in GLMs to model rates
- Essentially, the poisson distribution is popular for modeling the number of times an event occurs in an interval of time or space
-
The following are examples of random variables that are represented by a poisson distribution:
- The number of meteorites greater than 1 meter diameter that strike Earth in a year
- The number of patients arriving in an emergency room between 10 and 11 pm
- The number of photons hitting a detector in a particular time interval
References
Previous
Next