Mathematical function that describes the probability of occurrence of different possible outcomes in an experiment.
Probability distributions are a fundamental concept in statistics. They can be divided into two main types: discrete and continuous probability distributions.
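Discrete probability distributions are used when a variable can take only a countable set of values, such as the integers 0, 1, 2, and so on. Each possible value is assigned its own probability, and these probabilities sum to 1. Here are some of the most common types: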
The binomial distribution models the number of successes in a fixed number of independent trials, where each trial has exactly two possible outcomes, often referred to as "success" and "failure". It is defined by two parameters: the number of trials (n) and the probability of success in a single trial (p), which is the same for every trial. The distribution is used to determine the probability of observing a specified number of successes in those n trials.
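As a minimal sketch of that calculation, the probability of exactly k successes is C(n, k) · p^k · (1 − p)^(n − k). The function name `binomial_pmf` and the coin-flip example are illustrative choices, not part of the text above; only the Python standard library is assumed.

```python
from math import comb

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ Binomial(n, p). Illustrative helper, not a library API."""
    if not 0 <= k <= n:
        return 0.0  # impossible counts have zero probability
    # Number of ways to place k successes among n trials,
    # times the probability of any one such arrangement.
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Probability of exactly 3 heads in 10 fair coin flips.
prob_three_heads = binomial_pmf(3, 10, 0.5)
print(prob_three_heads)  # 0.1171875
```

Summing `binomial_pmf(k, n, p)` over k = 0, ..., n gives 1, which is a quick sanity check on any implementation.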
The Poisson distribution is used to model the number of events occurring within a fixed interval of time or space. Its single parameter λ is the expected number of events in that interval. It is used in situations where events occur independently and at a constant average rate.
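The Poisson probability of observing exactly k events is λ^k · e^(−λ) / k!. The sketch below evaluates it with the standard library; the helper name `poisson_pmf` and the rate of 2 events per interval are assumptions made for illustration.

```python
from math import exp, factorial

def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k) for X ~ Poisson(lam). Illustrative helper, not a library API."""
    if k < 0:
        return 0.0  # event counts cannot be negative
    return lam**k * exp(-lam) / factorial(k)

# Hypothetical example: an average of 2 events per interval.
lam = 2.0
prob_no_events = poisson_pmf(0, lam)  # equals e^(-2)

# The probability-weighted average of k recovers lam (truncated at k = 59,
# beyond which the terms are negligible for this rate).
approx_mean = sum(k * poisson_pmf(k, lam) for k in range(60))
```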
The geometric distribution models the number of trials needed to get the first success in repeated, independent Bernoulli trials. The parameter p is the probability of success on any given trial.
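Under the "number of trials up to and including the first success" convention used here, the probability that the first success occurs on trial k is (1 − p)^(k − 1) · p for k = 1, 2, 3, and so on. A minimal sketch, with `geometric_pmf` as a hypothetical helper name:

```python
def geometric_pmf(k: int, p: float) -> float:
    """P(first success occurs on trial k), for k = 1, 2, 3, ...

    Illustrative helper, not a library API.
    """
    if k < 1:
        return 0.0  # at least one trial is needed
    # k - 1 failures in a row, followed by one success.
    return (1 - p) ** (k - 1) * p

# With p = 0.5, the first success lands on the third trial with probability 1/8.
prob_third_trial = geometric_pmf(3, 0.5)
print(prob_third_trial)  # 0.125
```

Some references instead count the failures before the first success, which shifts k by one; it is worth checking which convention a given library or textbook uses.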
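Continuous probability distributions are used when a variable can take any value within an interval of real numbers, so the set of possible values is uncountably infinite. Probabilities are assigned to ranges of values through a probability density function rather than to individual values. Here are some of the most common types: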
The normal distribution, also known as the Gaussian distribution, is a bell-shaped curve that is symmetric about its mean. It is defined by two parameters: the mean (μ) and the standard deviation (σ). The normal distribution is widely used in statistics and the natural and social sciences as a simple model for complex random variables.
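The normal density at x is exp(−(x − μ)² / (2σ²)) / (σ√(2π)). The sketch below writes that formula out by hand and compares it with `statistics.NormalDist` from the Python standard library (Python 3.8+); the helper name `normal_pdf` is an illustrative assumption.

```python
from math import exp, pi, sqrt
from statistics import NormalDist

def normal_pdf(x: float, mu: float, sigma: float) -> float:
    """Density of Normal(mu, sigma) at x. Illustrative helper."""
    z = (x - mu) / sigma  # distance from the mean in standard deviations
    return exp(-0.5 * z * z) / (sigma * sqrt(2 * pi))

# Standard normal: mean 0, standard deviation 1.
standard = NormalDist(mu=0.0, sigma=1.0)

# The hand-written formula and the library agree on the density.
density_by_hand = normal_pdf(1.3, 0.0, 1.0)
density_by_library = standard.pdf(1.3)

# Probability of falling within one standard deviation of the mean
# (roughly 68%).
within_one_sigma = standard.cdf(1.0) - standard.cdf(-1.0)
```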
The exponential distribution describes the time between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average rate. It has a single parameter λ, which is the rate parameter.
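For rate λ, the exponential density is λ · e^(−λx) for x ≥ 0, and the probability that the waiting time is at most x is 1 − e^(−λx). A minimal sketch of both, with hypothetical helper names and an assumed rate of 0.5 events per unit time:

```python
from math import exp

def exponential_pdf(x: float, lam: float) -> float:
    """Density of Exponential(lam) at x. Illustrative helper."""
    return lam * exp(-lam * x) if x >= 0 else 0.0

def exponential_cdf(x: float, lam: float) -> float:
    """P(X <= x): probability the wait for the next event is at most x."""
    return 1.0 - exp(-lam * x) if x >= 0 else 0.0

# Assumed rate: 0.5 events per unit time, so the mean wait is 1 / 0.5 = 2.
lam = 0.5
prob_wait_at_most_mean = exponential_cdf(2.0, lam)  # 1 - e^(-1), about 0.632
```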
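In a continuous uniform distribution, every value between the minimum and the maximum is equally likely, meaning the probability density is constant over that interval and zero outside it. It is defined by two parameters: the minimum (a) and the maximum (b). The uniform distribution is often used in computer simulations.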
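The constant density on [a, b] is 1 / (b − a). The sketch below pairs that formula with a small simulation using `random.uniform` from the Python standard library; the interval [2, 5], the seed, and the sample size are arbitrary choices for illustration.

```python
import random

def uniform_pdf(x: float, a: float, b: float) -> float:
    """Density of Uniform(a, b) at x. Illustrative helper."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

a, b = 2.0, 5.0

# Seeded generator so the simulation is reproducible.
rng = random.Random(0)
samples = [rng.uniform(a, b) for _ in range(100_000)]

# The sample mean should land close to the midpoint (a + b) / 2 = 3.5.
sample_mean = sum(samples) / len(samples)
```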
Understanding these distributions and their properties is crucial in many areas of statistics, including hypothesis testing, confidence intervals, and regression analysis. By mastering these concepts, you will be well-equipped to analyze and interpret statistical data effectively.