Joint Probability Distributions
Learning Objectives
- Understand joint probability mass and density functions for multiple random variables.
- Calculate and interpret marginal and conditional probability distributions.
- Determine whether two random variables are statistically independent.
- Calculate and interpret covariance and the correlation coefficient.
- Understand the properties and applications of the Bivariate Normal Distribution.
Joint probability distributions extend the concepts of single random variables to multiple variables, allowing engineers to analyze complex systems where multiple factors interact. This lesson covers joint, marginal, and conditional distributions, as well as measures of linear relationship like covariance and correlation.
Simultaneous Variables
In many engineering applications, we need to understand the relationship between two or more random variables simultaneously. For example, a structural engineer might study the joint distribution of wind speed () and atmospheric pressure () during a hurricane, or a transportation engineer might model the number of cars () and trucks () arriving at a toll booth.
Joint Probability Mass and Density Functions
Joint Probability Mass Function (Discrete)
For two discrete random variables and , the joint probability mass function gives the probability that takes the specific value AND takes the specific value simultaneously.
Joint Probability Mass Function
Formula for calculating joint discrete probability.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Joint probability mass function | - | |
| Discrete random variables | - | |
| Specific values of the random variables | - | |
| Probability | - |
Joint PMF Conditions
- for all .
- .
Joint Probability Density Function (Continuous)
For two continuous random variables and , the joint probability density function represents the probability that falls within a specific two-dimensional region in the -plane. The probability is the volume under the surface over region .
Joint Probability Density Function
Formula for calculating joint continuous probability over a region R.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Probability that X and Y fall in region R | - | |
| Two-dimensional region in the xy-plane | - | |
| Joint probability density function | - | |
| Differentials for x and y | - |
Joint PDF Conditions
- for all .
- .
Marginal Distributions
Isolating Variables
Sometimes we have the joint distribution of and , but we only care about the distribution of alone, regardless of . This is called the marginal distribution.
Marginal Probability Distributions
To find the marginal distribution of one variable, we sum (or integrate) out the other variable over its entire range. Similarly, is the marginal distribution for found by summing or integrating out .
Marginal Distribution (Discrete)
Formula for calculating marginal discrete probability.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Marginal distribution of X | - | |
| Joint probability mass function | - | |
| All possible values of Y | - |
Marginal Distribution (Continuous)
Formula for calculating marginal continuous probability.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Marginal distribution of X | - | |
| Joint probability density function | - | |
| Differential for y | - |
Conditional Distributions and Independence
Conditional Probability Distribution
The probability distribution of , given that has taken a specific value . This is analogous to basic conditional probability (). Similarly, provided .
Conditional Probability Distribution
Formula for conditional probability distribution.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Conditional distribution of X given Y=y | - | |
| Joint probability function | - | |
| Marginal distribution of Y | - |
Independence of Random Variables
Two random variables and are independent if and only if their joint probability distribution is the product of their marginal distributions for all possible values of . If this holds true, knowing the value of gives no information about the value of . (e.g., The compressive strength of concrete from Plant A vs. Plant B).
Independence Condition
Mathematical condition for independence of random variables.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Joint probability function | - | |
| Marginal distribution of X | - | |
| Marginal distribution of Y | - |
Covariance and Correlation
Covariance ()
A measure of how much two random variables change together. A positive covariance indicates that when is above its mean, also tends to be above its mean (e.g., traffic volume and noise levels on a highway). A negative covariance indicates an inverse relationship (e.g., the age of an asphalt pavement and its flexibility).
Covariance Formula
Formula for calculating the covariance between two random variables.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Covariance between X and Y | - | |
| Expected value operator | - | |
| Random variables | - | |
| Means of X and Y | - |
Properties of Covariance
- If and are statistically independent, their covariance is zero ().
- However, a covariance of zero does not necessarily mean they are independent (they could have a non-linear relationship).
Correlation Coefficient ()
A standardized measure of the linear relationship between two variables. Covariance depends on the units of and , making it hard to interpret the strength of the relationship. The correlation coefficient scales covariance by the standard deviations of both variables, producing a dimensionless value between -1 and 1.
Correlation Coefficient Formula
Formula for calculating the correlation coefficient.
Variables
| Symbol | Description | Unit |
|---|---|---|
| Correlation coefficient between X and Y | - | |
| Covariance between X and Y | - | |
| Standard deviations of X and Y | - |
Interpretation of Correlation
- : Perfect positive linear relationship.
- : Perfect negative linear relationship.
- : No linear relationship.
The Bivariate Normal Distribution
Bivariate Normal Distribution
When two continuous random variables are individually normally distributed and correlated, their joint behavior is described by the bivariate normal distribution. Its PDF forms a 3-dimensional bell surface (a mound) whose orientation depends on the correlation .
Key Properties of Bivariate Normal Distribution
- The marginal distributions and are both normal.
- The conditional distributions and are both normal.
- If the correlation for a bivariate normal distribution, then and are strictly independent. (This is a special property; for other distributions, does not guarantee independence).
Interactive Simulation
Interact with the simulation below to visualize joint probability distributions and marginals.
Engineering Data Analysis
Discrete Joint Probability Explorer
Probability Distribution Table
| 0.150 | ||||
| 0.600 | ||||
| 0.250 | ||||
| 0.200 | 0.500 | 0.300 | 1.000 |
If , X and Y tend to increase together. If , they vary inversely. If , there is no linear relationship.
Probability Distribution Visualizer
Bubble size & opacity indicate the magnitude of the joint probability .
Green bars represent the marginal distributions and .
Interactive Simulation
Explore the contours, density surface, and marginal distributions of a Bivariate Normal distribution by adjusting standard deviations and the correlation coefficient.
Engineering Data Analysis • Topic 7
Bivariate Normal Distribution Contours
- Joint Distributions (): Describe the simultaneous behavior of two random variables.
- Marginal Distributions (): Isolate one variable by summing or integrating out the other.
- Conditional Distributions (): The behavior of given a specific value of .
- Independence: If and are independent, .
- Covariance and Correlation: Measure the linear relationship between variables. Correlation () is standardized, always falling between -1 and 1.
- Bivariate Normal: The standard 3D bell-shaped curve for two correlated continuous variables.