Solved Problems

The following problems cover linear regression, correlation analysis, hypothesis testing, and related conceptual interpretations.

Problem 1: Simple Linear Regression Coefficients (Basic)

A concrete testing laboratory recorded the curing time (xx, in days) and the resulting compressive strength (yy, in MPa) for five samples: (7, 21), (14, 28), (21, 35), (28, 41), and (35, 45). Determine the simple linear regression equation, y^=b0+b1x\hat{y} = b_0 + b_1 x.

Step-by-Step Solution

0 of 4 Steps Completed
1

Problem 2: Correlation Coefficient (Basic)

Using the data from Problem 1 (curing time xx and compressive strength yy), calculate the Pearson correlation coefficient (rr). The additional sum required is y2=212+282+352+412+452=6156\sum y^2 = 21^2 + 28^2 + 35^2 + 41^2 + 45^2 = 6156.

Step-by-Step Solution

0 of 5 Steps Completed
1

Problem 3: Prediction and Residuals (Intermediate)

For the concrete testing model derived in Problem 1 (y^=15.7+0.871x\hat{y} = 15.7 + 0.871 x), what is the predicted compressive strength for a curing time of 21 days? Calculate the residual for the actual data point (21, 35).

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 4: Coefficient of Determination (Intermediate)

Given the correlation coefficient r=0.9948r = 0.9948 from Problem 2, calculate the coefficient of determination (R2R^2) and interpret its meaning in the context of the concrete curing data.

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 5: Sums of Squares (Advanced)

A transportation engineer analyzes the relationship between speed limit (xx) and average vehicle speed (yy). The regression model yields SST=450SST = 450 and SSE=75SSE = 75. Calculate the Sum of Squares due to Regression (SSRSSR) and the Coefficient of Determination (R2R^2).

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 6: Standard Error of the Estimate (Intermediate)

Using the data from Problem 5 (SSE=75SSE = 75) and assuming a sample size of n=12n = 12 roads, calculate the standard error of the estimate (ses_e).

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 7: Simple Linear Regression Hypothesis Testing (Advanced)

For the speed limit vs. average speed data, the estimated slope is b1=1.05b_1 = 1.05. The standard error of the slope (sb1s_{b1}) is calculated to be 0.150.15. Test the hypothesis that there is a significant linear relationship between the variables at α=0.05\alpha = 0.05 (sample size n=12n=12).

Step-by-Step Solution

0 of 4 Steps Completed
1

Problem 8: Linearizing an Exponential Relationship (Advanced)

A geotechnical engineer measures the settlement of a foundation over time and finds it follows an exponential decay model: y=aebxy = a e^{bx}, where yy is settlement and xx is time. Show how to transform this into a linear model to find parameters aa and bb using linear regression.

Step-by-Step Solution

0 of 3 Steps Completed
1

Problem 9: Multiple Linear Regression Calculation (Intermediate)

A multiple regression model predicts the heating load (YY, in kW) of a building based on wall area (X1X_1, in m2m^2) and roof area (X2X_2, in m2m^2). The estimated model is Y^=12.5+0.15X1+0.22X2\hat{Y} = 12.5 + 0.15 X_1 + 0.22 X_2. Calculate the predicted heating load for a building with 200m2200 \, \text{m}^2 of wall area and 150m2150 \, \text{m}^2 of roof area.

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 10: Multiple Regression Overall F-Test (Advanced)

For a multiple regression model predicting river flow rate based on rainfall and snowmelt, the sample size is n=30n = 30, the number of predictors is k=2k = 2, SST=1200SST = 1200, and SSR=950SSR = 950. Conduct an ANOVA F-test for the overall significance of the model at α=0.05\alpha = 0.05.

Step-by-Step Solution

0 of 4 Steps Completed
1

Problem 11: t-Tests for Individual Coefficients (Advanced)

Following Problem 10, the estimated coefficient for rainfall (X1X_1) is b1=2.4b_1 = 2.4 with standard error sb1=0.8s_{b1} = 0.8. The coefficient for snowmelt (X2X_2) is b2=1.1b_2 = 1.1 with standard error sb2=0.9s_{b2} = 0.9. Test the significance of each individual predictor at α=0.05\alpha = 0.05 (df=27df = 27).

Step-by-Step Solution

0 of 3 Steps Completed
1

Problem 12: Correlation vs. Causation (Conceptual)

A study shows a very high positive correlation (r=0.92r = 0.92) between the amount of asphalt used in a city during the summer and the number of heatstroke incidents. Can the city's health department conclude that laying asphalt directly causes heatstroke?

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 13: The Dangers of Extrapolation (Conceptual)

A structural engineer models the yielding stress of an alloy (yy) based on temperature (xx) for temperatures ranging from 20C20^\circ \text{C} to 100C100^\circ \text{C}, finding a strong linear relationship. Why might it be dangerous to use this regression model to predict the yielding stress at 300C300^\circ \text{C}?

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 14: Interpreting Residual Plots (Conceptual)

After fitting a simple linear regression model to predict pipe flow rate based on pressure, an engineer plots the residuals (eie_i) on the y-axis against the predicted values (y^i\hat{y}_i) on the x-axis. The plot shows a distinct "fan" or "funnel" shape, where the spread of the residuals increases as the predicted value increases. What does this indicate?

Step-by-Step Solution

0 of 2 Steps Completed
1

Problem 15: Adjusted R-squared in Multiple Regression (Conceptual)

An environmental engineer adds a fifth predictor variable to a model predicting air pollution levels. The standard R2R^2 value increases slightly, but the Adjusted R2R^2 value decreases. Explain why this happens and what it means.

Step-by-Step Solution

0 of 3 Steps Completed
1