Drill 2 · Math · Scatterplots and Lines of Best Fit
SAT Math: Scatterplots and Lines of Best Fit (Drill 2) is a Math practice drill covering Scatterplots and Lines of Best Fit. It contains 5 original questions created by Brian Stewart, a Barron's test prep author with over 20 years of tutoring experience.
SAT data modeling questions cover selecting between linear and exponential models, interpreting exponential growth and decay equations, understanding how data transformations affect model fit, and comparing predicted values to actual data points.
| x | y |
|---|---|
| 0 | 3 |
| 1 | 6 |
| 2 | 11 |
| 3 | 25 |
| 4 | 49 |
| 5 | 98 |
Question 1. Which type of function best models the relationship between x and y in the data?
Explanation: Check the ratios of consecutive y-values: 6/3 = 2, 11/6 ≈ 1.8, 25/11 ≈ 2.3, 49/25 ≈ 2.0, 98/49 = 2.0. The y-values approximately double each time x increases by 1, which is the hallmark of exponential growth. A linear model would require constant differences (not ratios), and the differences here are 3, 5, 14, 24, 49, clearly not constant. Choice A is incorrect because the increase is not constant.
Question 2. A scatterplot shows the amount of a medication, in milligrams, remaining in a patient's bloodstream over time. An equation for the exponential model can be written as y = a · bx, where a and b are positive constants, x is the number of hours, and y is the amount remaining. The model shows a decreasing trend. Which of the following is closest to the value of b?
Explanation: For an exponential model y = a · bx where a and b are positive constants: if 0 < b < 1, the model is decreasing (decay); if b > 1, the model is increasing (growth). Since the model shows a decreasing trend, b must be between 0 and 1. Of the given choices, only 0.72 satisfies 0 < b < 1. Choice B (1.00) would mean no change. Choices C and D are greater than 1 and would indicate growth.
| Time (s) | Height (m) |
|---|---|
| 0 | 2 |
| 1 | 27 |
| 2 | 42 |
| 3 | 47 |
| 4 | 42 |
| 5 | 27 |
| 6 | 2 |
Question 3. Of the following, which equation best models the data?
Explanation: The data shows a parabolic shape, height increases, reaches a maximum near x = 3, then decreases symmetrically. This indicates a downward-opening parabola (negative leading coefficient). Check Choice A at x = 0: y = −5(0) + 30(0) + 2 = 2 ✓. At x = 3: y = −5(9) + 30(3) + 2 = −45 + 90 + 2 = 47 ✓. At x = 6: y = −5(36) + 30(6) + 2 = −180 + 180 + 2 = 2 ✓. Choice B has a positive leading coefficient, which would open upward. Choice D is linear and cannot model the curved pattern.
Question 4. A line of best fit for a data set has the equation y = 1.5x + 8. A new data set is created by multiplying the y-coordinate of each data point by 4. Which of the following could be an equation of a line of best fit for the new data set?
Explanation: When every y-value is multiplied by 4, the entire equation is scaled by 4: ynew = 4(1.5x + 8) = 6x + 32. The slope is multiplied by 4 (1.5 × 4 = 6) and the y-intercept is multiplied by 4 (8 × 4 = 32). Choice B only multiplies the intercept. Choice C only multiplies the slope. Choice D adds 4 instead of multiplying.
| Age (years) | Price ($) |
|---|---|
| 1 | 28,500 |
| 2 | 24,200 |
| 3 | 22,800 |
| 4 | 18,500 |
| 5 | 16,100 |
Question 5. According to the line of best fit, what is the predicted price of a 3-year-old car, and how does it compare to the actual data point?
Explanation: Substitute x = 3 into the equation: y = −3,100(3) + 31,400 = −9,300 + 31,400 = 22,100. The actual price at age 3 is $22,800. The difference is 22,800 − 22,100 = 700. The actual price is $700 higher than the predicted price. This difference (called the residual) is positive, meaning the data point lies above the line of best fit.