“The z-score and t-score are not interchangeable—they are tools for different jobs.” Choosing the wrong one can lead to incorrect conclusions. This article clarifies when to use each statistic and why.
In statistics and econometrics, we often test if a sample result is significant. Two common tools are the z-statistic and the t-statistic. Both measure how far a sample mean is from a population mean, in units of standard error. The core difference is about knowledge: we use z when we know the population standard deviation (σ), and we use t when we must estimate it from the sample.
Core Concept: Known vs. Unknown Standard Deviation
The choice hinges on one piece of information: the population standard deviation (σ).
- z-statistic: Use when σ is known and certain. This is rare in real-world research but common in quality control or standardized tests.
- t-statistic: Use when σ is unknown and must be estimated from the sample data using the sample standard deviation (s). This is the standard in most econometric and social science research.
z-score calculation:
z = (Sample Mean - Population Mean) / (σ / √n)
z = (502 - 500) / (5 / √50) = 2 / 0.707 ≈ 2.83
t-score calculation:
t = (Sample Mean - Claimed Mean) / (s / √n)
t = (15 - 12) / (4 / √30) = 3 / 0.73 ≈ 4.11
Degrees of freedom (df) = n - 1 = 29.
The Distributions: Normal vs. t
The statistics follow different probability distributions, which affects the critical values for hypothesis testing.
| Feature | z-Distribution (Standard Normal) | t-Distribution |
|---|---|---|
| Shape | Fixed, bell-shaped curve | Similar bell shape, but heavier tails |
| Parameter | None (mean=0, sd=1) | Degrees of freedom (df = n-1) |
| Tails | Thinner | Fatter (more area in the tails) |
| Use Case | Known σ or very large n (n > 30) | Unknown σ and small to moderate n |
| Critical Value (95% CI) | ±1.96 | Depends on df (e.g., ±2.045 for df=29) |
⚠️ Common Pitfalls and Misconceptions
- Pitfall 1: Using z when σ is unknown. This underestimates uncertainty, making results seem more significant than they are. Always use t when estimating σ from the sample.
- Pitfall 2: Thinking n=30 is a strict rule. The "large sample" rule (use z if n > 30) is a simplification. Modern practice favors using t whenever σ is unknown, regardless of sample size.
- Pitfall 3: Confusing the test statistic with the critical value. The z or t score you calculate is compared to a critical value from its respective distribution table to make a decision.
Decision Rule: When to Use Which?
Follow this simple flowchart in your analysis:
- Do you know the population standard deviation (σ) with certainty?
- YES → Use the z-statistic and the standard normal distribution.
- NO → Proceed to step 2.
- You must estimate σ from your sample. Use the t-statistic and the t-distribution with df = n-1.
In econometrics, we almost always deal with unknown population parameters, making the t-statistic the default tool for regression coefficient significance tests (t-tests).
We estimate the coefficient β1 = 2.5 with a standard error (SE) of 0.8.
To test if education significantly affects wages (H0: β1 = 0):
t-statistic = (Estimated Coefficient - Null Value) / Standard Error
t = (2.5 - 0) / 0.8 = 3.125
We compare t=3.125 to a t-distribution (df based on sample size) to find the p-value.
Even with a large n=1000, because we are estimating the standard deviation (of the proportion) from the sample, modern practice uses the t-statistic. However, with such large df, the t-distribution is virtually identical to the z-distribution, so the critical values (±1.96) are often used for convenience.