What is Standard Deviation?
Standard deviation measures how spread out values are from the mean. A low standard deviation means data is clustered near the mean; a high value means data is widely spread.
Population Ļ = ā[Ī£(xįµ¢ ā μ)² / N]
Sample s = ā[Ī£(xįµ¢ ā xĢ)² / (Nā1)]
Sample s = ā[Ī£(xįµ¢ ā xĢ)² / (Nā1)]
Steps to Calculate
- Find the mean (average) of the dataset.
- Subtract the mean from each value; square the result.
- Find the average of all squared differences (variance).
- Take the square root of the variance.
Standard Deviation Step-by-Step Worked Example
Dataset: {2, 4, 4, 4, 5, 5, 7, 9} (classic textbook example)
Step 1 ā Mean: (2+4+4+4+5+5+7+9)/8 = 40/8 = 5
Step 2 ā Squared deviations:
(2ā5)²=9, (4ā5)²=1, (4ā5)²=1, (4ā5)²=1,
(5ā5)²=0, (5ā5)²=0, (7ā5)²=4, (9ā5)²=16
Step 3 ā Sum = 9+1+1+1+0+0+4+16 = 32
Step 4 ā Population variance = 32/8 = 4
Step 5 ā Population SD Ļ = ā4 = 2
Step 2 ā Squared deviations:
(2ā5)²=9, (4ā5)²=1, (4ā5)²=1, (4ā5)²=1,
(5ā5)²=0, (5ā5)²=0, (7ā5)²=4, (9ā5)²=16
Step 3 ā Sum = 9+1+1+1+0+0+4+16 = 32
Step 4 ā Population variance = 32/8 = 4
Step 5 ā Population SD Ļ = ā4 = 2
Standard Deviation Reference Ranges
| Range | % of Data (Normal Distribution) |
|---|---|
| Within ±1Ļ of mean | ā 68.27% |
| Within ±2Ļ of mean | ā 95.45% |
| Within ±3Ļ of mean | ā 99.73% (the "3-sigma rule") |
These ranges are the famous 68-95-99.7 empirical rule (bell curve rule), widely used in quality control and scientific research.
Coefficient of Variation (Relative SD)
CV = (Ļ / mean) Ć 100%
Allows comparison of variability across different datasets
Example: Dataset A has SD=10, mean=100 ā CV=10%
Dataset B has SD=2, mean=10 ā CV=20% (more variable despite smaller SD)
Allows comparison of variability across different datasets
Example: Dataset A has SD=10, mean=100 ā CV=10%
Dataset B has SD=2, mean=10 ā CV=20% (more variable despite smaller SD)
Real-World Applications
- Finance: Portfolio risk ā a stock with SD of 20% is twice as volatile as one with 10%
- Quality control (Six Sigma): Manufacturing processes aim for ±6Ļ ā defect rate under 3.4 per million
- Weather forecasting: Temperature variability and climate change analysis
- Academic testing: Standardized scores (Z-scores) use SD to normalize results