6.4 Normal Distribution (Pinkie Length)

3 min readjune 25, 2024

Normal distributions are key in statistics, shaping how we understand data spread. For pinkie length, this curve helps predict how common different measurements are, using and .

Z-scores tell us how far a pinkie length is from average, in standard deviations. The estimates data ranges, showing most lengths fall close to the mean. These tools help interpret pinkie length variations.

Normal Distribution and Pinkie Length

Probabilities in normal distributions

Top images from around the web for Probabilities in normal distributions
Top images from around the web for Probabilities in normal distributions
  • Normal distribution is a and bell-shaped ()
    • Defined by mean (μ\mu) and standard deviation (σ\sigma)
    • Total area under curve equals 1 or 100% ( of all possible outcomes)
  • Calculating probabilities for pinkie length measurements involves:
    • Finding mean and standard deviation of pinkie length distribution
    • Converting pinkie length to using formula: z=xμσz = \frac{x - \mu}{\sigma}
      • xx represents pinkie length measurement
      • μ\mu represents mean of distribution
      • σ\sigma represents standard deviation of distribution
    • Using table or calculator to find probability associated with
      • Values less than mean use area to left of z-score
      • Values greater than mean use area to right of z-score
      • Values between two pinkie lengths subtract area to left of smaller z-score from area to left of larger z-score

Interpretation of z-scores

  • Z-score represents number of standard deviations an individual pinkie length is from mean of distribution
    • indicate pinkie lengths above mean (longer than average)
    • indicate pinkie lengths below mean (shorter than average)
  • Magnitude of z-score indicates distance from mean in standard deviations
    • Z-score of 1 means pinkie length is 1 standard deviation above mean
    • Z-score of -2 means pinkie length is 2 standard deviations below mean
  • Comparing z-scores determines relative position of individual pinkie length within distribution
    • Larger positive z-score indicates pinkie length farther above mean compared to smaller positive z-score
    • Larger negative z-score indicates pinkie length farther below mean compared to smaller negative z-score
  • Z-scores can be used to determine percentiles, which represent the percentage of values below a given data point

Empirical rule for data ranges

  • Empirical rule () describes percentage of data within specific ranges of standard deviations from mean in normal distribution
    • ~68% of data falls within 1 standard deviation of mean (μ±1σ\mu \pm 1\sigma)
    • ~95% of data falls within 2 standard deviations of mean (μ±2σ\mu \pm 2\sigma)
    • ~99.7% of data falls within 3 standard deviations of mean (μ±3σ\mu \pm 3\sigma)
  • Estimating percentage of pinkie lengths within specific range:
    1. Determine range in standard deviations from mean
      • Pinkie lengths between 1 standard deviation below and 1 standard deviation above mean
    2. Use empirical rule to estimate percentage of pinkie lengths within range
      • In example above, ~68% of pinkie lengths would fall within range
  • Empirical rule provides quick estimate of percentage of data within specific ranges without calculating exact probabilities

Distribution characteristics

  • measures the asymmetry of the distribution, which is zero for a perfectly symmetric normal distribution
  • describes the shape of the distribution's tails and peak relative to a normal distribution
  • The states that the sampling distribution of the mean approaches a normal distribution as the sample size increases, regardless of the population's distribution

Key Terms to Review (23)

68-95-99.7 Rule: The 68-95-99.7 rule, also known as the empirical rule, is a fundamental concept in statistics that describes the distribution of data in a normal or bell-shaped curve. It provides a general guideline for understanding the proportion of data that falls within certain standard deviation ranges from the mean.
Bell-Shaped: A bell-shaped curve, also known as a normal distribution, is a symmetrical, unimodal probability distribution that is shaped like a bell. It is characterized by a single peak at the mean, with the data points tapering off evenly on both sides, creating a symmetrical, bell-like appearance. This distribution is widely observed in various natural and statistical phenomena, making it a fundamental concept in probability and statistics.
Central Limit Theorem: The Central Limit Theorem states that when a sample of size 'n' is taken from any population with a finite mean and variance, the distribution of the sample means will tend to be normally distributed as 'n' becomes large, regardless of the original population's distribution. This theorem allows for the use of normal probability models in various statistical applications, making it fundamental for inference and hypothesis testing.
Central limit theorem for means: The Central Limit Theorem for Sample Means states that the distribution of sample means will approximate a normal distribution, regardless of the population's distribution, provided the sample size is sufficiently large. This approximation improves as the sample size increases.
Continuous Probability Distribution: A continuous probability distribution is a type of probability distribution where the random variable can take on any value within a given range or interval, rather than being limited to discrete values. This type of distribution is used to model continuous phenomena, such as measurements or quantities that can vary smoothly and take on an infinite number of possible values.
Empirical Rule: The Empirical Rule, also known as the 68-95-99.7 rule, is a statistical principle that describes the distribution of data in a normal or bell-shaped curve. It provides a framework for understanding the relationship between the standard deviation and the percentage of data that falls within certain ranges around the mean.
Kurtosis: Kurtosis is a statistical measure that describes the distribution of a dataset, specifically the degree of peakedness or flatness of the distribution curve. It provides information about the shape of the tails of the distribution, indicating whether the tails are heavier or lighter compared to a normal distribution.
Mean: The mean, also known as the average, is a measure of central tendency that represents the arithmetic average of a set of values. It is calculated by summing up all the values in the dataset and dividing by the total number of values. The mean provides a central point that summarizes the overall distribution of the data.
Negative z-scores: A negative z-score indicates that a data point is below the mean of a distribution. In the context of normal distribution, this means that the pinkie length, for example, is shorter than the average pinkie length in the population being studied. Understanding negative z-scores helps in assessing how unusual or typical a particular measurement is within a normal distribution framework.
Normal Curve: The normal curve, also known as the Gaussian distribution, is a symmetrical, bell-shaped probability distribution that is widely used in statistics. It is a continuous probability distribution that describes the way in which many natural and social phenomena are distributed.
Percentile: A percentile is a measure used in statistics indicating the value below which a given percentage of observations fall. For example, the 50th percentile is the median.
Percentile: A percentile is a statistical measure that indicates the relative standing of a value within a distribution of values. It represents the percentage of values in the distribution that are less than or equal to the given value.
Positive Z-Scores: A positive z-score indicates that a data point is above the mean of a normal distribution. It represents the number of standard deviations a data point is above the mean, providing information about the relative position of the data point within the distribution.
Probability: Probability is the measure of the likelihood of an event occurring. It is a fundamental concept in statistics that quantifies the uncertainty associated with random events or outcomes. Probability is central to understanding and analyzing data, making informed decisions, and drawing valid conclusions.
Sigma (Σ): Sigma (Σ) is a mathematical symbol used to represent the summation or addition of a series of numbers or values. It is a fundamental concept in statistics and is used extensively in various statistical analyses and calculations.
Skewness: Skewness is a measure of the asymmetry or lack of symmetry in the distribution of a dataset. It describes the extent to which a probability distribution or a data set deviates from a normal, symmetric distribution.
Standard Deviation: Standard deviation is a statistic that measures the dispersion or spread of a set of values around the mean. It helps quantify how much individual data points differ from the average, indicating the extent to which values deviate from the central tendency in a dataset.
Standard normal distribution: The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1. It is used to standardize scores from different normal distributions for comparison.
Standard Normal Distribution: The standard normal distribution, also known as the Z-distribution, is a special case of the normal distribution where the mean is 0 and the standard deviation is 1. It is a fundamental concept in statistics that is used to model and analyze data that follows a normal distribution.
Symmetric: Symmetric refers to a balanced and equal distribution of data, where the left and right sides of a graph mirror each other. In this context, a symmetric distribution indicates that the mean, median, and mode are all located at the center, creating a visually appealing shape that is often associated with normal distributions. When analyzing data, recognizing symmetry helps in understanding the overall behavior and characteristics of the dataset.
Z-score: A z-score represents the number of standard deviations a data point is from the mean. It is used to determine how unusual a particular observation is within a normal distribution.
Z-Score: A z-score is a standardized measure that expresses how many standard deviations a data point is from the mean of a distribution. It allows for the comparison of data points across different distributions by converting them to a common scale.
μ: The symbol 'μ' represents the population mean in statistics, which is the average of all data points in a given population. Understanding μ is essential as it serves as a key measure of central tendency and is crucial in the analysis of data distributions, impacting further calculations related to spread, normality, and hypothesis testing.
© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.