9.5 Additional Information and Full Hypothesis Test Examples

3 min readjune 25, 2024

is a crucial tool in statistics, helping us make decisions about population parameters based on sample data. It involves setting up hypotheses, calculating test statistics, and interpreting p-values to draw conclusions about the population.

Understanding levels, p-values, and different types of tests is key. We'll explore how to conduct hypothesis tests for proportions, including steps for calculating test statistics and making decisions based on p-values. This knowledge forms the foundation for statistical inference.

Hypothesis Testing

Significance and p-value interpretation

Top images from around the web for Significance and p-value interpretation
Top images from around the web for Significance and p-value interpretation
  • (α\alpha) represents the probability of rejecting the null hypothesis when it is actually true ()
    • Common α\alpha values: 0.01, 0.05, 0.10
    • Smaller α\alpha indicates a more stringent test and higher threshold for rejecting the null hypothesis
  • represents the probability of obtaining a sample statistic as extreme as or more extreme than the observed statistic, assuming the null hypothesis is true
    • Reject the null hypothesis if p-value < α\alpha
    • Fail to reject the null hypothesis if p-value ≥ α\alpha
  • is determined before conducting the test, while p-value is calculated based on the sample data
    • Example: If α=0.05\alpha = 0.05 and p-value = 0.02, reject the null hypothesis since 0.02 < 0.05
  • The is the set of values for the that leads to rejecting the null hypothesis

Types of hypothesis tests

  • Type of determined by the (HaH_a or H1H_1)
  • : HaH_a states the population parameter is less than a specific value
    • located in the left tail of the distribution
    • Example: Ha:μ<100H_a: \mu < 100 (population mean less than 100)
  • : HaH_a states the population parameter is greater than a specific value
    • Critical region located in the right tail of the distribution
    • Example: Ha:p>0.5H_a: p > 0.5 ( greater than 0.5)
  • : HaH_a states the population parameter is not equal to a specific value
    • Critical region split equally between the left and right tails of the distribution
    • Example: Ha:μ75H_a: \mu \neq 75 (population mean not equal to 75)

Hypothesis testing for proportions

  • Steps for hypothesis testing with population proportions:
  1. State the null and alternative hypotheses
    • Null hypothesis (H0H_0): population proportion (pp) equals a specific value (p0p_0)
    • Alternative hypothesis (HaH_a): pp is less than, greater than, or not equal to p0p_0, depending on test type
  2. Determine the level of significance (α\alpha) and test type (left-tailed, right-tailed, or two-tailed)
  3. Calculate the (zz) using the (p^\hat{p}), null proportion (p0p_0), and (p0(1p0)n\sqrt{\frac{p_0(1-p_0)}{n}})
    • z=p^p0p0(1p0)nz = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}}
  4. Find the p-value using the and
    • Left-tailed test: p-value = P(Z<z)P(Z < z)
    • Right-tailed test: p-value = P(Z>z)P(Z > z)
    • Two-tailed test: p-value = 2min(P(Z<z),P(Z>z))2 \cdot \min(P(Z < z), P(Z > z))
  5. Make a decision by comparing the p-value to the level of significance
    • Reject H0H_0 if p-value < α\alpha
    • Fail to reject H0H_0 if p-value ≥ α\alpha
  6. Interpret the results in the context of the problem
    • Example: If testing whether a coin is fair (H0:p=0.5H_0: p = 0.5) and p-value = 0.03 with α=0.05\alpha = 0.05, reject H0H_0 and conclude the coin is not fair

Additional Considerations in Hypothesis Testing

  • : A range of values that likely contains the true population parameter, providing a measure of uncertainty
  • : The probability of correctly rejecting a false null hypothesis, which increases with larger sample sizes and effect sizes
  • : A measure of the magnitude of the difference between groups or the strength of a relationship
  • : The number of independent observations in a dataset that are free to vary, affecting the shape of the sampling distribution

Key Terms to Review (30)

Alternative Hypothesis: The alternative hypothesis is a statement that suggests a potential outcome or relationship exists in a statistical test, opposing the null hypothesis. It indicates that there is a significant effect or difference that can be detected in the data, which researchers aim to support through evidence gathered during hypothesis testing.
Confidence Interval: A confidence interval is a range of values used to estimate the true value of a population parameter, such as a mean or proportion, based on sample data. It provides a measure of uncertainty around the sample estimate, indicating how much confidence we can have that the interval contains the true parameter value.
Critical Region: The critical region, also known as the rejection region, is a specific range of values for a test statistic that leads to the rejection of the null hypothesis in a statistical hypothesis test. It represents the area of the sampling distribution where the test statistic is considered extreme enough to provide evidence against the null hypothesis.
Degrees of Freedom: Degrees of freedom refer to the number of independent values or quantities that can vary in a statistical calculation without breaking any constraints. It plays a crucial role in determining the appropriate statistical tests and distributions used for hypothesis testing, estimation, and data analysis across various contexts.
Effect Size: Effect size is a quantitative measure that indicates the magnitude or strength of the relationship between two variables or the difference between two groups. It provides information about the practical significance of a statistical finding, beyond just the statistical significance.
Hypothesis test: A hypothesis test is a statistical method used to make inferences or draw conclusions about a population based on sample data. It involves comparing observed data with what we expect under the null hypothesis.
Hypothesis Testing: Hypothesis testing is a statistical method used to determine whether a claim or hypothesis about a population parameter is likely to be true or false based on sample data. It involves formulating a null hypothesis and an alternative hypothesis, collecting and analyzing sample data, and making a decision to either reject or fail to reject the null hypothesis.
Left-Tailed Test: A left-tailed test is a statistical hypothesis test where the alternative hypothesis specifies that the parameter of interest is less than a certain value. This type of test is used when the researcher is interested in determining if a population parameter, such as a mean or proportion, is significantly lower than a given target value.
Level of significance: Level of significance, denoted as $\alpha$, is the threshold for determining whether a null hypothesis should be rejected. It represents the probability of making a Type I error, which is rejecting a true null hypothesis.
Level of Significance: The level of significance, also known as the alpha level (α), is the probability threshold used to determine whether the null hypothesis in a statistical test should be rejected or not. It represents the maximum acceptable probability of making a Type I error, which is the error of rejecting the null hypothesis when it is actually true.
P-value: The p-value is the probability of obtaining a test statistic at least as extreme as the one actually observed, assuming the null hypothesis is true. It is a crucial concept in hypothesis testing that helps determine the statistical significance of a result.
Population Proportion: The population proportion is the percentage or fraction of a population that possesses a certain characteristic or attribute. It is a fundamental concept in statistics that is used to make inferences about the larger population based on a sample drawn from that population.
Rejection Region: The rejection region, also known as the critical region, is a specific range of values for a test statistic that leads to the rejection of the null hypothesis in a hypothesis test. It represents the set of outcomes that are considered statistically significant and unlikely to have occurred by chance if the null hypothesis is true.
Right-Tailed Test: A right-tailed test is a statistical hypothesis test where the alternative hypothesis specifies that the parameter of interest is greater than the value stated in the null hypothesis. This type of test is used when the researcher is interested in determining if a particular characteristic or outcome exceeds a certain threshold or standard.
Sample Proportion: The sample proportion is a statistic that represents the proportion or percentage of a sample that exhibits a certain characteristic. It is a crucial concept in statistics, as it allows researchers to make inferences about the characteristics of a larger population based on a smaller, representative sample.
Significance: Significance, in the context of statistical hypothesis testing, refers to the level of evidence required to reject the null hypothesis and conclude that the observed results are unlikely to have occurred by chance alone. It is a measure of the strength of the statistical evidence against the null hypothesis.
Standard deviation: Standard deviation is a measure of the dispersion or spread of a set of data points around its mean. It quantifies how much the individual data points deviate from the mean value.
Standard Error: Standard error is a statistical term that measures the accuracy with which a sample represents a population. It quantifies the variability of sample means from the true population mean, helping to determine how much sampling error exists when making inferences about the population.
Standard normal distribution: The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1. It is used to standardize scores from different normal distributions for comparison.
Standard Normal Distribution: The standard normal distribution, also known as the Z-distribution, is a special case of the normal distribution where the mean is 0 and the standard deviation is 1. It is a fundamental concept in statistics that is used to model and analyze data that follows a normal distribution.
Statistical Power: Statistical power refers to the likelihood that a hypothesis test will detect an effect or difference if it truly exists in the population. It is a crucial concept in hypothesis testing that determines the ability of a statistical test to identify meaningful differences or relationships.
Test statistic: A test statistic is a standardized value that is calculated from sample data during a hypothesis test. It is used to determine whether to reject the null hypothesis.
Test Statistic: A test statistic is a numerical value calculated from sample data that is used to determine whether to reject or fail to reject a null hypothesis in a hypothesis test. It serves as the basis for decision-making in statistical inference, providing a quantitative measure to evaluate the strength of evidence against the null hypothesis.
Two-Tailed Test: A two-tailed test is a statistical hypothesis test in which the critical region is two-sided, meaning that the test statistic can fall in either the upper or lower tail of the distribution. This type of test is used to determine if a parameter is different from a specified value, without specifying the direction of the difference.
Type I error: A Type I error occurs when a true null hypothesis is incorrectly rejected. It is also known as a false positive.
Type I Error: A Type I error, also known as a false positive, occurs when the null hypothesis is true, but it is incorrectly rejected. In other words, it is the error of concluding that a difference exists when, in reality, there is no actual difference.
Type II error: A Type II error occurs when the null hypothesis is not rejected even though it is false. This results in a failure to detect an effect that is actually present.
Type II Error: A type II error, also known as a false negative, occurs when the null hypothesis is true, but it is incorrectly rejected. In other words, the test fails to detect an effect or difference that is actually present in the population. This type of error has important implications in various statistical analyses and hypothesis testing contexts.
Z-score: A z-score represents the number of standard deviations a data point is from the mean. It is used to determine how unusual a particular observation is within a normal distribution.
Z-Score: A z-score is a standardized measure that expresses how many standard deviations a data point is from the mean of a distribution. It allows for the comparison of data points across different distributions by converting them to a common scale.
© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.