Browse By Unit
4 min read•june 18, 2024
Avanish Gupta
Josh Argo
Jed Quiaoit
Avanish Gupta
Josh Argo
Jed Quiaoit
Here, you'll learn:
After covering single-variable statistics, it’s time to increase the complexity a little bit with two-variable statistics! Just like the differences on univariate data, we also have two different types of bivariate data that we may encounter: categorical and quantitative.
With categorical variables, we can use two way tables to represent the relationship between two different categories of categorical variables. A common example for bivariate categorical data may be something like measuring a student's class level (freshman, sophomore, junior, and senior) along with their learning style preference for 2020-2021 (virtual or traditional). A statistician could take these numbers and see if there was a correlation between class level and learning choice.
Since every individual has two quantities assigned to them, one of these quantities will be plotted on the x-axis as the independent variable, while the other variable is plotted on the y-axis as the dependent variable. After creating that scatterplot, we can form a trend of our data points using various models, primarily linear regression models in AP Statistics. This means that we will fit a line to our points on the scatterplot so that we can make predictions about other x-values within the range of our model.
For example, we may look at someone's height in inches along the x-axis and their shoe size along the y-axis. In this particular example, one would expect to see a positive correlation, because as height increases, so would shoe size.
On the AP exam, it is highly unlikely that you will be asked to create a scatterplot, a two way table, or a linear regression model. Instead, the question will generally provide our models via computer outputs or printouts that students are required to be able to interpret. The most important part of this unit is being able to identify the important aspects of a model and interpret what they mean.
Just like in unit 1, being able to interpret your data in context of the problem is the biggest skill you will be tested on. This includes the aspects of categorical models (like two way tables) along with the different aspects of a linear regression model like slope, y-intercept, correlation coefficient, and correlation of determination.
1. Selecting Statistical Methods
This is useful when we decide whether we want to use two-variable statistics methods and the type to use, or to use inference techniques learned later on. As with the rest of AP Statistics, it is vital that students know whether a problem is employing quantitive data methods or categorical data methods prior to proceeding with any statistical methods.
2. Data Analysis
Using data analysis, we’ll figure out how to figure out different statistics from two-variable data sets and also find ways to model with them and draw conclusions.
3. Statistical Argumentation
In this unit, we will learn to argue about the strength of how much variables are related to each other, and also the most important sentence of this unit: correlation does not imply causation! For instance, if I gather the amount of rain everyday of the week for a year and find that the rain total on Tuesdays is quite a bit higher than Mondays, does this mean that the day of the week causes it to rain more? Obviously not! In this instance, the two variables (day of the week and rain totals) are correlated, but are not causing one another.
<< Hide Menu
4 min read•june 18, 2024
Avanish Gupta
Josh Argo
Jed Quiaoit
Avanish Gupta
Josh Argo
Jed Quiaoit
Here, you'll learn:
After covering single-variable statistics, it’s time to increase the complexity a little bit with two-variable statistics! Just like the differences on univariate data, we also have two different types of bivariate data that we may encounter: categorical and quantitative.
With categorical variables, we can use two way tables to represent the relationship between two different categories of categorical variables. A common example for bivariate categorical data may be something like measuring a student's class level (freshman, sophomore, junior, and senior) along with their learning style preference for 2020-2021 (virtual or traditional). A statistician could take these numbers and see if there was a correlation between class level and learning choice.
Since every individual has two quantities assigned to them, one of these quantities will be plotted on the x-axis as the independent variable, while the other variable is plotted on the y-axis as the dependent variable. After creating that scatterplot, we can form a trend of our data points using various models, primarily linear regression models in AP Statistics. This means that we will fit a line to our points on the scatterplot so that we can make predictions about other x-values within the range of our model.
For example, we may look at someone's height in inches along the x-axis and their shoe size along the y-axis. In this particular example, one would expect to see a positive correlation, because as height increases, so would shoe size.
On the AP exam, it is highly unlikely that you will be asked to create a scatterplot, a two way table, or a linear regression model. Instead, the question will generally provide our models via computer outputs or printouts that students are required to be able to interpret. The most important part of this unit is being able to identify the important aspects of a model and interpret what they mean.
Just like in unit 1, being able to interpret your data in context of the problem is the biggest skill you will be tested on. This includes the aspects of categorical models (like two way tables) along with the different aspects of a linear regression model like slope, y-intercept, correlation coefficient, and correlation of determination.
1. Selecting Statistical Methods
This is useful when we decide whether we want to use two-variable statistics methods and the type to use, or to use inference techniques learned later on. As with the rest of AP Statistics, it is vital that students know whether a problem is employing quantitive data methods or categorical data methods prior to proceeding with any statistical methods.
2. Data Analysis
Using data analysis, we’ll figure out how to figure out different statistics from two-variable data sets and also find ways to model with them and draw conclusions.
3. Statistical Argumentation
In this unit, we will learn to argue about the strength of how much variables are related to each other, and also the most important sentence of this unit: correlation does not imply causation! For instance, if I gather the amount of rain everyday of the week for a year and find that the rain total on Tuesdays is quite a bit higher than Mondays, does this mean that the day of the week causes it to rain more? Obviously not! In this instance, the two variables (day of the week and rain totals) are correlated, but are not causing one another.
© 2024 Fiveable Inc. All rights reserved.