Now, we can start talking about statistics. In the big picture, statistics is a rather broad subject, but we only have to know a few things for the test. For the purposes of the test, statistics consists of tools for making sense of data. So we'll get some data and we'll just be asked about this data.
So, one of the most fundamental questions we can ask about a data set is, what single number is most representative of the set as a whole? And this turns out to be very important in real world statistics. For example, if we had a list of the household income of everyone in the United States, we'd wanna know what's one single number that would be most representative of that entire list.
Well, such numbers are called measures of center. And that's what we're gonna start talking about in this video. The two most important measures of center are the mean and the median. The mean is simply an ordinary average. To find the mean, say, of seven numbers on a list, you would add up the seven numbers, and then divide this sum by seven.
In general, on a list with N entries, we add up all the entries, and then divide by N. That's the mean. Just an ordinary average. We can write the formula as mean equals the sum of N entries divided by N. That's the formula for the mean.
Notice that this formula is also quite useful in the following form. If we just multiply both sides by N, we get N times the mean equals the sum of the entries. In other words, the number of entries on the list times the mean has to equal the sum of the entries. Thinking about sums is often the key to many questions about average or mean. So this second form, it's really underappreciated how powerful this form is.
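Here's a minimal sketch of those two forms of the formula in Python. The list `scores` and the function names `mean` and `total_from_mean` are made up for illustration; they're not from the lesson.

```python
def mean(entries):
    """Ordinary average: add up all the entries, then divide by how many there are."""
    return sum(entries) / len(entries)


def total_from_mean(n, mean_value):
    """Second form of the formula: N times the mean equals the sum of the entries."""
    return n * mean_value


# Hypothetical list of seven numbers (an assumption, not data from the lesson).
scores = [3, 5, 7, 9, 11, 13, 15]

avg = mean(scores)                          # 63 / 7 = 9.0
total = total_from_mean(len(scores), avg)   # 7 * 9.0 = 63.0

print(avg, total)
```

The second function is the one to reach for when a question gives you an average and a count but not the individual entries.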
For example, here's a practice question. Pause the video and then we'll talk about this. Okay, in a class, 18 students took a test and had an average of 70. Alicia and Burt then took the test, and the average of all 20 students was 71. If Alicia got a 77, then what was Burt's grade? All right, so, a lot of people would find this a very hard question.
The key to this question is simply thinking about the sums. So, first of all, the old sum, those 18 people. I'm just gonna take 18 times the mean of 70. Multiply that out, and that's 1260. Well, now the new sum of all 20 students, that's going to be 20 times 71. Multiply that out, that's 1420.
Well, think about that. The old sum and the new sum, what's the difference between them? The only difference is, we added Alicia's score and Burt's score to the other 18 scores. So the difference between them should be the sum of the score of Alicia and the score of Burt.
So, we subtract them: 1420 minus 1260 is 160. That's the sum. So Alicia's score and Burt's score must add up to 160. Well, if we subtract Alicia's score of 77, then we get 83, and of course, that has to be Burt's grade. So that's how we answer that question, purely thinking about sums.
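The same sum-based reasoning can be sketched in a few lines of Python. The numbers come straight from the practice question; the variable names are just illustrative.

```python
# Old sum: 18 students with an average of 70.
old_sum = 18 * 70            # 1260

# New sum: all 20 students with an average of 71.
new_sum = 20 * 71            # 1420

# The only difference between the two sums is Alicia's score plus Burt's score.
alicia_plus_burt = new_sum - old_sum   # 160

alicia = 77
burt = alicia_plus_burt - alicia       # 83

print(burt)
```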
Now the median. The median is the middle number on a list. We have to be a little careful here. We have to put the list in ascending order first, that is from smallest to biggest. Technically, the median is the middle number on an ordered list. So, we can't just write them in any jumbled order and then say, okay, the median is the one that happened to be sitting in the middle of that jumble.
No, we have to put the list in order, from smallest to biggest. And often, incidentally, the test will not do that. They'll give you the numbers in a jumbled order, and then you, yourself, have to put them in order and then find the median. So, you have to be careful with that. So if we have this list for example, the number right smack dab in the middle is 4.
So, that's the median. There are three numbers below it, three numbers above it, clearly in the middle. Now this list is a little bit interesting, because there's an even number of terms on the list, not an odd number, so there's no one number at the middle. At the middle, we have these two numbers, 4 and 5, and they can't both be the median. So what we do if there's an even number of terms on a list is average the two middle numbers.
And so the median is between 4 and 5. We take the average of 4 and 5, which is 4.5. That is the median. Suppose we're given this list. Well, the very first thing we have to do is put that list in order. So now it's in order.
So now the median, there are one two three four five six seven eight numbers on this list. And so the median has to be between 8 and 13. So we're gonna average 8 and 13. 8 + 13 divided by 2, 21 divided by 2. 10.5, 10.5 is the median of list K.
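As a rough sketch, the whole procedure (put the list in order, then take the middle number or average the two middle numbers) might look like this in Python. The lists on screen aren't reproduced in this transcript, so `odd_list` and `jumbled` below are hypothetical lists chosen only to match the medians mentioned above; the function name `median` is also just illustrative.

```python
def median(numbers):
    """Middle number of the ordered list; average of the two middle numbers if the count is even."""
    ordered = sorted(numbers)          # always put the list in ascending order first
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]            # odd count: one number sits exactly in the middle
    return (ordered[mid - 1] + ordered[mid]) / 2   # even count: average the two middle numbers


# Hypothetical seven-number list: three numbers below 4, three above it.
odd_list = [6, 1, 4, 7, 2, 5, 3]

# Hypothetical eight-number list given in jumbled order; sorted, the middle pair is 8 and 13.
jumbled = [13, 2, 55, 8, 21, 5, 3, 34]

print(median(odd_list))   # 4
print(median(jumbled))    # 10.5
```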
Notice that the median only takes into account the number or numbers at the very center. We could change the numbers at either end of the list, and this change wouldn't affect the median at all. So for example, in that particular list, suppose we changed the 55 to 55,000, the median wouldn't change at all, the mean would change, but not the median.
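To see this in action, here's a small sketch using Python's standard `statistics` module. The eight-number list is a hypothetical stand-in (the actual list from the video isn't in this transcript); it's built so that its largest entry is 55 and its median is 10.5.

```python
from statistics import mean, median

# Hypothetical list whose largest entry is 55 (an assumption, not the list from the video).
original = [2, 3, 5, 8, 13, 21, 34, 55]

# Same list, but with the 55 changed to 55,000.
changed = [2, 3, 5, 8, 13, 21, 34, 55000]

print(median(original), median(changed))   # 10.5 10.5 (the median doesn't move)
print(mean(original), mean(changed))       # 17.625 6885.75 (the mean changes a lot)
```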
We'll talk about that in the next video. Here's a practice problem. Pause the video and then we'll talk about this. Okay, the median of List B is exactly 4 higher than the median of List A. All right, so the very first thing we need to do is find the median of List A. We put the terms in order.
The median is the average of 7 and 10, which is 8.5. That's the median of List A. The median of List B has to be 4 higher than this, so it has to be 12.5, that's 8.5 plus 4. All right, so now let's think about this. There are three numbers that are less than that median, 4, 7 and 10 are all less than the median.
18, 25 and x must be greater than the median and in fact, that 12.5 must be the average of 10 and x. So 12.5 is the average of 10 and x. We'll multiply by 2. What we get is 12.5 times 2 is 25 subtract 10, we get x = 15. And so that's the value of x.
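Here's a short sketch of that calculation in Python, with a check at the end. The known entries of List B (4, 7, 10, 18, 25) are taken from the narration above; the full List A isn't reproduced in this transcript, so only its two middle numbers, 7 and 10, are used.

```python
from statistics import median

# Median of List A: the average of its two middle numbers, 7 and 10.
median_a = (7 + 10) / 2        # 8.5

# The median of List B is exactly 4 higher.
median_b = median_a + 4        # 12.5

# 12.5 is the average of 10 and x, so 10 + x = 2 * 12.5.
x = 2 * median_b - 10          # 15.0

# Check: with x = 15, List B really does have a median of 12.5.
list_b = [4, 7, 10, 18, 25, x]
print(x, median(list_b))
```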
If x has a value of 15, then List B would have a median of 12.5, which is exactly 4 higher than 8.5. One final measure of center is the mode. You often hear these three said together: mean, median and mode. What's the mode? The mode is the most frequently appearing number on the list.
The number that makes the greatest number of appearances. This is far less important than either the mean or the median for a variety of reasons. Some lists have a single mode, so for example there are a lot of threes on this list there are three threes and every other number only appears once. So the mode is clearly 3.
Some lists have two modes. For example, in this list we have a pair of twos and also a pair of fives, so the modes are 2 and 5. If all the numbers on the list are different from one another, as is usually the case, then there simply is no mode. So sometimes there is a mode, sometimes there's more than one, and often there's just no mode at all.
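All three cases can be sketched with one small Python function. The name `modes` and the three example lists are hypothetical (the lists from the video aren't in this transcript); the function follows the convention used here, where a list of all-different numbers has no mode.

```python
from collections import Counter


def modes(numbers):
    """Return the most frequently appearing number(s), or an empty list if nothing repeats."""
    counts = Counter(numbers)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1:
        return []                      # every number appears once: no mode
    return sorted(value for value, count in counts.items() if count == top)


# Hypothetical lists, chosen to match the three cases described above.
one_mode = [1, 3, 3, 3, 4, 6]          # three threes
two_modes = [2, 2, 5, 5, 7, 9]         # a pair of twos and a pair of fives
no_mode = [1, 2, 3, 4]                 # all different

print(modes(one_mode))    # [3]
print(modes(two_modes))   # [2, 5]
print(modes(no_mode))     # []
```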
So think about this, every single list on the planet has a mean, every single list on the planet has a median. But, only some lists have modes. Some may have more than one mode and many have no mode at all and so this is one of the big reasons why the mode is not nearly as important as the mean or the median. In summary, the mean is the simple average, often in questions about mean, it's helpful to think in terms of the sums.
So the sum of the entries equals the number of entries times the mean. We just rewrite that formula on the top to get this. The median is the middle number of an ordered list. If two numbers are in the middle, then we simply average those numbers. And the mode, the most frequently appearing number, is just less important. Some lists have a mode, some have more than one, and many have no mode at all.
Theoretically the test could ask you about a mode, but it's much, much less common than questions about mean or median.