2. Describing Data with Tables and Graphs

Frequency Distributions

1

concept

Intro to Frequency Distributions

Video transcript

So we talked a lot about how different types of charts and graphs, each with their own strengths and weaknesses, can be used to visualize different types of data, whether it's qualitative or quantitative. But one of the very first things you'll have to do before any of that is organize your data into what's called a frequency distribution. I know that sounds like a scary term, but don't worry, because what I'm going to show you in this video is that it's just a table. It's a table that helps you organize the frequency, which is the number of measurements that you have, versus different groups of numbers or labels. So let's go ahead and dive right in.

We're going to do an example together, and I'll show you some definitions that you need to know along the way. Let's get started. A frequency distribution is just a table. It's a table of values, and it shows the frequency (remember, that's the number of measurements, over here in this column) versus chosen groups of numbers or labels.

These are things that are called classes. So, for example, in this example problem, we have this dataset which lists the amount of time in minutes that students spend studying for their exam, and it's listed over here. We're going to construct a frequency distribution using 6 evenly spaced classes. So, what does that mean? Well, basically, what happens is we have this table of values over here, and these 6 evenly spaced classes are where we see values from 20 to 29, 30 to 39, and so on and so forth.

These are all evenly spaced, and there are 6 of them if you scroll down. So, basically, these things over here are called your classes. In some cases, like this example, the groups are chosen for you. And in other cases, you'll have to come up with them yourself, and we'll see how to do that later on. But all that I'm going to do here is we're going to look through the data values.

And in order to figure out the frequency, the number of measurements, I just have to go figure out how many measurements belong in each one of these classes. So let's go ahead and do that. I'm going to look through my dataset over here. I see numbers that go from 20 all the way up to 75. What I'd like to do is just look through each one of them over here.

I've got my classes over here. And for the 20 to 29 range, all I have to do is figure out how many of the data values belong in that category. The first number I see is 20, and one of the things I like to do is either tally the values or cross them out so I don't double count them later on. That's going to be really helpful when you get bigger datasets of, let's say, 20 numbers. So 20, I've counted that once.

I'm going to put a tally there. Then I count my thirties, 30 to 39. I've got 2 over here. That's 1, 2. I got my forties. That's 1, 2, and 3. That's 3. And I got my fifties. That's 2. And I got my sixties. That's 1. And then my seventies. That's 1. So now I'm done with all of these things, and I'm just going to replace these tally marks with numbers. So this is a 1 over here. This is a 2. This is a 3, 2. And then we got 1 and 1. Alright? So that is a frequency distribution.

We've got the classes over here, and then we've got the frequency over here. That's really all there is to it. Okay? So now a couple of definitions here, because you'll notice that each one of these classes has an interval of numbers, like 20 to 29. The lower class limit, which you'll need to know, is basically just the lowest number in each one of those classes.

So in this case, the lowest numbers are 20, 30, 40. Basically, it's just all of the left numbers in this column. So the lower class limit is 20, 30, 40, so on and so forth. Alright? Now if that's the lower class limit, what do you think the upper class limit is?

Well, hopefully, you realize that those are going to be the highest numbers. So in other words, those are just going to be the 29, 39, 49, so on and so forth. So these are going to be your upper class limits. Alright? So we've got 29, 39, and then so on and so forth.

Alright? So now something that you may have to know in these problems is not the lowest and highest numbers of each class, but the midpoint. And, essentially, that is just the middle number in each class. Now, most of the time, you'll have to calculate these, but it's actually pretty simple. All you do is take the average of the two numbers. In other words, you add the lower and the upper limit of each class and then divide by 2. For example, to calculate the class midpoint for the first one, it would just be \( (20 + 29) / 2 \), and that would give you 24.5. And then so on and so forth. You could calculate them for the rest of them. Alright.
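
If you'd like to check the midpoints with a few lines of Python, here is a minimal sketch. The variable names are just illustrative, and the upper limits past 49 are assumed to follow the same pattern as the first three classes.

```python
# Class limits from the study-time example (minutes).
# Assumption: the upper limits continue the 29, 39, 49 pattern.
lower_limits = [20, 30, 40, 50, 60, 70]
upper_limits = [29, 39, 49, 59, 69, 79]

# Midpoint = average of the lower and upper limit of the same class.
midpoints = [(lo + hi) / 2 for lo, hi in zip(lower_limits, upper_limits)]

print(midpoints)  # [24.5, 34.5, 44.5, 54.5, 64.5, 74.5]
```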

So that's the class midpoint. And the very last definition you'll need to know is something called the class width. The class width is the difference between two consecutive (and that's the most important word there, consecutive) lower or upper class limits. So be really careful here. Right?

The interval of this one class goes from 20 to 29, but consecutive lower class limits go from 20 to 30. So in other words, the class width is 30 minus 20, which is 10. Another way you could calculate this is with the upper class limits, 39 minus 29, which gives you the exact same number, 10. So the class width for this dataset over here is 10, not 9.

So what you don't want to do is take the upper minus the lower limit of the same class, because that's going to give you 9. That's not what the class width is. If you do that, you're going to get the wrong answer. So just be very, very careful with this. The reason for this, by the way, is that if this upper limit were actually 30, then a data value of 30 would belong to both of these classes over here, so you'd have to double count it.

And that's why consecutive classes are always separated by at least one unit here. So 20 to 29, and then the next one starts at 30, not 29. Okay? Alright. So that's it for the definitions over here.
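
To see the difference between the right and wrong way to get the class width, here is a small Python sketch using the same limits. The names are hypothetical, and the limits past 49 are assumed to continue the pattern.

```python
# Class limits from the study-time example (assumed to continue the pattern).
lower_limits = [20, 30, 40, 50, 60, 70]
upper_limits = [29, 39, 49, 59, 69, 79]

# Correct: difference between CONSECUTIVE lower limits (or consecutive upper limits).
widths_from_lowers = {b - a for a, b in zip(lower_limits, lower_limits[1:])}
widths_from_uppers = {b - a for a, b in zip(upper_limits, upper_limits[1:])}

# Common mistake: upper minus lower of the SAME class.
same_class_difference = upper_limits[0] - lower_limits[0]

print(widths_from_lowers)     # {10}
print(widths_from_uppers)     # {10}
print(same_class_difference)  # 9, which is not the class width
```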

The class width is 10. So now what I want to do is talk about the relative frequency distribution, because that was actually the other part of the problem here. So we figured out the frequency distribution. One of the things you may also have to do is calculate something called the relative frequency of each class. And, basically, the relative frequency just shows each frequency as a fraction or percent of the total number of measurements.

That total number of measurements that you have in your dataset is a variable. It's called n. So if you look through this dataset over here, what we have is that n is equal to 10. That's going to be a really important variable that we'll talk about later on. So essentially, all you have to do to calculate the relative frequencies is just take f divided by n.

That's going to give you a decimal, and then you multiply by 100 to turn it into a percent. So in other words, this is going to be 1 divided by 10, which is 0.1. And then if you multiply it by 100, what you're going to get is 10%. If you do the same exact thing over here, you're going to get 2 over 10, which ends up getting you 20%. And then you'll do the same thing, 3 over 10, that'll give you 30%, so on and so forth.

You just do the same thing for all of the classes. You can express these either as decimals or as percentages; usually, they're decimals. The fifties come out to 20%, and the last two classes are going to be 10% and then 10%. Alright.
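
Here is a short Python sketch of the same calculation, using the frequencies we tallied for the six classes. The variable names are only for illustration.

```python
# Frequencies tallied for the classes 20-29, 30-39, ..., 70-79.
frequencies = [1, 2, 3, 2, 1, 1]

# n is the total number of measurements.
n = sum(frequencies)

# Relative frequency = f / n; multiply by 100 for a percent.
relative = [f / n for f in frequencies]
percents = [100 * f / n for f in frequencies]

print(n)         # 10
print(relative)  # [0.1, 0.2, 0.3, 0.2, 0.1, 0.1]
print(percents)  # [10.0, 20.0, 30.0, 20.0, 10.0, 10.0]
```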

That just gives you sort of a relative percentage of the total number of measurements there. So that's it for a frequency distribution. Hopefully, that made sense. Let's go ahead and take a look at some practice.

2

Problem

Problem

Use the frequency distribution below to find the class width and class midpoints.

A

Class width = 10, class midpoints = 10, 20, 30, 40, 50, 60, 70

B

Class width = 10, class midpoints = 10, 21, 32, 43, 54, 65, 76

C

Class width = 11, class midpoints = 10, 21, 32, 43, 54, 65, 76

D

Class width = 11, class midpoints = 10, 20, 30, 40, 50, 60, 70

3

Problem

Problem

The following data set shows the number of overtime hours that 12 employees worked in a month. Construct a frequency distribution, using a lower class limit of 3 and a class width of 4.

A

option a

B

option b

C

option c

D

option d

4

example

Intro to Frequency Distributions Example 1

Video transcript

Hey, everyone. So let's go ahead and work out this example together. You're sitting in a cafe one day, and you're counting how many customers are served each hour over a 14-hour period. We're going to construct a frequency distribution, and we're actually told what the lower class limit and class width are. The lower class limit is 15, and the class width is 5.

So, essentially, we can figure out what our classes are going to be. We're going to get to the second part in just a second with the relative frequencies. Let's just start off with finding the frequency distribution. So remember, I don't like to draw the box. The first thing you should do is just draw the little T-chart here because you'll fill in the box later.

This is going to be customers on the left column, so customers per hour. We're going to come up with the classes for these, and this is going to be frequency. Alright? So, customers per hour. We're going to use a lower class limit of 15.

Right? The lower class limits are going to be all the numbers that go in the left column here. And the class width, the number that separates consecutive lower class limits, is going to be 5. So in other words, the next one is going to be 20. The next one is going to be 25, and then 30, 35, and 40.

If we go one more, well, if you look at your dataset here, what we see is the highest number is 41. So if you add another class starting at 45, there are going to be no data values in that class. You can basically just stop right there at 40. Alright? So our last lower class limit is 40, and now we can figure out what the upper class limits are.

Remember, you just take the next lower class limit and subtract 1. So this is going to be 19. This is going to be 24, and then so on and so forth. This is going to be 29. Then this is going to be 34, 39, and then 44.
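
The same steps can be sketched in Python. The helper `class_limits` is a made-up name for illustration, and it assumes whole-number data so that each upper limit is the next lower limit minus 1.

```python
def class_limits(first_lower, width, data_max):
    """Return (lower, upper) pairs, stopping once the data maximum is covered.

    Assumes whole-number data, so upper = next lower - 1.
    """
    classes = []
    lower = first_lower
    while lower <= data_max:
        classes.append((lower, lower + width - 1))
        lower += width
    return classes

# Cafe example: first lower limit 15, class width 5, highest value 41.
cafe_classes = class_limits(15, 5, 41)
print(cafe_classes)
# [(15, 19), (20, 24), (25, 29), (30, 34), (35, 39), (40, 44)]
```

Stopping at the data maximum is what keeps the empty class starting at 45 out of the table.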

Alright. So now that we have all of our classes and we have the upper and lower class limits, now we could just fill in the box because now we know exactly how many columns we need or how many rows we need. Sorry. So this is going to be these classes over here. And now we can just go ahead and stick to our normal method of just counting a bunch of data values.

Alright? Remember, just mark off each value as you count it, and go one by one through the dataset instead of going one by one through the classes. It's just more efficient this way. So 15 goes in this category. 24 goes here.

30 goes here. 21 goes here. 27 goes here. 35 goes here. 32 goes here.

31 also goes there. 38 goes here. 41 goes here. 26 goes here. 33 goes in this class.

36 goes in this class, and then 40 finally goes in this class. Alright. So now we can just go ahead and add up all the tallies, and this is going to be 1, 2, 2, 4, 3, 2. Alright. If you add everything up, you should get 14.

So this is 1 plus 2 is 3, 5, 9, 12, and 14. Alright. So the total number here is 14 if you add up everything. Just a sanity check that we didn't miss something. Alright?
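
Here is a minimal Python sketch of that one-pass tally. The data values are the ones read off in this video, and the index formula is my own shortcut for "which class does this value fall in"; it assumes every value is at least the first lower limit.

```python
# Customers served per hour, in the order they were read off in the video.
customers = [15, 24, 30, 21, 27, 35, 32, 31, 38, 41, 26, 33, 36, 40]

first_lower, width, num_classes = 15, 5, 6

# Go through the data once and drop each value into its class.
tallies = [0] * num_classes
for x in customers:
    index = (x - first_lower) // width  # 0 for 15-19, 1 for 20-24, ...
    tallies[index] += 1

print(tallies)       # [1, 2, 2, 4, 3, 2]
print(sum(tallies))  # 14, the sanity check
```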

So this is basically our frequency distribution. We have the frequency versus the classes that we just made. Alright? Are we done? Well, not quite here because, remember, the second thing that this problem asks us to do is now calculate the relative frequencies of each.

Instead of just these numbers, 1, 2, 2, 4, 3, 2, we're actually going to calculate the percentage of each class relative to the whole dataset. So I'm going to add another column over here and basically just extend the rows because now we're going to calculate a bunch of percentages. Alright? So remember, this is going to be relative frequencies. For a relative frequency, you just take \( f \) divided by \( n \) and then multiply it by 100%.

Okay. So in other words, I'm going to take 1 and divide it by n, which is the total number of measurements. If you look through this dataset or you just look through the problem, there were 14 numbers that we got. So \( n = 14 \). Alright?

So in other words, 1 divided by 14 is going to give you 0.071. Now sometimes these things get represented as percentages, but it's actually perfectly fine to leave them as decimals. In fact, that's most commonly what you're going to see. So what I'm going to do next is 2 over 14, and you're going to get 0.14. Now I'm not going to write out the work for all of them, but you would basically just do the exact same thing.

In fact, what's going to happen is that most of these frequencies end up being 2 or 3, so you can reuse values you've already calculated. You don't have to recalculate. So this is going to be 0.14, and this one here is also going to be 0.14. All right.

If you do this one, which is 4 over 14, this is going to end up being about 0.29. And then this 3 over 14 is going to be 0.21. Alright. So these are the relative frequencies of each class expressed as decimals, and you can convert them to percentages if you want.

But the question actually asks, what percentage of the day is the cafe serving 30 or more customers per hour? So you have to really read into this question. And in fact, that's a lot of what this course is going to be, sort of reading into what these questions are asking you. It's asking what percentage of the time they're serving 30 or more customers per hour. According to our classes, we actually have 3 classes with 30 customers or above.

So the whole idea here, to answer this question, is that 30 or more means we're going to be looking at all of these classes. It's basically a minimum of 30 and then all the way up to your final class, which is from 40 to 44. Okay. And what we can see here is that you're basically going to take each one of these relative frequencies, as percentages or decimals, and you're just going to add them all. So in other words, 0.29 plus 0.21 plus 0.14.

If you add this up, you're going to get 0.64, which, if you now convert this to a percentage, is going to be 64%. So in other words, 64% of the time, the cafe is serving 30 or more customers per hour. That's the percentage relative to the whole day. Alright? So that is how to do this problem.
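
As a quick check, here is a Python sketch that works from the raw counts instead of the rounded decimals. Adding decimals that were already rounded can shift the last digit, so the count-based value (9 out of 14) is the safer reference. The dictionary layout is just one possible way to hold the table.

```python
# Frequency distribution from the cafe example.
frequencies = {
    (15, 19): 1,
    (20, 24): 2,
    (25, 29): 2,
    (30, 34): 4,
    (35, 39): 3,
    (40, 44): 2,
}
n = sum(frequencies.values())  # 14

# Relative frequency of each class, rounded to two decimals.
relative = {cls: round(f / n, 2) for cls, f in frequencies.items()}

# Share of hours with 30 or more customers: classes whose lower limit is >= 30.
count_30_plus = sum(f for (lower, _), f in frequencies.items() if lower >= 30)
share_30_plus = count_30_plus / n

print(relative[(30, 34)], relative[(35, 39)], relative[(40, 44)])  # 0.29 0.21 0.14
print(count_30_plus)            # 9
print(round(share_30_plus, 2))  # 0.64
```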

Thanks for watching. Let me know if you have any questions, and let's move on.

5

concept

How to Create Frequency Distributions

Video transcript

In the last couple of videos, we were introduced to frequency distributions. Remember, these are just tables that organize the frequency versus different classes of numbers or labels. And in those types of problems up until now, we were always given exactly what the class limits are and how many classes to use, like 6 or 8 or something like that. But in some problems, like the one we're going to work out down here, you're only going to be given the number of classes, like 8, but you're not actually going to know what those class limits are, and you're going to have to go find them yourself. If that sounds kind of scary, don't worry about it, because I'm going to show you a step-by-step process for how to get those numbers, and then you can go right back to just counting up the frequency for those classes.

Alright? I just want to warn you, there are slightly different ways of doing this. If there's a preferred method in your class, you should stick to that. But if not, these steps are going to be really helpful. So let's go ahead and take a look.

We're just going to jump right into this problem here. So again, we're working with the amount of time, in minutes, that students spend studying for their exam each week. And we're going to construct the frequency distribution, and we're going to use 8 classes. So let's go ahead and get started here. So I know I'm going to have to count up frequencies versus the classes.

How do I actually get those? Well, let's just go ahead and look at the first step here. The first step is you're going to have to calculate the class width. Remember, that's basically just the number that separates two consecutive lower class limits. It's the difference between two consecutive lower (or two consecutive upper) class limits.

Now, again, in most problems you could actually just read that off. But in this one, we're going to have to actually go calculate it. And here's a way we could do this. You're going to take the maximum minus the minimum value of the dataset and divide it by the number of classes you're supposed to use. So let's go ahead and get started here.

I'm going to use this formula to calculate this. This is going to be max minus min. So in other words, the maximum value of my dataset is 115 and the minimum is 5. So 115 minus 5.

And then divided by the number of classes, which is 8. So that's \( (115 - 5) / 8 \). When you plug this into your calculator, you should get 13.75. Alright? Now is this the right answer? Is this the class width?

The problem with this number here is that if I just start adding 13.75, there are going to be a bunch of weird decimals that pop up in these lower class limits. So this is where you have a choice. A lot of times, you're going to round up to the nearest whole number, so we could round this up to 14, or you can sometimes round up to the nearest convenient number. A lot of times these will be, like, the nearest 5 or the nearest 10 or something that kinda makes sense given the context of the problem that you're working with. In this case, because we're talking about minutes, another convenient number that we could have used is 15.

Right? So, basically, an hour is cut up into four 15-minute chunks. That's a pretty convenient number to use. Here's the thing. You actually could use either one of these.

It's perfectly fine. So there's really no one right way to do this. It's just that I'm going to use 15 in this example because it makes the numbers a little bit easier. So this is 15. That's going to be my class width.
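
Here is a small Python sketch of the class width step, showing both rounding choices. Treating "convenient" as "next multiple of 5" is my assumption for this example; your class may use a different convention.

```python
import math

data_min, data_max, num_classes = 5, 115, 8

# Step 1: (max - min) / number of classes.
raw_width = (data_max - data_min) / num_classes

# Option A: round up to the next whole number.
width_whole = math.ceil(raw_width)

# Option B: round up to the next multiple of 5 (assumed "convenient" step).
width_convenient = math.ceil(raw_width / 5) * 5

print(raw_width)         # 13.75
print(width_whole)       # 14
print(width_convenient)  # 15
```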

So whatever these two numbers are, they're going to be separated by 15. So that's the first step. We're done with the class width. So now let's move on to the second one. We're going to actually figure out what those lower class limits are.

Remember, the lower class limits are basically just all the numbers that go on the left side of those intervals, and I'm going to have to go find them. So here's what you can do here. The first lower is the most important one. And, basically, what you're going to do is pick a number that is either less than or equal to whatever the data minimum is. So in other words, it's going to be a number that's less than or equal to this 5.

A lot of times, you're just going to see the data minimum get used as the first lower class limit, and that's exactly what we're going to do here. It's perfectly fine to use 5. You may see something like 0 sometimes, but 5 is perfectly fine. So let's go ahead and do that: 5. That's the first lower class limit.

How do I find the next ones? Well, to find the next lowers, what you're going to do is take the previous lower and then just add the class width. Remember, this 15 over here is going to be the difference between each consecutive lower class limit. So in other words, this is 5, and the next one's going to be 20, and the next one's going to be 35, then 50, then 65, then 80, then 95, and then finally, 110. Alright?

So those are all your lower class limits. Cool. So now that we're done with the lower class limits, we're going to have to find the upper class limits, which are basically just the numbers that go on the right side of those intervals. How do we find those? Well, basically, what we're going to do here is start with the first upper.

You're just going to take the second lower and then subtract 1 from it. Here's why. If I used 20, then, again, if I get a data value that's 20, it's going to go into two classes, and I'm going to double count something. So what I have to do is take the second lower and subtract 1. So this is going to be 19.

Alright? And then what happens is I could just do the same exact thing for this one. This is going to be 34. It's always going to be the next lower minus 1. This is going to be 49 and then 64 and then 79 and then so on and so forth.

You can kinda see the pattern that's going on here. The next ones are going to be 94 and 109. And then finally, this is going to be 124, again, because everything should be separated by 15. Alright? So these are going to be your upper class limits.

Alright? Again, really important here, your class width is 15. The difference between the lower and upper limit within each class is 14, but the class width is 15. Alright?
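
Steps 2 and 3 can be sketched in Python like this. The names are illustrative, and the "minus 1" assumes whole-number data like the minutes here.

```python
first_lower, width, num_classes = 5, 15, 8

# Step 2: each lower limit is the previous lower limit plus the class width.
lowers = [first_lower + i * width for i in range(num_classes)]

# Step 3: each upper limit is the NEXT lower limit minus 1,
# which is the same as lower + width - 1.
uppers = [lo + width - 1 for lo in lowers]

print(lowers)  # [5, 20, 35, 50, 65, 80, 95, 110]
print(uppers)  # [19, 34, 49, 64, 79, 94, 109, 124]

# Within one class the limits are 14 apart, but the class width is 15.
print(uppers[0] - lowers[0])  # 14
print(lowers[1] - lowers[0])  # 15
```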

Now we're actually done. We have what all of our classes are for this frequency distribution. And the last step after you're done is now you can just go ahead and find the frequency for each and then tally everything up in its appropriate class. This is exactly what we were just doing before. I'm going to fly through this.

We've already done this before. I'm going to include all the data values that are between 5 and 19. And that's going to be 1, 2, so that's going to be over here. Then I've got 20 through 34. This is going to be 1, 2,

6

example

How to Create Frequency Distributions Example 2

Video transcript

Welcome back, everyone. So we're going to work out this problem together here. This problem gives us a dataset that shows the sales in dollars of 15 sales representatives at some company. And what we're going to do in this problem is construct a frequency distribution. What they're not telling us in this problem is what the lower class limits and the class width are.

They're only just telling us the number of classes to use, which is 5. And so in the absence of all that information, we're going to have to use these steps to figure out what those class limits are and the class widths. Okay? So we're going to stick to the steps here. Remember that, eventually, I'm just going to end up with a frequency distribution.

I'm going to just go ahead and just draw a little table like this. I've got sales over here versus frequency. The first step, remember, is that you have to calculate the class width. I'm going to do that over here. So this is going to be my class width.

I'm going to call this c_w just to, you know, abbreviate. But, essentially, what we're going to do is you're going to take the maximum of the data values minus the minimum and divide it by the number of classes. We have all those pieces of information. Right? We just have to look at the dataset and figure out what's the max and the min.

Alright. So if you look at the numbers over here, I see some that start with 11, some that start with 10, and then I only see one number that starts with 12. So the maximum is 1223. Alright. So c_w, the class width, is going to be 1223 minus the minimum number.

And if you look again through this dataset over here, you see a couple in the 900s, a couple in the 1000s. I see some in the 800s, like 843, but the number that's the lowest is 819. There's nothing lower than that. So it's \( (1223 - 819) / 5 \), because, again, we're using 5 classes over here. Right?

So the number of classes is directly why you use 5 there. Okay? When you work this out, what you're going to get here is 80.8. So remember, it's kind of weird to have class widths that involve decimals, because then your class limits are going to get really weird. They're going to have a bunch of decimals in there.

So what you're going to do a lot of times is just round up to the nearest whole number. In this case, because you have this data that's all over the place and deals with dollars, there's really not a convenient number to use. So I'm going to just go ahead and round it up to the next integer, and I'm just going to use 81. Okay? That's a perfectly valid class width to use.

Alright? So now what happens is, remember, whatever these lower class limits are, I know that the spacing between them is going to be 81. Okay? So now I have to go to the second step, which is figuring out what those lower and upper class limits are. Remember, the lower class limits are the left numbers.

The first lower is just going to be a number that is less than or equal to whatever the data minimum is here. And in this case, because I have numbers that go from 819 to 1223, I'm just actually going to use whatever the minimum is. So in this case, I'm just going to use 819, and that's it. Alright. So that's my first lower.

The next lower is going to be this number plus the class width. So in other words, 819 plus a class width of 81. So if I add this over here, I just get 900. Alright. And if you go ahead and do this for the rest of the classes, you're going to get 981, then you're going to get 1062, and then you're going to get 1143.

If you go one more, what you're actually going to get here is 1224, and this number is actually bigger than the biggest data value that you have here. So in other words, that class would have a frequency of 0, and you might as well not even have it. Okay? So these are going to be my classes over here. Alright?

So I've got 819, 900, and I got the rest of these numbers here. Those are the lower class limits. Now I have to figure out the upper class limits. And remember, to find the uppers, all you have to do is take the next lower and subtract 1.

Alright. So this number can't be 900. It has to be one number less than that, so 899. This is going to be 980. This is going to be 1061.

This is going to be 1142. And then what happens here is this last one is going to be 1223, because the next lower would be 1224. Alright? So now that you have your upper and lower class limits, you've actually defined what your classes are. Now we can go ahead and tally up all the frequencies.
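
Putting the steps together for the sales data, here is a hedged Python sketch. `build_classes` is a hypothetical helper, it always rounds the width up to the next whole number, and it uses the data minimum as the first lower limit, which are the choices made in this example.

```python
import math

def build_classes(data_min, data_max, num_classes):
    """Return (class_width, [(lower, upper), ...]) for whole-number data.

    Caveat: if (max - min) / num_classes is already a whole number, the
    maximum can land past the last class; this sketch does not handle that.
    """
    width = math.ceil((data_max - data_min) / num_classes)
    classes = []
    for i in range(num_classes):
        lower = data_min + i * width
        classes.append((lower, lower + width - 1))
    return width, classes

width, sales_classes = build_classes(819, 1223, 5)
print(width)          # 81
print(sales_classes)
# [(819, 899), (900, 980), (981, 1061), (1062, 1142), (1143, 1223)]
```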

Okay? So this 1223 is going to belong over here. And by the way, now you can go ahead and close the box and also fill out the rest of the rows because now you know how many classes you have. Alright? So this is going to be 1.

The second number over here, 1136, is going to fall over here. 819 is going to fall over here. 1089 is going to fall over here. 1011 goes into this class over here. 997 goes into this class.

973 goes into this one. 1025 goes here. 1017 also goes here. 1118 goes here. Then I've got 988 which goes here.

And I've got 843, which goes here, 1196, which goes here, 1081, which goes here, and then we've got 942, which goes here. Alright. So in other words, I can just tally up everything and essentially get 2 and 2. This one is going to be 1, 2, 3, 4, 5. This is going to be 4, and this is going to be 2.

Alright. So, hopefully, if you've done this right, you should get 15 numbers. We've got 2, that's 4, 9, 13, 15. Alright. So that is going to be my frequency distribution.
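
If you want to double check the tallies, here is a minimal Python sketch. The sales values are the ones read off in this video, and counting per class with a range test is just one straightforward way to do it.

```python
# Sales in dollars for the 15 representatives, as read off in the video.
sales = [1223, 1136, 819, 1089, 1011, 997, 973, 1025,
         1017, 1118, 988, 843, 1196, 1081, 942]

# Classes built from a first lower limit of 819 and a class width of 81.
classes = [(819, 899), (900, 980), (981, 1061), (1062, 1142), (1143, 1223)]

# For each class, count the values that fall between its limits (inclusive).
frequencies = [sum(lo <= x <= hi for x in sales) for lo, hi in classes]

print(frequencies)       # [2, 2, 5, 4, 2]
print(sum(frequencies))  # 15
```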

Alright. So that's basically my answer. I've got my frequency distribution for the classes, and that's it for this one. So let me know if you have any questions, and I'll see you in the next one.

7

Problem

Problem

A data set has a minimum value of 16 and a maximum value of 71. Without constructing a table, find the class width if you organized this data into 7 classes. Write the lower and upper class limits.
min = 16; max = 71; 7 classes.

A

Lower: 16, 23, 30, 37, 44, 51, 58, 65

Upper: 24, 32, 40, 48, 56, 64, 72

B

Lower: 16, 24, 32, 40, 48, 56, 64

Upper: 23, 31, 39, 47, 55, 63, 74

C

Lower: 16, 23, 30, 37, 44, 51, 58, 65

Upper: 24, 32, 40, 48, 56, 64, 71

D

Lower: 16, 24, 32, 40, 48, 56, 64

Upper: 23, 31, 39, 47, 55, 63, 71

© 1996–2024 Pearson All rights reserved.