I need to come up with a data set for my project. Here are the requirements:
Find a data set online and illustrate aspects of the data using the three sections of this statistics class. The goal is to do proper statistics. You are not being graded on discovering any specific relationship, just on how well you interpret and communicate any results you get from the data.
Your data set should contain at least 50 data points. Each subject should have at least three values associated with it. That is, when you put the data into Minitab (mandatory) you will have at least three columns. This will be explained more as time goes on, but you will need at least two numerical variables and one categorical variable for each of your 50+ subjects.
Can you help me?

Document Preview:

Project
Find a data set online and illustrate aspects of the data using the three sections of this statistics class. The goal is to do proper statistics. You are not being graded on discovering any specific relationship, just on how well you interpret and communicate any results you get from the data.
Your data set should contain at least 50 data points. Each subject should have at least three values associated with it. That is, when you put the data into Minitab (mandatory) you will have at least three columns. This will be explained more as time goes on, but you will need at least two numerical variables and one categorical variable for each of your 50+ subjects.
You aren’t going to hand in the Minitab file. Instead you will hand in a printed Word (or other word processor) document that will include your graphs, Minitab outputs and written explanations. Your project should be a properly written document with the topics covered in this order:
[5] Comment on the selection process used in getting your sample. Also talk about what your conclusions could mean, that is, what population can you generalize to?
Using your data and Minitab, illustrate:
Your explained variable, y. Create a graph to show its distribution, and interpret that graph. [2]
Use Minitab to find the appropriate measure of center and spread for that variable y. [1+1]
Create a confidence interval for that variable, and explain what the results mean. Explain whether the confidence interval is appropriate to use.[1]
Find the SST for the variable y. Explain what it means, and discuss what our prediction for y would be if we had no other predictor.[1+1]
Divide the data into two sets logically and do a two-sample t-test to see if you can prove a difference. Explain the Minitab output. Draw an appropriate graph for this test. [2+1]
Divide the data into more than two sets and run an ANOVA. Explain the Minitab output. Draw an appropriate graph for this test. [2+1]
Use a different numerical variable to...