Your final project for this class consists of analyzing a data set of your choosing using methods that we have discussed in this course. You have essentially free reign to choose a data set and research question; as far as grading is concerned, the focus of the project is on the proper application of statistical methods.
You will be graded on two components, each a different way of communicating your problem, your data, and how you analyzed it:
- A 15-minute presentation, to be given during class May 3
- A written report (3-5 pages) due May 5
If you have any questions on the project or whether what you have in mind is feasible and/or suitable, please let me know.
Please keep the following questions in mind as you prepare your presentation and report (some questions may not apply to certain types of projects):
- What is the main question I am trying to answer?
- How was the data collected/gathered/sampled?
- Are there any confounding relationships present?
- Are there any interactions present?
- How did you select your model? Is overfitting a concern?
- Is your model reasonable? What assumptions is it making?
- What are the limitations of my analysis (assumptions which may not hold, limitations of the data, etc.)?