The purpose of this assignment is for you to implement and reflect upon what you have learned throughout the semester. In this project, you will need to demonstrate your thoughtful mastery of statist
STUCK with your assignment? When is it due? Hire our professional essay experts who are available online 24/7 for an essay paper written to a high standard at a reasonable price.
Order a Similar Paper Order a Different Paper
The purpose of this assignment is for you to implement and reflect upon what you have learned throughout the semester. In this project, you will need to demonstrate your thoughtful mastery of statistic topics from our class on a data set of your choosing using the Common Online Data Analysis Platform (CODAP (Links to an external site.)).
Project Overview:
You will choose a data set to work with from those below. You will use CODAP to help you visualize and interpret the data set and calculate relevant statistics. I know we have never used CODAP in class before, but I chose it because of its intuitive and accessible structure. we will discuss CODAP in more detail on Wednesday of the last week of classes. Additionally, you are always welcome to email me questions.
You will answer a collection of questions about your data which span the topics we have covered in M132A this semester including data collection, data visualization, and hypothesis testing.
Instructions:
0. Familiarize yourself with CODAP using the “Getting Started with CODAP” Tutorials – Part I (Links to an external site.) and Part 2 (Links to an external site.).
- Choose a data set from the ones provided below and upload it to CODAP. (Links to an external site.)
- Explore the capabilities of CODAP by investigating your data set with visuals and summary statistics; in particular, you should explore all the options you have for the graphs that you create!
- In a 3-6 page write-up, address the attached questions (Links to an external site.) in a narrative style. You should include pictures from CODAP as needed to illustrate your work and explanations.
Deliverables
-
An approximately 3-6 page write-up with graphics from your work in CODAP addressing the questions (Links to an external site.) above.
- Writing specifications: 12 point font, 1 inch margins, double spaced.
- Please submit as PDF if possible! This way we don’t have to worry about our different word editors interpreting symbols and such incorrectly.
-
CODAP file
- Save your work in CODAP to a local file and then upload it to Canvas.
Comments
- Your writing style should be narrative, like you were explaining your work to a peer or classmate.
- Show off your statistics knowledge by using proper terminology!
- You can reference your class notes, videos, and assignments as needed to help you answer questions and explain your results, but the work you submit must be your own!
Supporting Documents
- Data Sets
-
The data sets are saved as csv or txt files which can be downloaded, then uploaded into CODAP. A brief description of each data set is given; more information is given for Data Set 3 because it uses several medical abbreviations that need to be defined.
-
Cars93.csv
- Data from 93 cars on sale in the USA in 1993 including manufacturer, price, MPG, engine size, horsepower, etc.
-
Pixar vs DreamWorks Data Sheet.csv
- Data comparing budgets, profit, and ratings of Pixar and Dreamworks movies from 1998-2015.
-
Sample-Data-Birth-Weight-Risk.csv
- Data looking at 9 potential risk factors for low birth weight with birth weight outcomes (bwt). Risk factors include: indicator of birth weight less than 2.5kg (low), mother’s age (age), mother’s weight in pounds at last period (lwt), mother’s race (1 = white, 2 = black, 3 = other), smoking status during pregnancy (smoke), number of previous premature labours (ptl), history of hypertension (ht), presence of uterine irritability (ui), and number of physicians visits during the first trimester (ftv).
-
Sample-Data-Kidney-Transplant.csv
- Data on the time to death of a sample from 863 kidney transplants performed at The Ohio State University Transplant Center during the period 1982-1992. The maximum follow up time for the study was 9.47 years.
-
scottish-hill-races.csv
- Data on Scottish Hill Racing which are races that climb generally steep hills throughout Scotland. The data set contains records for men and women in these races as well as length and climb for each race.
-
wine.csv
- Data on the results of a chemical analysis of wines grown in the same region in Italy but derived from three cultivars. The analysis determined the quantities of 13 constituents found in each of the three types of wine.
-
Cars93.csv
DataSets 3, 4, and 5 are adapted from Wolfram Data Repository, DataSets 1 and 6 are adapted from the UCI Machine Learnging Repository, and DataSet 2 is adapted from TuvaLabs.
-
CODAP Help/Intro
- CODAP User Manual (Links to an external site.)
- New Document Screen – CODAP (Links to an external site.)
- CODAP Help (Links to an external site.)
- Getting Started with CODAP Tutorial Part I (Links to an external site.) and Part 2 (Links to an external site.)
-
Questions
- Write-Up Questions
The purpose of this assignment is for you to implement and reflect upon what you have learned throughout the semester. In this project, you will need to demonstrate your thoughtful mastery of statist
Answer the following questions in a 3-6 page write up. Data and Collection Introduce the data set you chose Why did you pick this data set? Do some research into what your data set describes? (e.g. what is Scottish Hill Racing?) What types of data are presented in your data set (e.g. hill height, weights, categories, etc.). How would you describe these kinds of data – continuous? discrete? categorical? What level of measurement? As a student of statistics, what questions do you have about how the data was collected? Think critically about the different ways data can be collected as well as the possible bias involved. Come up with a good data collection strategy as well as a flawed data collection strategy for this data set. Explain your reasoning/choices! Visualizing Data and Summary Statistics Choose at least two numerical aspects of your data (e.g. length, time, etc.). Create a histogram for each data set. Create a boxplot for each data set and give the five number summary. Indicate the summary statistics for each data set including mean, median, and standard deviation. Discuss what the visualizations and summaries tell you, if anything, about the data (center, spread, distribution). If relevant, compare and contrast your data sets based on the visualizations and summaries. Use complete sentences and justify any observations by tying back to your visualizations and summaries. Confidence Intervals and Hypothesis Testing Construct and interpret a confidence interval for a mean and proportion. This will involve: Confidence Interval for Mean Choose a numerical aspect of your data and calculate the sample mean (using CODAP) Choose a confidence level and calculate the Error (this will require you to know the sample size and sample standard deviation, which you can find on CODAP) Construct the confidence interval Interpret for someone without any statistics background. Confidence Interval for Proportion Choose a numerical aspect of your data and calculate the sample proportion (note – not every numerical aspect lends itself directly to proportions; you need to interpret the data as as a part of a whole that has some property – e.g. the proportion of racers who finished in under 30 minutes) Choose a confidence level and calculate the Error (this will require you to know the sample size, which you can find on CODAP) Construct the confidence interval Interpret for someone without any statistics background. Construct and create two hypothesis tests – one involving a proportion or mean, and the test comparing two proportions, two means, or a contingency table*. This will involve the following for each: Make a claim based on your data (e.g. the average time is less than…) and choose a significance level Set up the Null and Alternative Hypotheses Find the appropriate test statistic (you may need information like the sample size or sample standard deviation which can be found on CODAP) Draw your conclusion and interpret for Interpret for someone without any statistics background. Write descriptions for both what a Type I Error and a Type II Error looks like for the hypothesis test for mean or proportion * If you are going to conduct a test for independence and need to create a contingency table – this help article may prove very useful! Some notes/links on CODAP: You can save your work in CODAP either on GoogleDrive or to your computer. When you want to open a CODAP file that was saved to your computer, simply drag it to this window. CODAP Help Menu CODAP – Getting Started Part 1 and Part 2 (interactive tutorials) CODAP User Manual
The purpose of this assignment is for you to implement and reflect upon what you have learned throughout the semester. In this project, you will need to demonstrate your thoughtful mastery of statist
The purpose of this assignment is for you to implement and reflect upon what you have learned throughout the semester. In this project, you will need to demonstrate your thoughtful mastery of statistic topics from our class on a data set of your choosing using the Common Online Data Analysis Platform (CODAP (Links to an external site.)). Project Overview: You will choose a data set to work with from those below. You will use CODAP to help you visualize and interpret the data set and calculate relevant statistics. I know we have never used CODAP in class before, but I chose it because of its intuitive and accessible structure. we will discuss CODAP in more detail on Wednesday of the last week of classes. Additionally, you are always welcome to email me questions. You will answer a collection of questions about your data which span the topics we have covered in M132A this semester including data collection, data visualization, and hypothesis testing. Instructions: 0. Familiarize yourself with CODAP using the “Getting Started with CODAP” Tutorials – Part I (https://codap.concord.org/app/static/dg/en/cert/index.html#file=examples:Getting%20started%20with%20CODAP) and Part 2 (https://codap.concord.org/app/static/dg/en/cert/index.html#shared=97226). Choose a data set from the ones provided below and upload it to CODAP. (https://codap.concord.org/app/static/dg/en/cert/index.html#) Explore the capabilities of CODAP by investigating your data set with visuals and summary statistics; in particular, you should explore all the options you have for the graphs that you create! In a 3-6 page write-up, address the attached questions (Links to an external site.) in a narrative style. You should include pictures from CODAP as needed to illustrate your work and explanations. Deliverables An approximately 3-6 page write-up with graphics from your work in CODAP addressing the questions (Links to an external site.) above. Writing specifications: 12 point font, 1 inch margins, double spaced. Please submit as PDF if possible! This way we don’t have to worry about our different word editors interpreting symbols and such incorrectly. CODAP file Save your work in CODAP to a local file and then upload it to Canvas. Comments Your writing style should be narrative, like you were explaining your work to a peer or classmate. Show off your statistics knowledge by using proper terminology! You can reference your class notes, videos, and assignments as needed to help you answer questions and explain your results, but the work you submit must be your own! Supporting Documents Data Sets The data sets are saved as csv or txt files which can be downloaded, then uploaded into CODAP. A brief description of each data set is given; more information is given for Data Set 3 because it uses several medical abbreviations that need to be defined. Cars93.csv Data from 93 cars on sale in the USA in 1993 including manufacturer, price, MPG, engine size, horsepower, etc. Pixar vs DreamWorks Data Sheet.csv Data comparing budgets, profit, and ratings of Pixar and Dreamworks movies from 1998-2015. Sample-Data-Birth-Weight-Risk.csv Data looking at 9 potential risk factors for low birth weight with birth weight outcomes (bwt). Risk factors include: indicator of birth weight less than 2.5kg (low), mother’s age (age), mother’s weight in pounds at last period (lwt), mother’s race (1 = white, 2 = black, 3 = other), smoking status during pregnancy (smoke), number of previous premature labours (ptl), history of hypertension (ht), presence of uterine irritability (ui), and number of physicians visits during the first trimester (ftv). Sample-Data-Kidney-Transplant.csv Data on the time to death of a sample from 863 kidney transplants performed at The Ohio State University Transplant Center during the period 1982-1992. The maximum follow up time for the study was 9.47 years. scottish-hill-races.csv Data on Scottish Hill Racing which are races that climb generally steep hills throughout Scotland. The data set contains records for men and women in these races as well as length and climb for each race. wine.csv Data on the results of a chemical analysis of wines grown in the same region in Italy but derived from three cultivars. The analysis determined the quantities of 13 constituents found in each of the three types of wine. DataSets 3, 4, and 5 are adapted from Wolfram Data Repository, DataSets 1 and 6 are adapted from the UCI Machine Learnging Repository, and DataSet 2 is adapted from TuvaLabs. CODAP Help/Intro CODAP User Manual (Links to an external site.) New Document Screen – CODAP (Links to an external site.) CODAP Help (Links to an external site.) Getting Started with CODAP Tutorial Part I (Links to an external site.) and Part 2 (Links to an external site.) Questions Write-Up Questions

Everyone needs a little help with academic work from time to time. Hire the best essay writing professionals working for us today!
Get a 15% discount for your first order
Order a Similar Paper Order a Different Paper