Using data visualization to better determine which measures of central tendency and spread you should use.
Using the STATES10 data set:
1. Generate measures of central tendency and spread. Report the measures of central tendency and spread, and provide a scatterplot for the following variables.  If using SPSS, provide a box plot as well. For scatterplots, place STATEID on the X-Axis.  
A. PCI_15 (Per Capita Income: 2015)
B. CRS63 (Prisoners under Sentence of Death: 2008)
C. HrtDRT17 (Heart Disease Mortality Rate: 2017)
For each of the variables, report which measure of central tendency (pick 1) and which measure of spread (pick 1) you think would work best based on the available information.
2. Using the GSS2018 dataset, what is the proper graph (scatterplot, histogram, or bar chart) for each variable below? Make sure to provide the graph, report what you think is the correct measure of central tendency, and explain why.
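A minimal sketch of how the measures in question 1 could be computed outside SPSS, using only Python's standard library. The `summarize` helper and the `sample` values are hypothetical illustrations; the values are not taken from STATES10.

```python
import statistics

def summarize(values):
    """Return common measures of central tendency and spread for a list of numbers."""
    # statistics.quantiles with n=4 returns the three quartile cut points
    q1, _median, q3 = statistics.quantiles(values, n=4)
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "range": max(values) - min(values),
        "iqr": q3 - q1,
    }

# Hypothetical right-skewed values (NOT from the STATES10 data set)
sample = [0, 0, 1, 2, 3, 5, 8, 40]
summary = summarize(sample)

for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

In a skewed sample like this one, a single large value pulls the mean well above the median, which is the kind of pattern a scatterplot or box plot makes visible before a measure is chosen.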
A. REALRINC (R’s Income in constant $)
B. PARTNERS5 (How many sex partners R had in last 5 years)
C. RELITEN (Strength of Affiliation)
3. Using the GSS2018 dataset, what is the proper graph (scatterplot, histogram, or bar chart) for each of the following? Make sure to provide it.
A. HAPPY (General Happiness) by SEX (Respondent’s Sex)
B. ANCESTRS (Believe in Supernatural Power of Deceased Ancestors) by SEX (Respondent’s Sex)

According to a survey by paint manufacturer DuPont, 22% of all cars in the United States are red. Suppose 20 cars are randomly selected and the number of red cars is recorded. Round probabilities to 4 decimal places.
Explain why this is a binomial experiment.
Find and interpret the probability that exactly 6 cars are red.
Find and interpret the probability that fewer than 6 cars are red.
Find and interpret the probability that at least 6 cars are red.
Compute the mean and standard deviation of the binomial random variable.
Please show work 
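The calculations requested above can be sketched with the binomial formula P(X = k) = C(n, k) p^k (1 - p)^(n - k), taking n = 20 and p = 0.22 from the problem statement. The function and variable names are illustrative choices, not part of the assignment.

```python
import math

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

n, p = 20, 0.22  # values given in the problem statement

p_exactly_6 = binomial_pmf(6, n, p)
p_fewer_than_6 = sum(binomial_pmf(k, n, p) for k in range(6))  # k = 0..5
p_at_least_6 = 1 - p_fewer_than_6                              # complement rule

mean = n * p                          # mu = n * p
std_dev = math.sqrt(n * p * (1 - p))  # sigma = sqrt(n * p * (1 - p))

print(f"P(X = 6)  = {p_exactly_6:.4f}")
print(f"P(X < 6)  = {p_fewer_than_6:.4f}")
print(f"P(X >= 6) = {p_at_least_6:.4f}")
print(f"mean = {mean:.4f}, standard deviation = {std_dev:.4f}")
```

Computing "at least 6" through the complement avoids summing the fifteen terms from 6 to 20.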

The following data represent the speed at which a ball was hit (in miles per hour) and the distance it traveled (in feet) for a random sample of home runs in a Major League baseball game in 2018. Complete parts (a) through (f).
(a) Find the least-squares regression line treating speed at which the ball was hit as the explanatory variable and distance the ball traveled as the response variable.
Speed (mph) Distance (feet)
103.0 393
105.3 420
103.5 422
105.5 414
105.4 418
100.3 392
103.5 395
107.9 441
101.4 399
98.0 395
100.8 394
103.4 394
n Critical Values for Correlation Coefficient
3 0.997
4 0.950
5 0.878
6 0.811
7 0.754
8 0.707
9 0.666
10 0.632
11 0.602
12 0.576
13 0.553
14 0.532
15 0.514
16 0.497
17 0.482
18 0.468
19 0.456
20 0.444
21 0.433
22 0.423
23 0.413
24 0.404
25 0.396
26 0.388
27 0.381
28 0.374
29 0.367
30 0.361
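One way to sketch part (a) is to compute the slope and intercept directly from the sums of squares, then compare the correlation coefficient against the table entry for n = 12 (0.576). The data are copied from the table above; the variable names are illustrative.

```python
import math

speed = [103.0, 105.3, 103.5, 105.5, 105.4, 100.3,
         103.5, 107.9, 101.4, 98.0, 100.8, 103.4]   # mph (explanatory)
distance = [393, 420, 422, 414, 418, 392,
            395, 441, 399, 395, 394, 394]           # feet (response)

n = len(speed)
mean_x = sum(speed) / n
mean_y = sum(distance) / n

# Sums of squares and cross-products about the means
s_xx = sum((x - mean_x) ** 2 for x in speed)
s_yy = sum((y - mean_y) ** 2 for y in distance)
s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(speed, distance))

slope = s_xy / s_xx                    # b1
intercept = mean_y - slope * mean_x    # b0
r = s_xy / math.sqrt(s_xx * s_yy)      # linear correlation coefficient

critical_value = 0.576                 # table entry for n = 12
linear_relation = abs(r) > critical_value

print(f"y-hat = {slope:.4f}x + {intercept:.4f}")
print(f"r = {r:.4f}, exceeds critical value: {linear_relation}")
```

The fitted line always passes through the point of means, which is a quick check on the arithmetic.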

Please see the attachment with instructions.
1- In Word doc:
-Submit your research question- If it needs explanation, include 1-2 paragraphs.
-Data Collection- Submit a clean dataset, meaning one with no extra text or code in it. If it needs formatting, keep it organized, for example as a comma-separated file.
– Project Organization
1. Show where you get the data from and describe your dataset.
2. Explain your statistical research questions and methods. Apply at least two methods.
3. Share the conclusion (answer)
2- Paper in Word document: Refer to the uploaded file to submit your project paper, where you summarize the dataset you selected, your research questions, analysis, and conclusions. Your writing should be your own. If you are citing other people’s work, reference it clearly. Do not appropriate it without giving clear credit. This summary should serve to explain your research and conclusions.
3- Excel: Please refer to the final project instructions and upload here your statistical analysis of the Excel dataset you chose to work on. Show your analysis in separate tabs for each method selected. Label clearly. Your work should be original, meaning not already published by someone else or publicly available. It should reflect your own analysis of the dataset selected.
Your analysis should be performed using Excel methods as we learned in the course (not manual formulas).

Suppose that you have two sets of data to work with. The first set is a list of all the injuries that were seen in a clinic in a month’s time. The second set contains data on the number of minutes that each patient spent in the waiting room of a doctor’s office. You can make assumptions about other information or variables that are included in each data set.
For each data set, propose your idea of how best to represent the key information. To organize your data would you choose to use a frequency table, a cumulative frequency table, or a relative frequency table? Why?
What type of graph would you use to display the organized data from each frequency distribution? What would be shown on each of the axes for each graph?
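The three table types named above can be sketched in one pass: count each category, divide by the total for relative frequency, and keep a running total for cumulative frequency. The `frequency_table` helper and both sample lists are hypothetical illustrations, not data from either clinic.

```python
from collections import Counter
from itertools import accumulate

def frequency_table(observations):
    """Return rows of (category, frequency, relative frequency, cumulative frequency)."""
    counts = Counter(observations)
    total = sum(counts.values())
    categories = sorted(counts)
    freqs = [counts[c] for c in categories]
    cumulative = list(accumulate(freqs))  # running total of frequencies
    return [(c, f, f / total, cf)
            for c, f, cf in zip(categories, freqs, cumulative)]

# Hypothetical categorical data: injuries seen in a month
injuries = ["sprain", "fracture", "sprain", "laceration",
            "burn", "sprain", "fracture", "burn"]
injury_table = frequency_table(injuries)

# Hypothetical quantitative data: minutes waited, grouped into 10-minute classes
wait_minutes = [4, 12, 17, 23, 8, 35, 15, 9, 28, 11]
wait_classes = [(m // 10) * 10 for m in wait_minutes]  # lower class limits
wait_table = frequency_table(wait_classes)

for row in injury_table + wait_table:
    print(row)
```

The cumulative column is only meaningful when the categories have a natural order, as the wait-time classes do; for the injury types the alphabetical order is arbitrary, so the relative frequency column carries the useful information.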
In a separate paragraph consider how different distributions might affect the different graphs. How might other variables affect the graphs? How could graphs be made to be biased? If a graph were biased, how might you change it to guard against that bias?
Use a minimum of one scholarly source AND one appropriate resource, such as the textbook, a math video, and/or a math website.
In your reference for this assignment, be sure to include both your text/class materials AND your outside reading(s).