Take a sample of 100 homes for sale in your home town. (please use zip cose 18964 for this portion ) Information about homes can be found on websites such as Realtor, Zillow, and Trulia. For each home, record the address, zip code, current price, number of bedrooms, number of bathrooms, square footage, and the company of the listing agent. The use of EXCEL or other data software may be beneficial. You will be creating a 2-3 page report describing the housing market in your home town.
Write an introduction that includes the context of the data that has been collected. The introduction should include an identification of who, what, where, how, and why.
Create frequency tables to organize the responses of EACH variable. For quantitative variables, it may be appropriate to bin data into groups; for example, you may wish to group home prices into bins of width 25k or 50k. The variables are highlighted in green above.
Accompany each frequency table with an appropriate display/chart. Write a sentence that describes the distribution represented in each display. The description should include the context of the report.
Create a contingency table that compares the variables “zip code” and “company”. Accompany your contingency table with an appropriate display. Describe the marginal distribution of “zip code”.
Group your data by zip code and create a boxplot and 5-number summary describing the current home prices in each area. Note any outliers that may be present using the “fence” method.
Write a sentence describing the distribution of home prices using the 68%-95%-99.7% rule. Then comment on whether or not this rule is appropriate in estimating the distribution of home prices.
Write a conclusion summarizing the report.
Category: Statistics homework help
Read/review the following resources for this activity: OpenStax Textbook: Chap
Read/review the following resources for this activity:
OpenStax Textbook: Chapter 2
Lesson
Chamberlain University Library
Internet
Week 3 Lab TemplateLinks to an external site.
Required Software
Microsoft Word
Internet access to read articles
Scenario/Summary
This week’s lab highlights the use of graphics, distributions, and tables to summarize and interpret data.
Instructions
Part 1:
Your instructor will provide you with a scholarly article. The article will contain at least one graph and/or table. Please reach out to your instructor if you do not receive the article by Monday of Week 3.
Part 2:
Title your paper: “Review of [Name of Article]”
State the Author:
Summarize the article in one paragraph:
Post a screenshot of the article’s frequency table and/or graph.
Example:
Frequency Distribution -OR- Graph
Answer the following questions about your table or graph.
What type of study is used in the article (quantitative or qualitative)?
Explain how you came to that conclusion.
What type of graph or table did you choose for your lab (bar graph, histogram, stem & leaf plot, etc.)?
What characteristics make it this type (you should bring in material that you learned in the course)?
Describe the data displayed in your frequency distribution or graph (consider class size, class width, total frequency, list of frequencies, class consistency, explanatory variables, response variables, shapes of distributions, etc.)
Draw a conclusion about the data from the graph or frequency distribution in the context of the article.
How else might this data have been displayed?
Discuss the pros and cons of 2 other presentation options, such as tables or different graphical displays.
Why do you think those two other presentation options (i.e., tables or different graphs) were not used in this article?
Give the full APA reference of the article you are using for this lab.
Be sure your name is on the Word document, save it, and then submit it under “Assignments” and “Week 3: Lab”.
Requirements
The deliverable is a Word document with your answers to the questions posed above based on the article you were assigned.
YOU SHOULD INDICATE AT THE BEGINNING OF ANY EXPLANATION FOR EACH EXERCISE WHAT
YOU SHOULD INDICATE AT THE BEGINNING OF ANY EXPLANATION FOR EACH EXERCISE WHAT THE STATISTICAL TEST IS ACTUALLY MEASURING. IT WILL HELP YOU BETTER UNDERSTAND THE UTILITY OF EACH TEST
1) Test whether there is an association between a person’s gender and the prestige of their occupation. Use the GSS2018 data set to perform an independent samples t-test on SEX and PRESTIG10 depending on the dataset. Report the following:
Mean prestige score for men ___________________
Mean prestige score for women ___________________
t-test equality of means significance level ___________________
Is the relationship statistically significant Yes No
On the basis of these data, would you say that gender is associated with occupational prestige? What could explain this relationship? Remember to explain this relationship based on what the test measures.
2) You will report the information listed below for the GSS 2018 data set: Perform an independent samples t-test to compare the mean socioeconomic index (SEI10) of those who have had a born again experience with those who have not (REBORN).
NOTE: SEI: Socioeconomic index scores reflect the education, income, and prestige associated with different occupations.
Mean SEI of those non-born again ___________________
Mean SEI of those born again ___________________
t-test equality of means significance level ___________________
Is the relationship statistically significant Yes No
On the basis of these data, would you say that the religious experience of being born again is associated with socioeconomic status? Remember to explain this relationship based on what the test measures.
What were you expecting and what did you find?
3) Perform a paired t-test to compare the respondent’s mother’s occupational prestige score (MAPRES10) to the respondent’s father’s occupational prestige score (PAPRES10) using the GSS2018 data set.
Respondent’s mother’s occupational prestige score? ___________________Respondent’s father’s occupational prestige score? ___________________Significance for the Paired Samples Test? ___________________Is the relationship statistically significant Yes NoAre prestige related to generation? What were you expecting to find and did you find it?
USE STATES10 DATA FOR THE NEXT SET OF QUESTIONS
4. Perform a paired t-test to compare the median earnings of male full-time workers (EMS168) to the median earnings of female full-time workers (EMS169) using the STATES10 data set.
Mean earnings of men? ___________________
Mean earnings of women? ___________________
Significance for the Paired Samples Test? ___________________
Is the relationship statistically significant Yes No
Are earnings related to gender? Remember to explain this relationship based on what the test measures. What were you expecting to find and did you find it?
5. Using a variable called WAGEGAP that is the difference between median earnings of male full-time workers (EMS168) and the median earnings of female full-time workers (EMS169),
Create a histogram of WAGEGAP’s distribution (Remember Histogram is under either frequencies-click on chart from the dialog box OR under GRAPH). GRAPH IS PART OF THE RIBBON OF COMMANDS ALONG THE TOP
Describe the shape of the distribution.
In which state do women have earnings closest to men’s? ___________________
In which state do women’s women have earnings most disparate from men’s? ___________________
(If you are having trouble with this, you can sort the data (sort cases) in DATA VIEW from lowest to highest by the WAGEGAP variable and …)
What might account for the variation in wage gaps observed across states?
6. From the States10 dataset preform a paired sample t-test and report the results using Overdose Deaths 1999 and 2005 (Pair 1) and Overdose Deaths 2005 and 2017 (Pair 2).
Pair 1
Mean ________?
Mean ________?
Significance for the Paired Samples Test? ___________________
Is the relationship statistically significant Yes No
Pair 2
Mean ________?
Mean ________?
Significance for the Paired Samples Test? ___________________
Is the relationship statistically significant Yes No
What do the t-test results suggest about deaths by overdose over the two decade period?
Using data visualization to better determine which measures of central tendenc
Using data visualization to better determine which measures of central tendency and spread you should use.
Using the STATES10 data set:
1. Generate measures of central tendency and spread. Report the measures of central tendency and spread, and provide a scatterplot for the following variables. If using SPSS, provide a box plot as well. For scatterplots, place STATEID on the X-Axis.
A. PCI_15 (Per Capita Income: 2015)
B. CRS63 (Prisoners under Sentence of Death: 2008)
C. HrtDRT17 (Heart Disease Mortality Rate: 2017)
For each of variables, report which measure of central tendency (pick 1) and spread (pick 1) do you think would work best based on the available information?
2. Using the GSS2018 dataset, what is the proper graph (scatterplot, histogram, bar chart)-make sure to provide it, and report what you think is the correct measure of central tendency. Why?
A. REALRINC (R’s Income in constant $)
B. PARTNERS5 (How many sex partner’s R has in last 5 year)
C. RELITEN (Strength of Affiliation)
3. Using the GSS2018 dataset, what is the proper graph (scatterplot, histogram, bar chart-make sure to provide it), for:
A. HAPPY (General Happiness) by SEX (Respondent’s Sex)
B. ANCESTRS (Believe in Supernatural Power of Deceased Ancestors) by SEX (Respondent’s Sex)
For this first exercise, go to the General Social Survey (GSS) website and dow
For this first exercise, go to the General Social Survey (GSS) website and download the 1980 data set for SPSS. This is the only dataset not uploaded for you. I want to see if can find it yourself. Let me know if you have difficulties.
Answer the following:
1. Report the frequency and percentage results for HAPMAR statistics? For GSS 1980
2. Provide the proper graph (Histogram, Bar Chart, Scatter Plot) –submit the output and you need to identify which graph you should use (word document) along with a description of what is displayed in the graph. For GSS 1980.
A. AGE
B. RACE
C. INCOME
3. Using the States10 data set present descriptive statistics for the following variables (I have not included which descriptive statistics you should report so I can assess students on this knowledge)-report only the measures (should be one measure of central tendency and one measure of spread) that best represent the data:
A. DMS429 (Percent of Households Headed by Married Couples, 2008)
B. ECS445 (Homeownership Rate, 2008)
C. EMS170 (State Minimum Wage Rates, 2010)
The program Evaluation Assignments will focus on ethical and/or practical conc
The program Evaluation Assignments will focus on ethical and/or practical concerns as well as provide examples of program evaluation research.
Address the following (PART 1):
1. What are the NEP/SEP research questions (Valente, 2001; Kerr et al., 2010). Provide the properly cited direct quote for each. Why are the questions being asked?
2. Briefly describe the sample in terms of size, important characteristics, location, and time.
3. What are the primary dependent and independent variables?
4. Describe how the data was collected.
5. What is the most important finding? Was the research question answered?
Note that syringe sharing is the common topic here. By the Program Evaluation Assignment 3 due date, submit a one-page document (single-spaced, 1” margins) that first clearly and concisely summarizes the above information along with the key findings of the study, and then (PART 2) include a discussion regarding how the behavior of the target population (as well as how target populations are defined program/politically) can impact program evaluation in general and for these particular studies; and how it can impact 1) perceptions of the NEP/SEP and 2) how the success of such programs are assessed (think bias). Be specific.
You should also include (in general) how perceptions of the target population (did you define what is meant by this term), and how it can bias program evaluations (how they are designed and conducted, particularly of programs that serve target populations that have been negatively portrayed by politicians (and their surrogates). Schneider and Ingram and the other supplemental readings should be integrated into this part but not just summaries–integrate the ideas or use for examples. This latter part should be 40-50% of the paper.
The two parts should be about an equal length. Be sure to follow APA and writing academic paper guidelines. IMPORTANT: These are not original research papers so DO NOT follow that format–you are learning to do a clear and concise summary of the research including the important points of that research.
MAKE SURE TO SAVE YOUR SPSS/PSPP OUTPUT AS A PDF AND TURN IN WITH YOUR RESPONS
MAKE SURE TO SAVE YOUR SPSS/PSPP OUTPUT AS A PDF AND TURN IN WITH YOUR RESPONSES and be sure to properly label your responses]
1) Is there a relationship between a person’s age (AGE) and the number of hours spent per day watching TV (TVHOURS)? Use the GSS 2018 data to test this relationship.
Make a prediction:
What is the mean (average) number of hours people spend watching television each day? _________
Would expect the correlation between age and television viewing to be
Negative Nonexistent Positive
Now perform the analysis and find:
Correlation __________________ (Pearson’s)
Significance Level __________________ (as stated in PSPP output below the correlation coefficient )
Is the relationship statistically significant? Yes No
Do people tend to watch more or less TV as they age More Less
How would you explain these findings?
2A) Using the STATES10 data, perform a test to see if there is a relationship between the percent of the population that graduated high school (EDS131) and median earnings of male full-time workers (EMS168) in the state.
Summarize and interpret your findings below:
Correlation __________________ (Pearson’s)
Significance Level __________________ (as stated in SPSS/PSPP output below the correlation coefficient )
Is the relationship statistically significant? Yes No
Use the Scatterplot function under graph (Include Scatterplot here or with the output)
How would you explain these findings as part of an information strategy to lobby for additional resources to prevent high school drop-outs?
A) How would you interpret this result without just reference to the level of significance? What do you observe in the findings? Be specific.
2B) How does the percent graduated (EDS131) relate to the median earnings of female full-time workers (EMS169)? Is the relationship the same as for male workers (EMS168)?
Summarize and interpret your findings below
Correlation __________________ (Pearson’s)
Significance Level __________________ (as stated in SPSS/PSPP output below the correlation coefficient )
Is the relationship statistically significant? Yes No
Include a scatterplot
How would you explain these findings? Is there a difference?
If, yes, what could explain any difference? From a policy standpoint should this be a concern?
2C) Percent of the Population with a Bachelor’s Degree or More (EDS154) and Median Earnings of Female Full-Time Earnings (EMS169)Correlation __________Significance Level ___________Is the relationship statistically significant? _____________From a policy standpoint what do any differences suggest should be done. Be specific.
3) Is there a relationship between the average salary of classroom teachers (EDS125) and the percent of the population that has graduated high school (EDS131). Use the STATES10 data to test this relationship. Summarize and interpret your findings below:
Now perform the analysis and find:
Correlation __________________ (Pearson’s)
Significance Level __________________ (as stated in SPSS/PSPP output below the correlation coefficient )
Is the relationship statistically significant? Yes No
Include a scatterplot
How would you explain these findings?
Based on these results-from a policy standpoint, what could be done?
Make sure you read the question and use the proper data set. Also, provide as
Make sure you read the question and use the proper data set. Also, provide as much information so it is readily apparent what exercise and part of an exercise you are answering (and it is always a good idea to attach the output). You might think this goes without saying but you would be surprised.
Please respond by due date–If first post is 1 day late (-33%), 2 days late (-66%), 3 days late (no points). Make sure you check to see if you need to respond to any question I might have for you. PLEASE NOTE–FULL CREDIT IS GIVEN IF EVERYTHING IS CORRECT ON THE FIRST TRY. Revise and Resubmits will be marked down slightly for each resubmit
PART 1
Based on the PSPP Chi Square Distribution lecture answer the following question
Using the GSS2008 data, examine the relationship between attitudes toward the level of national assistance for childcare (NATCHILD) and the sex of the respondent (SEX). Fill in the following information:
Now perform the analysis with PSPP and find (You need to go to Analyze, Descriptives, Crosstabs–click. Enter in the appropriate variables–click on statistics on the dialogue box where you enter the variables (for row, column) and make sure the chisq box is checked.)
Percentage of men (out of only men) stating the current level is too little ________________________
Percentage of women (out of only women) stating the current level is too little ________________________
Chi Square significance level ________________________ (Pearson Chi Square Sig)
Is the relationship statistically significant YES NO
How would interpret this result? What might explain why there is or is not a significant difference in the means between the two groups (male and female). The null hypothesis for the Chi Square test is that there is no difference between the mean percentages of the various groups or categories. So, if the result is significant, we can reject (rather than confirm) the null hypothesis and indicate that there is a significant difference.
How would you describe the practical meaning of statistically significant to a group of managers with no experience with statistics or research methods?
PART 2
Using the GSS2012 data, examine the relationship between sex (SEX) and the belief that a woman will not get a job or promotion over a man (DISCAFFW).
Fill in the following information:
Make a prediction
What percent of Americans believe women are less likely to get a job or promoted over a man:
___________
Do you think males and females vary on their perspectives on this issue? YES NO
Now perform the analysis with PSPP and find (Remember how to read a table!):
Percentage of men (out of only men) saying “Somewhat Likely” __________________
Percentage of women (out of only women) saying “Somewhat Likely” __________________
Chi Square significance level __________________
Is the relationship statistically significant YES NO
How would interpret this result? In other words, what might explain why there is or is not a difference between the groups. Do not just restate the statistics without any interpretation.
What do the results suggest about perceptions of Affirmative Action?
Attach the SPSS/PSPP data all images
Read the article “Managing Marijuana: The Role of Data-Driven Regulation” and
Read the article “Managing Marijuana: The Role of Data-Driven Regulation” and the following:
https://ascend.aspeninstitute.org/an-evidence-based-approach-to-child-support/
48.Aspx
Podcast: Privacy and Predictions
1. What did you find most interesting related to the use of data in the articles and podcasts?
2. Collecting information about people is part of any program evaluation. Whether it is a needs assessment (Royse, Thyer, and Padgett) or outcome evaluation, how people need, want, and use a program is of immense interest to a variety of political actors.
Question: What considerations should a program evaluator take to keep the identities of individuals private?
For this first exercise, go to the General Social Survey (GSS) website and dow
For this first exercise, go to the General Social Survey (GSS) website and download the 1980 data set for SPSS. This is the only dataset not uploaded for you. I want to see if can find it yourself. Let me know if you have difficulties.
Answer the following:
1. Report the frequency and percentage results for HAPMAR statistics? For GSS 1980
2. Provide the proper graph (Histogram, Bar Chart, Scatter Plot) –submit the output and you need to identify which graph you should use (word document) along with a description of what is displayed in the graph. For GSS 1980.
A. AGE
B. RACE
C. INCOME
3. Using the States10 data set present descriptive statistics for the following variables (I have not included which descriptive statistics you should report so I can assess students on this knowledge)-report only the measures (should be one measure of central tendency and one measure of spread) that best represent the data:
A. DMS429 (Percent of Households Headed by Married Couples, 2008)
B. ECS445 (Homeownership Rate, 2008)
C. EMS170 (State Minimum Wage Rates, 2010)