Filter the data to include only residential real estate of your neighborhood. Group it by year, and summarize the data to show the average price of 1 square foot of real estate. List the results.
1. Do numbers show for all years? If a year shows N/A, why is that happening?
2. Filter your data to focus on residential real estate with adequate information for your model (remove fields with 0 for price or square feet)
3.Group the filtered data by year, and summarize the data to show the average price of 1 square foot of real estate. List the results.
4.Compare your selected neighborhood with 2 nearby neighborhoods. List the numbers.
5.Produce a plot that compares the neighborhoods. Explain the plot and the reason you chose that type of plot.
Please only expert! I do not need bad work!
Instruction from instructor:
What we have in here is a spreadsheet with data in it. Bicycle data we have to cleanse the Data to get it into a format we want to use for analytics . We have the day columns for example, what if I wanna show the upper management A chart that show instead of date, Bicycle sales by day of the week so I can see is Friday is a better day for sale day or Saturday is a better day. Add that to the chart. I don’t want you to come here and stat type MONDAY I WANT YOU TO USE FORMULAS TO derive that. Then take that data and build charts and graphs from it. Transform that to better do analytics. I have the data in US dollars but I got place over here from GERMANY I also want to see theses charts in EUROS AS WELL AS US DOLLARS. I want you to derive that. The yellow columns these are the once we are giving you but we want you to Find out five more columns out of this data by looking at it. For example If I am preparing this for upper management do I need to see DE on it… I would rather see the word GERMANY IN MY CHART. I want also see north south for example on a chart. So you need to derive that using a formula. +1 (415) 815-5403:
Noticed that this sales quantity right here some technical person look at this and tell me what’s odd? If I want to do arithmetic Calculate the price by sales quantity? Can I do that With this field 24,000 with sales five each?
Sales quantity five each so I need to do transformation to make it a number So I can do arithmetic with that number. You need to to find five more Columns to do analytic
Get data in each one of these rows to help us do analytics
Five more columns on the spreadsheet that will help us do analytics
Description: A new hospital is seeing an increase in the number of their patients. They want to create a software system that manage their employees and patients. A part of their software requirement is given below. You also are required to do a self-study on how the out-patient departments of hospitals function and create a design for managing the department. Requirements: The people in the clinic are either staff members or patients. The hospital works in three shifts, each of 8 hours. The first shift starts at 7am and staff members are required to punch-in. All staff members work only in one shift. The staff members are either Administrative Staff, Operations Staff, or Technical Staff. A patient first approaches the front desk and registers to see a doctor based on availability and the receptionist issues a number to the patient to wait in queue. A patient is then met by the Operations staff, i.e. the doctors, surgeons, or nurses. Based on the outcome of the consultation the patient goes to front desk to finalize the payment of their visit and exits the system. The hospital also has Technical Staff, who are the surgical technologists that deal with surgical equipment and technicians that manage other office equipment. All relevant details like IDs, names, salaries, addresses, and phone numbers, of the patients and staff members are stored in the hospital system. At any point in time, the system is able to generate reports to show the details of staff members that are on a particular shift, the number of patients that are in queue, and the number of patients or technical issues handled by each staff member. Submission: Submit a report that has the following: A paraphrased description of the system based on the requirements given above and a self-study of the required system. A list of all classes (related attributes and behaviors), relationships between classes, and assumptions made. UML class diagram with all class relationships included. Python code that represent classes, which includes the constructor, setter/getter, and other functions for the given requirements. 5% of the total score will be allocated for good documentation of the code and timely submission Report Format Title Page: Include case-study title, student ID, and full name Problem Analysis (20%): In this section, based on given requirements and self-study, a detailed list of all the requirements is given. The list of all classes, their attributes, and behaviors are also listed with data types. Functional Design (20%): In this section, the algorithm or flow chart is provided to explain the logical flow that will drive the use of the system. File structure and information stored in files are described. Class Design (30%): The UML class diagram, with class relationships, and cardinality for the business case is provided. Each relationship is explained, and assumptions are listed. Pseudocode (20%): In this section, provide the Python class structures for all the identified classes with required functionalities. The testing of the system is NOT required. Conclusion (5%): In this section include a reflection on what was learned in this exercise, the challenges faced while working on this assignment, and how the system can be further expanded.
Requirements are given in detail in each problem, so please follow those instructions accordingly. One thing to notify is that this assignment really cares that Matlab codes are written correctly. Thus, I would really appreciate if you pay extra attention to codes and plots. If you need codes, which are given by instructors, please let me know so that I can send you codes promptly.
We willing using 1994 wave of NLYSY1979, a survey data of young adults in USA. The data set and the code books are below.
Data 1994 Codebook NLSY1979.zip (download codebook and data set)
Note that a negative number usually indicates missing observations. The list of majors are listed below, although I would recommend that you aggregate up to a fewer major categories if you want to use those variables.
https://www.nlsinfo.org/content/cohorts/nlsy79/other-documentation/codebook-supplement/nlsy79-attachment-4-fields-study#business (Links to an external site.)
Using the data set provided, complete an exploratory data analysis. Extract or derive the following variables:
Year of birth, country of birth, race, and sex
3 to 5 more variables of interest
Using the extracted variables:
generate additional variables such as age and indicators for categorical variables such as white, black, gender, region, undergraduate major, employment status etc. Black indicator variable, for example, equals 1 if black and 0 otherwise. You will need many indicator variables.
obtain descriptive statistic of the data (count, proportion, means, SD, etc.) for the entire sample and by group (by gender and by race, or employed vs. unemployed, for example)
create visualizations to represent the data (histograms, bar charts, line charts, etc.) for the entire sample and by group
As you explore your data and see it visualized, you may find that your dataset has extreme outliers, incomplete data or just wrong data. If this is the case, you will need to clean your data to get a better understanding of the data. Once you have cleaned the data, perform the exploratory analysis again.
In your report be sure to:
Describe your dataset.
What is the purpose of the dataset? What is your data source?
Sample size and the distribution of important variables (the distribution of income across race/ethnic categories, for sample).
Provide tables for key variables and groups of of interest. This should be done for categorical data, discrete data and continuous data.
Provide visualizations of key variables and groups of interest.
Provide written analysis above and beyond the graphs and tables. Explain what the tables and visualizations tell you about the data.
Present your Exploratory Data Analysis in a report. Incorporate visualizations and tables into the textual analysis of your report. If appropriate, add an appendix of additional data tables and graphs.
The report should follow this flow:
Introduction: introduce the data, its purpose, the sources, the reason for choosing the data and what you hope to learn from the data. Incorporate a discussion of the data cleaning methods used.
Data Analysis: This is the body of the report where you provide descriptions of the data, basic statistical measures, graphs, tables and analysis.
2 to 5 tables with written interpretation
2 to 5 charts with written interpretation
Summary: Summarize the report. Identify the key take-aways from your analysis. Describe what you want to explore further about your data. Identify questions you want to answer with the data.
Your report should be 5 to 10 pages in length, including graphs but excluding the appendix or references.
What to Submit
You must submit 2 files:
The Exploratory Data Analysis report
The R code used to analyze the data
you will find in the attachment there is a file under name (requested project) this is the guideline to help you to write and use the data.
in addition , there are 2 file attached for your reference as am example of similar topic but with different number of data , so the 1st file is called (project reference 1 ) , 2nd one is called (report reference 2) this 2 file are similar to what i request but with different data so i post to see how you can make a similar report with similar format.
This assignment requires to do a data analysis using Machine Learning and report. The topic is Fake News Detection Using Python. For this order, it’s Chapter 2 Data Management. I have uploaded my Chapter 2 report as well as grading feedbacks, requirements. Your task is read requirements and feedbacks carefully, then revise and expand on the current Chapter 2 report. Make sure what have been taken in points will be satisfied after revision. Other additional materials such as Project Scope are also uploaded for you to review. There’s no specific requirement regarding how many pages you need to expand, I would say increase it to 7-9 pages depends on your revision, the current Chapter 2 report has 5.
You should only submit ONE data set (if some variables came from different sources, you must submit your combined data). This data set should contain all variables that you anticipate using in your model, including any potential sources of omitted variable bias. A minimum of 6 variables (including your dependent variable and independent variable) and 100 observations or more are required.
It MUST be in Stata format. (It should be a .dta file; do not submit your log file or your do file!)
*Find a research question from the guideline and make a data with Stata. Please email the question that you made and attach the state file with it.
Project is done. Everything was approved by the instructor except for the video. Please review the project(provided original project files) and the original video submitted. Please make a transcript for this video. Video example transcript also provided.
When u finish i need two things: 1. the code u wrote to anwser the questions in a Jupyter Notebook.The Python code should runnable. 2. 2 pages report in word format, detailing the methodology you followed and write the anwsers u got.