Competencies In this project, you will demonstrate your mastery of the following competencies: Apply statistical techniques to address research problems Perform regression analysis to address an...

1 answer below »

Competencies


In this project, you will demonstrate your mastery of the following competencies:



  • Apply statistical techniques to address research problems

  • Perform regression analysis to address an authentic problem


Overview


The purpose of this project is to have you complete all of the steps of a real-world linear regression research project starting with developing a research question, then completing a comprehensive statistical analysis, and ending with summarizing your research conclusions.


Scenario


You have been hired by the D. M. Pan National Real Estate Company to develop a model to predict median housing prices for homes sold in 2019. The CEO of D. M. Pan wants to use this information to help their real estate agents better determine the use of square footage as a benchmark for listing prices on homes. Your task is to provide a report predicting the median housing prices based square footage. To complete this task, use the provided real estate data set for all U.S. home sales as well as national descriptive statistics and graphs provided.


Directions


Using the Project One Template located in the What to Submit section, generate a report including your tables and graphs to determine if the square footage of a house is a good indicator for what the listing price should be. Reference the National Statistics and Graphs document for national comparisons and the Real Estate County Data spreadsheet (both found in the Supporting Materials section) for your statistical analysis.



Note:Present your data in a clearly labeled table and using clearly labeled graphs.


Specifically, include the following in your report:



Introduction




  1. Describe the report:Give a brief description of the purpose of your report.

    1. Define the question your report is trying to answer.

    2. Explain when using linear regression is most appropriate.

      1. When using linear regression, what would you expect the scatterplot to look like?



    3. Explain the difference between response and predictor variables in a linear regression to justify the selection of variables.





Data Collection




  1. Sampling the data:Select a random sample of 50 counties.

    1. Identify your response and predictor variables.




  2. Scatterplot:Create a scatterplot of your response and predictor variables to ensure they are appropriate for developing a linear model.



Data Analysis




  1. Histogram:For your two variables, create histograms.


  2. Summary statistics:For your two variables, create a table to show the mean, median, and standard deviation.


  3. Interpret the graphs and statistics:

    1. Based on your graphs and sample statistics, interpret the center, spread, shape, and any unusual characteristic (outliers, gaps, etc.) for the two variables.

    2. Compare and contrast the shape, center, spread, and any unusual characteristic for your sample of house sales with the national population. Is your sample representative of national housing market sales?





Develop Your Regression Model




  1. Scatterplot:Provide a graph of the scatterplot of the data with a line of best fit.

    1. Explain if a regression model is appropriate to develop based on your scatterplot.




  2. Discuss associations:Based on the scatterplot, discuss the association (direction, strength, form) in the context of your model.

    1. Identify any possible outliers or influential points and discuss their effect on the correlation.

    2. Discuss keeping or removing outlier data points and what impact your decision would have on your model.




  3. Findr:Find the correlation coefficient (r).

    1. Explain how thervalue you calculated supports what you noticed in your scatterplot.





Determine the Line of Best Fit.Clearly define your variables. Find and interpret the regression equation. Assess the strength of the model.




  1. Regression equation:Write the regression equation (i.e., line of best fit) and clearly define your variables.


  2. Interpret regression equation:Interpret the slope and intercept in context.


  3. Strength of the equation:Provide and interpretR-squared.

    1. Determine the strength of the linear regression equation you developed.




  4. Use regression equation to make predictions:Use your regression equation to predict how much you should list your home for based on the square footage of your home.



Conclusions




  1. Summarize findings:In one paragraph, summarize your findings in clear and concise plain language for the CEO to understand. Summarize your results.

    1. Did you see the results you expected, or was anything different from your expectations or experiences?

      1. What changes could support different results, or help to solve a different problem?

      2. Provide at least one question that would be interesting for follow-up research.





Answered Same DayMay 26, 2021

Answer To: Competencies In this project, you will demonstrate your mastery of the following competencies: Apply...

Subhanbasha answered on May 27 2021
147 Votes
2
[Insert Your Own Title Here]
Report: Median Housing Price Prediction
[Name]
Median Housing Price Prediction Model for D. M. Pan National Real Estate Company    1
Southern New Hampshire University
Introduction
The following report will provide D.M. Pan National Real Estate Company with a better understanding of 2019 U.S. home sales and whethe
r the square footage of a home is a good indicator of what it’s listing price will be. To gain this understanding, this report will cover median housing prices for homes sold in 2019 and what their median square footage was. This report will help real estate agents at D.M. Pan National Real Estate to better determine the use of square footage as a benchmark for listing prices on homes. Using linear regression is most appropriate for this set of data because we want to predict the value of a variable (median selling price) based on the value of another variable (median square feet of the home). The median selling price is the dependent variable in this situation so when using linear regression, we would expect a scatterplot to have a high, positive, linear correlation line. This line ultimately looks like a diagonal line because as selling price increases, we would expect square footage of a home to increase as well. The response variable is the median selling price of the home whereas the predictor variable is the square feet of the home. Overall, the report will provide the company with valuable data to that will explain which location within the U.S. the most profits per square foot of a house can be made for the company and the company’s clients.
Data Collection
Sampling the data:
The sampled data obtained from the population data by making the sample number generation it will give the record numbers to be samples so we take those records of observation into the sample. Here we are taken 25% of the data from the population using simple random sampling method.
Scatterplot:
Scatter plot for median listing price:
Scatter plot for Median Square feet:
Data Analysis
The data should be contains more than one variable to do linear regression. In that one should be dependent variable and another one or more variables as independent variables. Here we can also call the dependent variable as response variable and independent variables as predictor variables.
    The response variable depends on the predictor variables and predictor variables are independent of each other these will affect the response variable.
Histogram:
Histogram for Median listing prices:
Histogram for Square feet:
Summary statistics:
    Column
    n
    Mean
    SD
    Min
    Q-1
    Median
    Q-3
    Maxi
    median listingprice
    244
    $288,602
    $140016.1
    $75,780
    $190890
    $265937
    $341447
    $1,001385
    
    
    
    
    
    
    
    
    
    median $'s per square foot
    244
    $140.7
    $79.7258
    $45
    $95
    $123
    $152.5
    $747
    median square feet
    244
    1975
    367.7532
    995
    1748
    1949
    2159
    3314
Interpret the graphs and statistics:
The above plot is boxplot of the median listing prices from that we can say that the data is not following normal it is right skewed data and there present outliers.
The above plot is boxplot of the median $'s per square foot from that we can say that the data is not following normal it is right skewed data and there present outliers.
The above plot is boxplot of the median square feet from that we can say that the data is following approximately normal and there present outliers.
The above all...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here