Outline The project is to use a multiple regression analysis to analyze a data set that is of interest to you. If you have a strong interest in two group or analysis of variance you can do that with...

1 answer below »
Multiple Linear Regresion


Outline The project is to use a multiple regression analysis to analyze a data set that is of interest to you. If you have a strong interest in two group or analysis of variance you can do that with my concurrence. The final report for the project should be a 5-7 page paper that describes the questions of interest, how you used your data set to analyze these questions with details on the steps you used in your analysis, your findings about your question of interest and the limitations of your study. Specifically, your report should contain the following: 1. Abstract: A one paragraph summary of what you set out to learn, and what you ended up finding. It should summarize the entire report. 1. Introduction: A discussion of what questions you are interested in. 1. Data Set: Describe details about how the data set was collected and the variables in the data set. 1. Analysis: Describe how you used multiple regression to analyze the data set. Specifically, you should discuss how you carried out the steps in analysis discussed in class, i.e., exploration of data to find an initial reasonable model, checking the model and changes to the model based on your checking of the model. 1. Results: Provide inferences about the questions of interest and discussion. 1. Limitations of study and conclusion: Describe any limitations of your study and how they might be overcome in future research and provide brief conclusions about the results of your study. 1. Abstract 2. Introduction a. What’s the problem? b. Why is it important c. How do you plan to solve it? d. Who cares? e. Why do they care f. (industry graph) g. Lit Review (Background) Industry Review i. Describe the industry ii. Scholar.google.com iii. Academic papers iv. Articles v. What have other people done in this research in the past vi. *cited!!! 2. Data a. What is your data b. Where did you get it c. Descriptive statistics table 3. Methodology a. Describe your methodology b. Include equations c. Linear regression d. 4. Results a. Describe your results b. Are they significant? c. 10 step process d. Alpha (.05) 5. Conclusion a. The model is good b. Why is it important c. This is important to whom? d. How will this change the industry e. Why should I care?? Data Sets The project will be of most interest to you if you find questions of interest and a data set that are of interest to you. Examples of questions of interest are as follows: 1. What properties of a baseball team best predict its success over the course of a season? 1. What properties of a college are related to its rank in the U.S. News and World Report rankings? 1. Is the unemployment rate related to economic measures such as interest rates, stock returns, and the inflation rate? 1. What properties of a state predict the proportion of the vote that George Bush (John Kerry) received in it? You will need a data set to explore your question of interest. I will be happy to help you with suggestions. The data set should ideally contain at least 30-50 observations or more (e.g., companies, people, countries, etc., as the case may be), and at least 4 variables (pieces of information about the observations; e.g., stock price, revenues, profits, salaries, gender, etc.), although if that is not possible, exceptions will be allowed (subject to my approval). Do not be concerned if your dataset is large. One of the variables should be such that it is a numerical variable that would be of interest to try to model or forecast (e.g., for the examples above, team winning percentage, stock price change, U.S. News and World Report rank, gas mileage, unemployment rate, and proportion of vote received respectively). I will be happy to discuss ideas with you. Here are a few potential sources of ideas and data: 1. http://kaggle.com 1. http://www.hawkeslearning.com/Statistics/dis/datasets.html 1. https://www.springboard.com/blog/free-public-data-sets-data-science-project/ 1. https://www.dataquest.io/blog/free-datasets-for-projects/ 1. http://lib.stat.cmu.edu/DASL/ Samples A good sample of what I’m expecting from the projects and reports is contained at the web site http://pages.stern.nyu.edu/~jsimonof/classes/1305/projdoc/ . Note that these reports are for a class taught at New York University by Jeffrey Simonoff, so some of the methods used in the regression analyses may be unfamiliar to you.
Answered Same DayNov 27, 2021

Answer To: Outline The project is to use a multiple regression analysis to analyze a data set that is of...

Shakeel answered on Dec 01 2021
149 Votes
7
Abstract
In this paper, it is tried to analyze the factors affect the price of used car. For that purpose, the underlying variables – Years of registration, Power PS and Kilometer are taken. Historical data of past 25 years is taken according to the car model. Descriptive statistic
shows that there is a huge variation in price data and no of kilometer a car has run. Data of all four variables are also not normally distributed. Further, the regression analysis shows that there is a significant relationship between price of used cars and the other three variables. Such findings would be useful for better pricing of used car based on significant variables. Industry would grow in guise of better dealing between dealers and customers. Government authority would also be in benefited in deciding motor vehicle tax and insurance premium.
Introduction
The value of running car depends upon the several factors like what is the manufacturing model, how run the car has run; the power of engine, fuel efficiency and so on. Correct pricing of old car is important for both dealers as well as customers. People who don’t afford the new car generally pursue to buy the old one and put them on restoration and customization to make them as per choice. Therefore, correct pricing is important to have their dream car in budget. Pricing of old car is also important for deciding the premium of motor bike insurance. However, estimation of old car’s price is generally not carried out mathematically and therefore, it always remain in blind alley to find the correct price. Hence, we have no sufficient literature and research papers on this field. Therefore, it is interesting to find how the value of old car is interrelated to the different underlying variables. Such findings would be quit helpful for dealer as well as customers. Government authority may also use such report for their calculation of motor bike tax and insurance purpose. Car making companies may also use the finding to better gauge their production process to yield better price of their car after the usage for certain period.
Data
Four major variables are taken – Price of used car, Year of registration, Power PS and Kilometer of running. According to the model of car, figure of these four variables are found. Data is taken for the last 25 years. All variables are classified as numeric variables because they have certain values and can be measured on the scale.
The descriptive statistics of all four variables is as follows –
    
    Price
    Year Of Registration
    Power PS
    Kilometer
    
    
    
    
    
    Mean
    17,295.44
    2,004.58
    115.55
    125,617.94
    Standard Error
    5,886.26
    0.15
    0.32
    65.81
    Median
    2,950.00
    2,003.00
    105.00
    150,000.00
    Mode
    0.00
    2,000.00
    0.00
    150,000.00
    Standard Deviation
    3,587,905.48
    92.87
    192.14
    40,113.10
    Sample...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here