· Page length: Three page maximum. · The project involves: · Collecting data suitable for an analysis of positioning on a minimum of 15 to 20 objects. · Product-service price: this will serve as the...

1 answer below »

· Page length: Three page maximum.


· The project involves:


· Collecting data suitable for an analysis of positioning on a minimum of 15 to 20 objects.


· Product-service price: this will serve as the dependent variable in your regression model


· Two other product attributes that will serve as independent variables in your regression model


· Creating a positioning scatterplot, examining and reporting on multiple regression model results, and providing an executive summary analyzing the situation


· Comments on selecting objects:


· To reduce extraneous variance and increase the relevancy of your results, you need to pay attention to the conditions under which the data is collected. For example, the objects must be in the same product category. Notice that in the sedan example, light trucks and SUVs were not included. In the hotel example, apartment and condominium rental units were not included. In addition, prices were collected for one specific day.


· If necessary, you should control for location and time: In the hotel example, hotels were restricted to downtown hotels and on a specific day.


· Please feel free to replicate in part the sedan and hotel examples, but with some twists. Examples:


· An analysis of Los Angeles hotels. You would model price as the dependent variable. One regression independent variable might be average review out of five. The other might be whether the hotel is located at the airport or in the downtown city center. This would allow you to understand the average price difference across locations. If this option is chosen, you should consider modeling 20 objects, with 10 in each location. On the other hand, you could model just one location and select a different attribute such as the presence of a swimming pool.


· An analysis of light trucks modeling horsepower and vehicle size as independent variables. You could model size as wheelbase length. You could engage in the same analysis, but for SUVs.


· Report format:


· Page 1:


· Project title and team


· Executive summary


· Pages 2 and 3: Charts, model, process notes, and additional technical details


· Provide a scatterplot using price on the vertical axis and one of your two independent variables on the horizontal axis.


· Use effective labeling and label your objects in the scatterplot


· Note that charting all three variables at once is challenging and is therefore not required. Please select just one of your independent variables and plot just that one.


· Data file (Excel): the actual data itself


· Variable descriptions. For example, price in the sedan example was defined as “Price was taken from Kelly’s Blue using the base automatic model MSRP.”


· Report the results of your regression model. The following template may be used (fictitious example):




Model r-square = .75


Model F = 12.278; p-value < or=""> .10


Regression model:


Predicted value of Y = Intercept + Slope estimate 1 x Variable 1 + slope estimate 2 x Variable 2


Regression model: Y = 2.79 + $8,000 x Sedan Horsepower + $8,200 x Sedan Size


Sedan Horsepower p-value < or=""> .10


Sedan Size p-value < or=""> .10




Answered 1 days AfterJul 31, 2021

Answer To: · Page length: Three page maximum. · The project involves: · Collecting data suitable for an...

Mohd answered on Aug 02 2021
130 Votes
Executive summary:
Our purpose of this analysis is to leverage hotel data to make more informed decisions and get actionable insights from data. How can we utilize the booking information to better understand people's perspective and behavior?
We have retrieved this dataset from Kagg
le. In our dataset we have variables like ID, Hotel name, price for one night, hotel star rating, Distance from city Centre, customers rating on booking, number of rooms area and city. We have done some exploratory analysis including descriptive measures, histogram, scatter plot, boxplot and correlation analysis. We have found that several variables are strongly correlated with prices.
We have built a regression model on the basis of these relationships. After conducting regression analysis, we have found that hotel star rating and number of rooms and area are statistically significant to predict our response variable price. We have adjusted r square value of 0.5311, that means almost 53 percent variability in response variable prices we can explain with this model. 
Data preparation: We have retrieved this dataset from kaggle. It has no missing value. Organized In the Excel sheet in such a manner it's easy to read. We can code categorical variables in order to smoothly build regression models.
Continuous Variables:
In case of continuous variables, we want to see the central tendency and the distribution of the variable. These are measured using various statistical visualization methods such as Histogram. In the case of scatter plot we want to analyse variation of two variables (both must be continuous).
Categorical Variables
For categorical variables, we need to understand the distribution of each category. We have built a boxplot of price across all cities. It is measured using aggregation like count against each category.
Data dictionary: In appendix
Correlation matrix:
     
    Customer rating
    Price(BAM)
    Hotel star rating
    Distance
    Rooms
    Squares
    Customer rating
    1.00
     
     
     
     
     
    Price(BAM)
    0.32
    1.00
     
     
     
     
    Hotel star rating
    0.50
    0.49
    1.00
     
     
     
    Distance
    -0.34
    -0.10
    -0.07
    1.00
     
     
    Rooms
    -0.05
    0.44
    -0.02
    0.06
    1.00
     
    Squares
    0.28
    0.66
    0.44
    0.05
    0.42
    1.00
Figure: 1
Prices are moderately correlated with rooms and house star ratings. Prices are strongly correlated with area. Distance has weak negative correlation with prices.
Scatter plot:
Now, we want to explore the type of relationship between prices and star ratings, whether it is linear, polynomial, or exponential.
We have drawn a scatter plot between prices and star ratings and it's not clear. If we want to fit a regression line in a scatter plot it would be a good fit, visualization.
As we can see from the above graph, if we want to fit a regression...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here