Note: Your grade is based on the PROJECT REPORT. You should submit your code as well; but grading...

Question

Note: Your grade is based on the PROJECT REPORT.
- You should submit your code as well; but grading will be based on code and results described within the report.
- Include images of figures in the report along with your description of results.
- Use appendices to include relevant code (appendices do not count towards page limits)
Submission: Two files
- Single pdf file for the project report
- Zip file with all relevant code for your project

Sheet1 CarMPGWeightCylindersHorsepowerCountry Buick Skylark28.42670490U.S. Dodge Omni30.92230475U.S. Mercury Zephyr20.83070685U.S. Fiat Strada37.32130469Italy Peugeot 694 SL17.834106133France VW Rabbit31.91925471Germany Plymouth Horizon34.22200470U.S. Mazda GLC34.11975465Japan Buick Estate Wagon16.943608155U.S. Audi 500022.528305103Germany Chevy Malibu Wagon19.236058125U.S. Dodge Aspen18.636206110U.S. VW Dasher30.52190478Germany Ford Mustang 426.52585488U.S. Dodge Colt35.11915480Japan Datsun 810222815697Japan VW Scirocco31.51990471Germany Chevy Citation28.825956115U.S. Olds Omega26.827006115U.S. Chrysler LeBaron Wagon18.539408150U.S. Datsun 51027.22300497Japan AMC Concord D/L18.134106120U.S. Buick Century Special20.633806105U.S. Saab 99 GLE21.627954115Sweden Datsun 21031.82020465Japan Ford LTD17.637258129U.S. Volvo 240 GL1931406125Sweden Dodge St Regis18.238308135U.S. Toyota Corona27.52560495Japan Chevette302155468U.S. Ford Mustang Ghia21.929106109U.S. AMC Spirit27.42670480U.S. Ford Country Squire Wagon2840548142U.S. BMW 320i23.126004110Germany Pontiac Phoenix33.52556490U.S. Honda Accord LX29.52135468Japan Mercury Grand Marquis16.539558138U.S. Chevy Caprice Classic1738408130U.S. Engineering Department, UMass Boston Spring 2022 ENGIN 322: Probability and Random Processes Project #2: Sampling Theory and Linear Regression Project Description In this project, you will assume the part of an automotive manufacturer. In part I, you will provide background analysis for the development of a new vehicle. In part II, you will develop a method for sampling manufactured parts in order to determine the likelihood of failure. (Note: Data for Part I of the project is slightly modified from: https://www.statcrunch.com/5.0/viewresult.php?resid=1878105). Part I The spreadsheet ‘Vehicle_Info.xlsx’ provides detailed information about vehicles on the market. In Part I, you should load the data into Matlab and compare vehicle properties to average miles per gallon (mpg). a) Load the data set into Matlab with the readtable() function b) Determine the data set correlation (i.e., Pearson’s r) for average mpg versus weight, number of cylinders, and horsepower. c) Use the scatter() and polyfit() functions to display the data points and linear regression curves for average mpg versus weight, number of cylinders, and horsepower. For each of the 3 plots, show average mpg on your Y axis. Label the resulting plots and include them in your report. Question 1: Would you say that a vehicles horsepower impacts the average mpg? What about the impact of weight and number and cylinders on average mpg? Explain your reasoning. Question 2: Your company’s newest vehicle model will be a 6 cylinder that weighs 3,250lbs and has 100 horsepower. If your goal is to be above the regression line for every category, what average mpg should you strive for? Part II Assume that you have received reports of faulty headlights. The data values in the file ‘population_data.mat’ represent the population of headlights on the production line (0 represents a good headlight, 1 represents a faulty headlight). You are unable to test all headlights; but you are able to randomly sample ?? of the headlights as they come off the production line. In Part II, you should simulate the following in Matlab for values of ?? ranging from 1 to 2000. a) For each value of ??, randomly sample n headlights from the population and determine the sample mean and sample variance (i.e., the percentage of faulty headlights in your sample and the associated variance of the sample). b) Plot the sample mean and sample variance versus the sample size. Question 1: What do you notice about the sample mean and sample variance as you increase sample size? Question 2: Based on your plots, what can you predict about the percentage of headlights in the population that are faulty? Explain your reasoning. (Hint: Zoom in on the values from n=1000 to n=2000 for a better visualization. You can compare your prediction to the true mean by determining the mean value of the full set of values from ‘population_data.mat’) https://www.statcrunch.com/5.0/viewresult.php?resid=1878105 Grading Metrics • Coding and Results: 30% • Theoretical Analysis: 40% • Written Report: 30% Coding and Results: This portion of the project will be graded based on the implementation of code as described for Part I and II above. Results should include a description of the observed outcomes and some depiction of the results from your code. Theoretical Analysis: This portion of the project will be graded based on your answers to the questions above. Be sure to clearly indicate the answers AND REASONING for each of the questions within your written report. Written Report: The written report should be submitted on blackboard by midnight on April 8. The report should be 3-5 pages including an overview of the project, expected outcomes, your analysis method, results, and observations. You may include any code as an appendix. Project Description Part I Part II Grading Metrics Part II - Old

vehicleinfo-a0fvrbg5.xlsx populationdata-hevffxkz.mat project-2-description-2-55uwxxaw.pdf

Vishvajeet · Accepted Answer

Engineering Department, UMass Boston Spring 2022
ENGIN 322: Probability and Random Processes
Project #2: Sampling Theory and Linear Regression
Name:……………………………………………… ID:………………………..
Name:……………………………………………… ID:……………………….
Table of Contents
Title Page
1 
Objective
3 
Introduction
3 
Apparatus
3 
Procedure
3 
Results and Discussion
4-11 
Conclusion
References                         
11
11
Objective: In this project, you will assume the part of an automotive manufacturer. In part I, 
you will provide background analysis for the development of a new vehicle. In part II, you will 
develop a method for sampling manufactured parts in order to determine the likelihood of 
failure. 
Introduction:  
Regression analysis is an important statistical method for the analysis of data. By applying 
regression analysis, we are able to examine the relationship between a dependent variable and 
one or more independent variables. In this article, I am going to introduce the most common 
form of regression analysis, which is the linear regression. As the name suggests, this type of 
regression is a linear approach to modeling the relationship between the variables of interest. 
Method 
Linear regression is used to study the linear relationship between a dependent variable (y) and 
one or more independent variables (X). The linearity of the relationship between the 
dependent and independent variables is an assumption of the model. The relationship is modeled 
through a random disturbance term (or, error variable) ε. The disturbance is primarily 
important because we are not able to capture every possible influential factor on the dependent 
variable of the model. To capture all the other factors, not included as independent variable, that 
affect the dependent variable, the disturbance term is added to the linear regression model. 
In this way, the linear regression model takes the following form:
where
are the regression coefficients of the model (which we want to estimate!), and K is the number 
of independent variables included. The equation is called the regression equation. 
Simple linear regression 
Let’s take a step back for now. Instead of including multiple independent variables, we start 
considering the simple linear regression, which includes only one independent variable. Here, 
we start modeling the dependent variable yi with one independent variable xi:
where the subscript i refers to a particular observation (there are n data points in total). Here, β0 
and β1 are the coefficients (or parameters) that need to be estimated from the data. β0 is the 
intercept (a constant term) and β1 is the gradient. 
In simple linear regression, we essentially predict the value of the dependent variable yi using 
the score of the independent variable xi, for observation i.
Apparatus/Tools: 
• Personal Computer or Laptop. 
• MATLAB etc
Procedure: 
Results and Discussion: 
Part 1 –  
a) Load the data set into Matlab with the readtable() function  
b) Determine the data set correlation (i.e., Pearson’s r) for average mpg versus weight, number 
of cylinders, and horsepower.  
c) Use the scatter() and polyfit() functions to display the data points and linear regression curves 
for average mpg versus weight, number of cylinders, and horsepower. For each of the 3 plots, 
show average mpg on your Y axis. Label the resulting plots and include them in your report. 
Part 2 – 
Assume that you have received reports of faulty headlights.

Sheet1 CarMPGWeightCylindersHorsepowerCountry Buick Skylark28.42670490U.S. Dodge Omni30.92230475U.S. Mercury Zephyr20.83070685U.S. Fiat Strada37.32130469Italy Peugeot 694...

Answer To: Sheet1 CarMPGWeightCylindersHorsepowerCountry Buick Skylark28.42670490U.S. Dodge...

Answer To This Question Is Available To Download

Related Questions & Answers

Submit New Assignment