Please refer to the Project video for complete context about the “Advance Statistics” problem.
The objective of the project is to use the dataset ‘Factor-Hair-Revised.csv’ to build an optimum regression model to predict satisfaction. You are expected to:
- Perform exploratory data analysis on the dataset. Showcase some charts and graphs. Check for outliers and missing values (8 marks)
- Is there evidence of multicollinearity? Showcase your analysis (6 marks)
- Perform simple linear regression for the dependent variable with every independent variable (6 marks)
- Perform PCA/Factor analysis by extracting 4 factors. Interpret the output and name the Factors (20 marks)
- Perform multiple linear regression with customer satisfaction as the dependent variable and the four factors as independent variables. Comment on the model output and validity. Your remarks should make the results meaningful to everybody
Please note the following:
- Your submission should be a Word Document or PDF.
- You must give the sources of data presented. Do not refer to blogs, Wikipedia, etc.
- Any assignment found to be copied or plagiarized from other group(s) will not be graded and will be marked as zero.
- Please ensure timely submission, as assignments submitted after the deadline will not be accepted.