100% Guaranteed Results


Stat656 – Week
$ 24.99
Category:

Description

5/5 – (1 vote)

Assignment: Starting with this week, you are only expected to complete one of these parts. If you do both parts, you can double the points you receive for the assignment. In other words, you can look at this as two assignments. If you do one successfully you get 100 points for the assignment. If you have time to do both and you complete them successfully you can get 200 points. You are expected to complete at least one part successfully.

Data File: sonar3by5.xlsx

Part 1: Create a SAS EM project names “Week 3 Homework”. In that project read this data file for this assignment. Import the data, ensuring that all attributes have the proper metadata described in the data dictionary. In this case, the target is binary ‘Object’, and all independent attributes are interval.

The data dictionary for this file was described in the Week 1 assignment.

Object: The binary target with values “R” for rock and “M” for mine.

R1’ thru ‘R60’ : Interval attributes. All of their values must be greater than or equal to 0 and less than or equal to 1. These are used as input attributes.

Using the data dictionary for this file, identify outliers and replace them with missing. Impute all missing values using the ‘Tree’ method.

Use a 70/30 partition to build regression models for predicting ‘Amount’ using forward, stepwise and backward regression. Use the HP and Non-HP models, and select the best.

Using the best model, predict the object for each case. Report the confusion matrix for your predictions along with the misclassification rate, Sensitivity and Specificity.

Also print the top 15 observations from the final file containing the data plus, imputed values and predictions.
Part 2: Do the same assignment as Part 1 using Python.
However since Python does not impute using the ‘Tree’ method, impute using the average.

Do not scale the interval attributes. Since there are no nominal attributes, encoding is not necessary (“NONE”)

From these data, after imputation, fit a logistic regression model for predicting ‘Object’ . Use 70/30 cross validation (split) as you did in SAS EM.

Report a listing of your code, plus the same statistics you reported from Part 1.

Upload a pdf file containing your solutions to parts 1 and 2.

Part 1: SAS EM
1. A screen shot of your project window
2. The confusion matrix for predicting “object” along with the misclassification rate, Sensitivity and Specificity.
3. A listing of the first 15 observations after imputation and prediction.

Part 2: Python
1. A copy of your python program 2. & 3. Same as part 1.

In order to receive full credit, please ensure all screen shots, code and tables are readable.

Reviews

There are no reviews yet.

Be the first to review “Stat656 – Week”

Your email address will not be published. Required fields are marked *

Related products