Which statistic is based on the maximum vertical distance between the primary event EDF and the secondary event EDF?
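As an illustration only, the quantity this question describes can be computed by hand. The sketch below assumes two small lists of model scores, one for the primary event and one for the secondary event; the data and the helper names `edf` and `max_edf_gap` are hypothetical and not taken from any exhibit.

```python
from bisect import bisect_right

def edf(sample):
    """Return a function giving the empirical distribution function of sample."""
    ordered = sorted(sample)
    n = len(ordered)
    # Proportion of observations less than or equal to x.
    return lambda x: bisect_right(ordered, x) / n

def max_edf_gap(primary_scores, secondary_scores):
    """Largest vertical distance between the two EDFs."""
    f_primary = edf(primary_scores)
    f_secondary = edf(secondary_scores)
    # Both EDFs are step functions, so the gap only changes at observed values.
    grid = sorted(set(primary_scores) | set(secondary_scores))
    return max(abs(f_primary(x) - f_secondary(x)) for x in grid)

# Hypothetical scores for events (primary) and non-events (secondary).
events = [0.9, 0.8, 0.7, 0.4]
non_events = [0.6, 0.3, 0.2, 0.1]
gap = max_edf_gap(events, non_events)
print(round(gap, 3))
```

A gap near 1 means the scores separate the two groups almost completely; a gap near 0 means the two score distributions overlap.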
Which method is NOT an appropriate way to score new observations with a known target in a logistic regression model?
Refer to the exhibit:
On the Gains Chart, what is the correct interpretation of the horizontal reference line?
Refer to the confusion matrix:
An analyst determines that loan defaults occur at the rate of 3% in the overall population. The above confusion matrix is from an oversampled test set (1 = default).
What is the sensitivity adjusted for the population event probability?
Round your answer to three decimals (example: n.nnn).
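Because the exhibit is not reproduced here, the following is only a sketch of how confusion-matrix cells from an oversampled set might be rescaled to population proportions. It assumes events and non-events were sampled separately, so that the within-class rates carry over; the counts and the function name `adjust_confusion_matrix` are hypothetical.

```python
def adjust_confusion_matrix(tp, fn, fp, tn, population_event_rate):
    """Rescale oversampled confusion-matrix counts to population proportions."""
    sensitivity = tp / (tp + fn)   # rate among actual events
    specificity = tn / (tn + fp)   # rate among actual non-events
    pi1 = population_event_rate    # population share of events
    pi0 = 1.0 - pi1                # population share of non-events
    return {
        "tp": pi1 * sensitivity,
        "fn": pi1 * (1.0 - sensitivity),
        "fp": pi0 * (1.0 - specificity),
        "tn": pi0 * specificity,
    }

# Hypothetical counts, NOT the ones from the exhibit.
cells = adjust_confusion_matrix(tp=40, fn=10, fp=20, tn=30,
                                population_event_rate=0.03)
adjusted_sensitivity = cells["tp"] / (cells["tp"] + cells["fn"])
adjusted_ppv = cells["tp"] / (cells["tp"] + cells["fp"])
print(round(adjusted_sensitivity, 3), round(adjusted_ppv, 3))
```

Comparing each adjusted measure with its unadjusted counterpart shows which ones depend on the event rate and which do not.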
An analyst knows that the categorical predictor, zip_code, is an important predictor of a binary target. However, zip_code has too many levels to be a feasible predictor in a model. The analyst uses PROC CLUSTER to implement Greenacre's method to reduce the number of categorical levels.
What is the correct application of Greenacre's method in this situation?
The selection criterion used in the forward selection method in the GLMSELECT procedure is:
While building a predictive model, median imputations are performed while preparing the training data.
How should the imputations be addressed in the validation data?
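One common convention, sketched here under the assumption that validation data must be treated exactly as future scoring data would be, is to compute the medians on the training data only and reuse those stored values elsewhere. The column name `income`, the rows, and the helpers `fit_medians` and `apply_medians` are hypothetical.

```python
from statistics import median

def fit_medians(rows, columns):
    """Compute per-column medians from the non-missing training values."""
    medians = {}
    for col in columns:
        observed = [row[col] for row in rows if row[col] is not None]
        medians[col] = median(observed)
    return medians

def apply_medians(rows, medians):
    """Fill missing values using medians that were fitted elsewhere."""
    filled = []
    for row in rows:
        new_row = dict(row)
        for col, value in medians.items():
            if new_row.get(col) is None:
                new_row[col] = value
        filled.append(new_row)
    return filled

# Hypothetical data: the validation incomes are much larger than training.
train = [{"income": 30}, {"income": 50}, {"income": None}, {"income": 70}]
valid = [{"income": None}, {"income": 200}, {"income": 220}]

train_medians = fit_medians(train, ["income"])
valid_filled = apply_medians(valid, train_medians)
print(train_medians, valid_filled[0])
```

The missing validation value is filled with 50, the training median, even though the observed validation incomes are far higher.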
Customers were surveyed to assess their intent to purchase a product. An analyst divided the customers into groups defined by the company's pre-assigned market segments and tested for differences in the customers' average intent to purchase. The following is the output from the GLM procedure:
What percentage of customers' intent to purchase is explained by market segment?
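The GLM output itself is not reproduced here. As a sketch of the underlying arithmetic, the share of variation explained in a one-way analysis is the model sum of squares divided by the corrected total sum of squares. The segment labels and intent scores below are hypothetical.

```python
def r_square_one_way(groups):
    """R-square for a one-way layout: model SS / corrected total SS."""
    all_values = [v for values in groups.values() for v in values]
    grand_mean = sum(all_values) / len(all_values)
    # Corrected total SS: squared deviations from the grand mean.
    ss_total = sum((v - grand_mean) ** 2 for v in all_values)
    # Model SS: squared deviations of group means, weighted by group size.
    ss_model = sum(
        len(values) * (sum(values) / len(values) - grand_mean) ** 2
        for values in groups.values()
    )
    return ss_model / ss_total

# Hypothetical intent-to-purchase scores by market segment.
intent_by_segment = {"A": [2, 4], "B": [6, 8], "C": [10, 12]}
r2 = r_square_one_way(intent_by_segment)
print(f"{100 * r2:.1f}% of the variation is explained by segment")
```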
PROC GLMSELECT was used for building a model predicting the natural log of a baseball player's salary from certain performance and longevity statistics. The model used backward elimination using SBC as its selection criterion. The sequence of steps is summarized in the graphic shown below:
At Step 9, the number of at bats (nAtBat) was removed from the model.
Why was it removed?
The question will ask you to provide a missing statement. Given the following SAS program:
Which SAS statement will complete the program to correctly score the data set NEW_DATA?