# Business Analysis Statistics Test #4

#### A data collection with numerous variables that lack values is used to create a prediction model. What two issues might this model have? (Select two.)

Please select 2 correct answers

The correct answer:

Complete case analysis means that fewer observations will be used in the model building process.

New cases with missing values on input variables cannot be scored without extra data processing.

#### A logistic regression model is fitted by an analyst to determine whether a client will default on a loan or not. Agents, who each service 15-20 clients, are one of the predictors in the model. Model convergence is not achieved. The analyzed data is printed by the analyst, along with the breakdown of defaulted loans by agent. View a sample of the output below: What is the main cause of the model's inability to converge?

The correct answer:

There is quasi-complete separation in the data.

#### A multiple linear regression model already exists with the addition of a non-contributing predictor variable (Pr > |t| =0.658). What will happen as a result?

The correct answer:

An increase in R-Square

#### Choose the LOGISTIC procedure model statements that are equal. (Select two.)

Please select 2 correct answers

The correct answer:

Mode1 Purchase * Gender Age Region;

Mode1 Purchase * Gender|Age|Region @1;

#### A linear model includes the following features: a subordinate variable (y) a continuous predictor variable set of three (x1-x3) an individual category predictor variable (c1with 3 levels) Which SAS module does this model fit?

The correct answer:

Proc glm data=sasuser.mlr;class c1;model y=c1 x1-x3 /solution;run;

#### The data 13, 15, 16, 17, 19, and 20's median are as follows:

The correct answer:

33/2

#### When taken from: the total of the absolute deviations is at its minimum.

Explanation:

A dataset listed in ascending order is given a median value using a statistical technique (i.e., from smallest to largest value). The measure separates the dataset's lower and upper halves. The median is a measurement of central tendency together with mean and mode.

#### If a series' maximum value is 25, and its range is 15, then the series' maximum value is:

The correct answer:

30

#### A debate competition with 10 competitors had a rank correlation coefficient of 0.6, according to the calculations. Later on, it was found that the difference in some participants' ranks was actually 8 instead of 3. How to calculate the right correlation coefficient:

The correct answer:

0.933

#### The dispersion is referred to as follows if all exam scores tend to cluster around the mean:

The correct answer:

Small

#### Find the set's mode (8, 5, 7, 10, 15, 21, 5, 7, 2, 5)

The correct answer:

5