Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

ISYE 6501 Exam Questions and Answers (Solved Papers), Exams of Nursing

ISYE 6501 Exam Questions and Answers (Solved Papers)

Typology: Exams

2024/2025

Available from 07/14/2025

Allivia
Allivia ๐Ÿ‡จ๐Ÿ‡ฆ

3.7

(78)

9.5K documents

1 / 17

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
ISYE 6501 Exam Questions and Answers
(Solved Papers)
Classification problems are commonly solved using what
model(s)? - Correct Answers โœ…Support Vector Machine
Clustering problems are commonly solved using what model(s)? -
Correct Answers โœ…k-means
Response Prediction questions are commonly solved using what
model(s)? - Correct Answers โœ…-ARIMA
-CART
-Exponential smoothing
-linear regression
-logistic regression
-Random Forest
Validation questions are commonly solved using what model(s)? -
Correct Answers โœ…-Cross Validation
Variance Estimation questions are commonly solved using what
model(s)? - Correct Answers โœ…-GARCH
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff

Partial preview of the text

Download ISYE 6501 Exam Questions and Answers (Solved Papers) and more Exams Nursing in PDF only on Docsity!

(Solved Papers)

Classification problems are commonly solved using what model(s)? - Correct Answers โœ…Support Vector Machine Clustering problems are commonly solved using what model(s)? - Correct Answers โœ…k-means Response Prediction questions are commonly solved using what model(s)? - Correct Answers โœ…-ARIMA -CART -Exponential smoothing -linear regression -logistic regression -Random Forest Validation questions are commonly solved using what model(s)? - Correct Answers โœ…-Cross Validation Variance Estimation questions are commonly solved using what model(s)? - Correct Answers โœ…-GARCH

(Solved Papers)

Examples of models that are designed for use with time series data - Correct Answers โœ…-ARIMA -CUSUM -Exponential Smoothing -GARCH In the soft classification SVM model where we select coefficients a_0 ... a_m to minimize sum(max(0, 1 - (sum(a_i * x_ij) + a_0 ) * y_j )

  • C * sum(a_i ^ 2) ). If we want to have a larger margin even though it means possibly having more classification error, the value of C should get: - Correct Answers โœ…Larger Best way to split data - Correct Answers โœ…-70% for training -15% for validation -15% for test Purpose of a test set - Correct Answers โœ…Estimate quality of selected model Purpose of a training set - Correct Answers โœ…Fit parameters of all models

(Solved Papers)

True or False: When using a random forest model, it's easy to interpret how its results are determined. - Correct Answers โœ…False Explanation: Unlike a model like regression where we can show the result as a simple linear combination of each attribute times its regression coefficient, in a random forest model there are so many different trees used simultaneously that it's difficult to interpret exactly how any factor or factors affect the result. Lesson

A common rule of thumb is to stop branching if a leaf would contain less than 5% of the data points. Why not keep branching and allow models to find very close fits to each very small subset of data? - Correct Answers โœ…Fitting to very small subsets of data will cause overfitting. Explanation: With too few data points, the models will fit to random patterns as well as real ones. Lesson 10.

(Solved Papers)

True or false: In a regression tree, every leaf of the tree has a different regression model that might use different attributes, have different coefficients, etc. - Correct Answers โœ…True. Explanation: Each leaf's individual model is tailored to the subset of data points that follow all of the branches leading to the leaf. Lesson 10. True or false: Tree-based approaches can be used for other models besides regression. - Correct Answers โœ…True. Explanation: For example, a classification tree might have a different SVM or KNN model at each leaf. It might even use SVM at some leaves and KNN at others (though that's probably rare). Lesson 10. What does "heteroscedasticity" mean? - Correct Answers โœ…The variance is different in different ranges of the data. Lesson 9. You might want to de-trend data before... - Correct Answers โœ…...using time-series data in a regression model.

(Solved Papers)

When would regression be used instead of a time series model? - Correct Answers โœ…When there are other factors or predictors that affect the response. Explanation: Regression helps show the relationships between factors and a response. Lesson 8. If two models are approximately equally good, measures like AIC and BIC will favor the simpler model. Simpler models are often better because... - Correct Answers โœ…1. Simple models are easier to explain and "sell" to managers and executives

  1. The effects observed in simple models are easier for everyone, including analytics professionals, to understand
  2. Simple models are less likely to be over-fit to random effects Explanation: Simpler models are less likely to be over-fit, easier to understand, and easier to explain. Lesson 8. Which of the following is not a common use of regression? - Correct Answers โœ…Prescriptive analytics: Determining the best course of action.

(Solved Papers)

Explanation: Regression is often good for describing and predicting, but is not as helpful for suggesting a course of action. Lesson 8. True or false: regression is a way to determine whether one thing causes another. - Correct Answers โœ…False. Explanation: Regression can show relationships between observations, but it doesn't show whether one thing causes another. Lesson 8. Suppose our regression model to estimate how tall a 2-year-old will be as an adult has the following coefficients: 0.56xFatherHeight + 0.51xMotherHeight - 0.02xFatherHeightxMotherHeight The negative sign on the coefficient of FatherHeightxMotherHeight means: - Correct Answers โœ…People with two taller-than-average parents won't be as tall as the individual effects of father's height and mother's height add up to Explanation: The negative coefficient for the interaction term brings down the overall estimate. Lesson 8.

(Solved Papers)

True or False: In the exponential smoothing equation ๐‘†๐‘ก=๐›ผ๐‘ฅ๐‘ก+(1โˆ’๐›ผ)๐‘†๐‘ก โˆ’1 only the current observation ๐‘ฅ๐‘ก is considered in calculating the estimate ๐‘†๐‘ก. - Correct Answers โœ…False. Explanation: Plugging in for ๐‘†๐‘ก โˆ’1 , and then for ๐‘†๐‘ก โˆ’2 , etc., shows that ๐‘†๐‘ก=๐›ผ๐‘ฅ๐‘ก+(1โˆ’๐›ผ)๐›ผ๐‘ฅ๐‘ก โˆ’1+(1 โˆ’๐›ผ)2๐›ผ๐‘ฅ๐‘ก โˆ’2+(1 โˆ’๐›ผ)3๐›ผ๐‘ฅ๐‘ก โˆ’3+... Lesson

Is exponential smoothing better for short-term forecasting or long-term forecasting? - Correct Answers โœ…Short-term Explanation: Exponential smoothing bases its forecast primarily on the most-recent data points. For forecasts of the longer-term future, there aren't data points close to the time being forecasted. Lesson 7. What does autoregression mean? - Correct Answers โœ…Previous values of the thing being estimated are used to calculate the estimate. Explanation: Its own previous values are used in the estimate. Lesson 7.

(Solved Papers)

Why is GARCH different from ARIMA and exponential smoothing? - Correct Answers โœ…GARCH estimates variance Explanation: ARIMA and exponential smoothing both estimate the value of an attribute; GARCH estimates the variance. Lesson 7. In the CUSUM model, having a higher threshold T makes it... - Correct Answers โœ…...detect changes slower, and less likely to falsely detect changes. Explanation: A higher threshold makes it slower to detect both true and false changes. Lesson 6. Why are hypothesis tests often not sufficient for change detection? - Correct Answers โœ…They are often slow to detect changes. Explanation: Hypothesis tests generally have high threshold levels, which makes them slow to detect changes. Lesson 6. Which of these is generally a good reason to remove an outlier from your data set?

(Solved Papers)

Explanation: Because you know the correct classification for hundreds of images, you can build a model to classify the rest (supervised learning). Lesson 4. The k-means algorithm for clustering is a "heuristic" because... - Correct Answers โœ…...it isn't guaranteed to get the best answer. Explanation: Heuristic algorithms are not guaranteed to find the best answer. Lesson 4. Straight-line distance โˆ‘๐‘›๐‘–=1(๐‘ฅ๐‘–โˆ’๐‘ฆ๐‘–)2โ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พโ€พ โˆš2 corresponds to which distance metric? - Correct Answers โœ…2-norm Explanation: The power and root are the same as the norm. Lesson 4. In k-fold cross-validation, how many times is each part of the data used for training, and for validation? - Correct Answers โœ…k- times for training, and 1 time for validation

(Solved Papers)

Explanation: Each of the k times the model is fit, a different part of the data is used for validation and the rest is used for training. Lesson 3. Which should we use most of the data for: training, validation, or test? - Correct Answers โœ…Training. Explanation: Most experts recommend using 50-70% of the data for training, and splitting the rest equally between validation and test. Lesson 3. When comparing models, if we use the same data to pick the best model as we do to estimate how good the best one is, what is likely to happen? - Correct Answers โœ…The model will appear to be better than it really is. Explanation: The model with the highest measured performance is likely to be both good and lucky in its fit to random patterns. Lesson 3. If we use the same data to fit a model as we do to estimate how good it is, what is likely to happen? - Correct Answers โœ…The model will appear to be better than it really is.

(Solved Papers)

Explanation: The multiplier for classification errors is 200 for data points 21-50, much more than 5 for data points 1-20. Lesson 2. Which of these two terms measures the error in classifying all of the data points? A. โˆ‘๐‘š๐‘–=1(๐‘Ž๐‘–) B. โˆ‘๐‘›๐‘—=1๐‘š๐‘Ž๐‘ฅ{0,1 โˆ’ โˆ‘( ๐‘š๐‘–=1๐‘Ž๐‘–๐‘ฅ๐‘–๐‘—+๐‘Ž0)๐‘ฆ๐‘—} - Correct Answers โœ…B. Explanation: This term measures classification error. Lesson 2. A survey of 25 people recorded each person's family size and type of car. Which of these is a data point? A. The 14th person's family size and car type B. The 14th person's family size C. The car type of each person - Correct Answers โœ…A. Explanation: A data point is all the information about one observation. Lesson 2.

(Solved Papers)

A survey of 25 people recorded each person's family size and type of car. Which of these is structured data? A. The contents of a person's Twitter feed B. The amount of money in a person's bank account - Correct Answers โœ…B. Explanation: Every entry will be a number of dollars and cents. Lesson 2. A survey of 25 people recorded each person's family size and type of car. Which of these is time series data? A. The average cost of a house in the United States every year since 1820. B. The height of each professional basketball player in the NBA at the start of the season - Correct Answers โœ…A. Explanation: The same thing is measured at yearly time intervals. Lesson 2.