Which of the following describes a neural network without an activation function?
A classifier has been implemented to predict whether or not someone has a specific type of disease. Considering that only 1% of the population in the dataset has this disease, which measures will work the BEST to evaluate this model?
Which two of the following criteria are essential for machine learning models to achieve before deployment? (Select two.)
Which of the following describes a benefit of machine learning for solving business problems?
Which of the following is the definition of accuracy?
What is the open framework designed to help detect, respond to, and remediate threats in ML systems?
Which of the following describes a typical use case of video tracking?
For each of the last 10 years, your team has been collecting data from a group of subjects, including their age and numerous biomarkers collected from blood samples. You are tasked with creating a prediction model of age using the biomarkers as input. You start by performing a linear regression using all of the data over the 10-year period, with age as the dependent variable and the biomarkers as predictors.
Which assumption of linear regression is being violated?
Your dependent variable data is a proportion. The observed range of your data is 0.01 to 0.99. The instrument used to generate the dependent variable data is known to generate low quality data for values close to 0 and close to 1. A colleague suggests performing a logit-transformation on the data prior to performing a linear regression. Which of the following is a concern with this approach?
Definition of logit-transformation
If p is the proportion: logit(p)=log(p/(l-p))
A dataset can contain a range of values that depict a certain characteristic, such as grades on tests in a class during the semester. A specific student has so far received the following grades: 76,81, 78, 87, 75, and 72. There is one final test in the semester. What minimum grade would the student need to achieve on the last test to get an 80% average?
Your dependent variable Y is a count, ranging from 0 to infinity. Because Y is approximately log-normally distributed, you decide to log-transform the data prior to performing a linear regression.
What should you do before log-transforming Y?
Which of the following is a common negative side effect of not using regularization?
An organization sells house security cameras and has asked their data scientists to implement a model to detect human feces, as distinguished from animals, so they can alert th customers only when a human gets close to their house.
Which of the following algorithms is an appropriate option with a correct reason?
Which of the following sentences is true about model evaluation and model validation in ML pipelines?
Which of the following are true about the transform-design pattern for a machine learning pipeline? (Select three.)
It aims to separate inputs from features.
Which two encodes can be used to transform categories data into numerical features? (Select two.)
In general, models that perform their tasks:
Word Embedding describes a task in natural language processing (NLP) where:
Normalization is the transformation of features:
Why do data skews happen in the ML pipeline?
Which of the following best describes distributed artificial intelligence?
A healthcare company experiences a cyberattack, where the hackers were able to reverse-engineer a dataset to break confidentiality.
Which of the following is TRUE regarding the dataset parameters?
Which database is designed to better anticipate and avoid risks of AI systems causing safety, fairness, or other ethical problems?
Which of the following is the correct definition of the quality criteria that describes completeness?
Which two techniques are used to build personas in the ML development lifecycle? (Select two.)
You are building a prediction model to develop a tool that can diagnose a particular disease so that individuals with the disease can receive treatment. The treatment is cheap and has no side effects. Patients with the disease who don't receive treatment have a high risk of mortality.
It is of primary importance that your diagnostic tool has which of the following?
You are developing a prediction model. Your team indicates they need an algorithm that is fast and requires low memory and low processing power. Assuming the following algorithms have similar accuracy on your data, which is most likely to be an ideal choice for the job?