Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Exam Professional Machine Learning Engineer Topic 4 Question 74 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam
Question #: 74
Topic #: 4
[All Professional Machine Learning Engineer Questions]

You are developing an ML model to predict house prices. While preparing the data, you discover that an important predictor variable, distance from the closest school, is often missing and does not have high variance. Every instance (row) in your data is important. How should you handle the missing data?

Show Suggested Answer Hide Answer
Suggested Answer: A

Sampled Shapley is a fast and scalable approximation of the Shapley value, which is a game-theoretic concept that measures the contribution of each feature to the model prediction. Sampled Shapley is suitable for online prediction requests, as it can return feature attributions with minimal latency. The path count parameter controls the number of samples used to estimate the Shapley value, and a lower value means faster computation. Integrated Gradients is another explanation method that computes the average gradient along the path from a baseline input to the actual input. Integrated Gradients is more accurate than Sampled Shapley, but also more computationally intensive. Therefore, it is not recommended for online prediction requests, especially with a high path count. Prediction drift is the change in the distribution of feature values or labels over time. It can affect the performance and accuracy of the model, and may require retraining or redeploying the model. Vertex AI Model Monitoring allows you to monitor prediction drift on your deployed models and endpoints, and set up alerts and notifications when the drift exceeds a certain threshold. You can specify an email address to receive the notifications, and use the information to retrigger the training pipeline and deploy an updated version of your model. This is the most direct and convenient way to achieve your goal. Training-serving skew is the difference between the data used for training the model and the data used for serving the model. It can also affect the performance and accuracy of the model, and may indicate data quality issues or model staleness. Vertex AI Model Monitoring allows you to monitor training-serving skew on your deployed models and endpoints, and set up alerts and notifications when the skew exceeds a certain threshold. However, this is not relevant to the question, as the question is about the feature attributions of the model, not the data distribution.Reference:

Vertex AI: Explanation methods

Vertex AI: Configuring explanations

Vertex AI: Monitoring prediction drift

Vertex AI: Monitoring training-serving skew


Contribute your Thoughts:

Lawanda
14 days ago
Ah, the classic 'replace with zeros' approach. It's like trying to fix a leaky faucet with duct tape - it just doesn't work!
upvoted 0 times
...
Vivan
17 days ago
Replacing missing values with zeros? That's about as useful as a chocolate teapot. We need a better solution here.
upvoted 0 times
Omega
3 days ago
Agreed, replacing missing values with zeros is not a good idea.
upvoted 0 times
...
...
Izetta
25 days ago
Predicting the missing values using linear regression? Hmm, I'm not sure that's a good idea if the variable doesn't have high variance.
upvoted 0 times
Hubert
3 days ago
B) Apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Vincenza
20 days ago
A) Delete the rows that have missing values.
upvoted 0 times
...
...
Whitley
1 months ago
Ooh, feature crossing! Sounds fancy. But won't that just mask the problem instead of actually solving it?
upvoted 0 times
Fernanda
9 days ago
Ooh, feature crossing! Sounds fancy. But won't that just mask the problem instead of actually solving it?
upvoted 0 times
...
Barabara
22 days ago
B) Apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Basilia
30 days ago
A) Delete the rows that have missing values.
upvoted 0 times
...
...
Ellen
2 months ago
Deleting rows with missing values? That's like throwing the baby out with the bathwater. We need to preserve every data point!
upvoted 0 times
Stephaine
14 days ago
User 2: How about predicting the missing values using linear regression?
upvoted 0 times
...
Percy
16 days ago
User 1: I agree, we can't just delete rows with missing values.
upvoted 0 times
...
...
Virgina
2 months ago
I think deleting the rows with missing values is the best option to ensure accurate predictions.
upvoted 0 times
...
Johna
2 months ago
I disagree, I believe we should apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Socorro
2 months ago
I think we should predict the missing values using linear regression.
upvoted 0 times
...

Save Cancel