New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Professional Machine Learning Engineer Exam - Topic 4 Question 74 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam
Question #: 74
Topic #: 4
[All Professional Machine Learning Engineer Questions]

You are developing an ML model to predict house prices. While preparing the data, you discover that an important predictor variable, distance from the closest school, is often missing and does not have high variance. Every instance (row) in your data is important. How should you handle the missing data?

Show Suggested Answer Hide Answer
Suggested Answer: A

Sampled Shapley is a fast and scalable approximation of the Shapley value, which is a game-theoretic concept that measures the contribution of each feature to the model prediction. Sampled Shapley is suitable for online prediction requests, as it can return feature attributions with minimal latency. The path count parameter controls the number of samples used to estimate the Shapley value, and a lower value means faster computation. Integrated Gradients is another explanation method that computes the average gradient along the path from a baseline input to the actual input. Integrated Gradients is more accurate than Sampled Shapley, but also more computationally intensive. Therefore, it is not recommended for online prediction requests, especially with a high path count. Prediction drift is the change in the distribution of feature values or labels over time. It can affect the performance and accuracy of the model, and may require retraining or redeploying the model. Vertex AI Model Monitoring allows you to monitor prediction drift on your deployed models and endpoints, and set up alerts and notifications when the drift exceeds a certain threshold. You can specify an email address to receive the notifications, and use the information to retrigger the training pipeline and deploy an updated version of your model. This is the most direct and convenient way to achieve your goal. Training-serving skew is the difference between the data used for training the model and the data used for serving the model. It can also affect the performance and accuracy of the model, and may indicate data quality issues or model staleness. Vertex AI Model Monitoring allows you to monitor training-serving skew on your deployed models and endpoints, and set up alerts and notifications when the skew exceeds a certain threshold. However, this is not relevant to the question, as the question is about the feature attributions of the model, not the data distribution.Reference:

Vertex AI: Explanation methods

Vertex AI: Configuring explanations

Vertex AI: Monitoring prediction drift

Vertex AI: Monitoring training-serving skew


Contribute your Thoughts:

0/2000 characters
Sommer
3 months ago
Missing values should definitely be predicted, not just tossed out.
upvoted 0 times
...
Tamie
3 months ago
Wait, replacing missing values with zeros? That sounds risky!
upvoted 0 times
...
Mindy
4 months ago
Feature crossing could work too, but not sure if it's the best.
upvoted 0 times
...
Shonda
4 months ago
Disagree, deleting rows is too drastic!
upvoted 0 times
...
Tomas
4 months ago
I think option C is the best choice. Predicting missing values makes sense.
upvoted 0 times
...
Julene
4 months ago
Replacing missing values with zeros seems risky; I feel like that could misrepresent the data, but I can't recall the exact implications.
upvoted 0 times
...
Sharen
4 months ago
Predicting the missing values with linear regression sounds familiar, but I wonder if that would introduce bias since the variance is low.
upvoted 0 times
...
Elinore
5 months ago
I think applying feature crossing could be useful, but I’m not entirely confident if it’s the right method for handling missing values specifically.
upvoted 0 times
...
Marta
5 months ago
I remember we discussed that deleting rows with missing values can lead to losing important data, so I’m not sure that’s the best approach.
upvoted 0 times
...
Estrella
5 months ago
Feature crossing could be an interesting idea, but I'm not sure how that would work with a missing variable. I'll have to look into that more.
upvoted 0 times
...
Annalee
5 months ago
I'm leaning towards predicting the missing values using linear regression. That way, I can keep all the data and still have a complete feature set.
upvoted 0 times
...
Clorinda
5 months ago
Deleting the rows with missing values seems like the easiest option, but since every instance is important, that might not be the best approach.
upvoted 0 times
...
Chan
5 months ago
Hmm, this is a tricky one. I'll need to think carefully about how to handle the missing data without losing important information.
upvoted 0 times
...
Denna
5 months ago
Replacing the missing values with zeros seems like it could introduce some bias. I'd want to explore other imputation methods to see what works best for this dataset.
upvoted 0 times
...
Esteban
5 months ago
Hmm, the wording is a bit tricky here. I'll need to carefully read through each statement and think about which ones shouldn't be true.
upvoted 0 times
...
Orville
5 months ago
Okay, I've got this. Microservices are all about modularity and decoupling, right? I'm pretty sure I can identify the true statements here.
upvoted 0 times
...
Carlota
5 months ago
I'm pretty sure the answer is IPSec, since it's a common protocol used for secure data transfers at the network layer.
upvoted 0 times
...
Noah
5 months ago
I think option B makes the most sense, estimating future guest counts sounds crucial for planning.
upvoted 0 times
...
Elliott
5 months ago
I'm a little unsure about this one. The options seem to cover different aspects of branch office needs. I'll have to think it through and try to identify the most typical requirement.
upvoted 0 times
...
Lawanda
10 months ago
Ah, the classic 'replace with zeros' approach. It's like trying to fix a leaky faucet with duct tape - it just doesn't work!
upvoted 0 times
...
Vivan
10 months ago
Replacing missing values with zeros? That's about as useful as a chocolate teapot. We need a better solution here.
upvoted 0 times
Mari
9 months ago
In that case, applying feature crossing with another column might be a good alternative.
upvoted 0 times
...
Lorita
9 months ago
But what if the missing values are not predictable?
upvoted 0 times
...
Hyun
9 months ago
I think predicting the missing values using linear regression could be a better approach.
upvoted 0 times
...
Omega
9 months ago
Agreed, replacing missing values with zeros is not a good idea.
upvoted 0 times
...
...
Izetta
10 months ago
Predicting the missing values using linear regression? Hmm, I'm not sure that's a good idea if the variable doesn't have high variance.
upvoted 0 times
Santos
9 months ago
User1: Exactly, it might provide more accurate predictions than just deleting the rows or replacing the missing values with zeros.
upvoted 0 times
...
Delpha
9 months ago
User2: User1, that's a good suggestion. It could help capture the relationship between the missing variable and the other column.
upvoted 0 times
...
Clorinda
9 months ago
User1: Maybe you could try applying feature crossing with another column that does not have missing values.
upvoted 0 times
...
Antonette
9 months ago
C) Predict the missing values using linear regression.
upvoted 0 times
...
Hubert
9 months ago
B) Apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Vincenza
10 months ago
A) Delete the rows that have missing values.
upvoted 0 times
...
...
Whitley
10 months ago
Ooh, feature crossing! Sounds fancy. But won't that just mask the problem instead of actually solving it?
upvoted 0 times
Darrel
9 months ago
C) Predict the missing values using linear regression.
upvoted 0 times
...
Fernanda
9 months ago
Ooh, feature crossing! Sounds fancy. But won't that just mask the problem instead of actually solving it?
upvoted 0 times
...
Barabara
10 months ago
B) Apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Basilia
10 months ago
A) Delete the rows that have missing values.
upvoted 0 times
...
...
Ellen
11 months ago
Deleting rows with missing values? That's like throwing the baby out with the bathwater. We need to preserve every data point!
upvoted 0 times
Stephaine
10 months ago
User 2: How about predicting the missing values using linear regression?
upvoted 0 times
...
Percy
10 months ago
User 1: I agree, we can't just delete rows with missing values.
upvoted 0 times
...
...
Virgina
11 months ago
I think deleting the rows with missing values is the best option to ensure accurate predictions.
upvoted 0 times
...
Johna
11 months ago
I disagree, I believe we should apply feature crossing with another column that does not have missing values.
upvoted 0 times
...
Socorro
11 months ago
I think we should predict the missing values using linear regression.
upvoted 0 times
...

Save Cancel