You are developing an ML model to predict house prices. While preparing the data, you discover that an important predictor variable, distance from the closest school, is often missing and does not have high variance. Every instance (row) in your data is important. How should you handle the missing data?
Sampled Shapley is a fast and scalable approximation of the Shapley value, which is a game-theoretic concept that measures the contribution of each feature to the model prediction. Sampled Shapley is suitable for online prediction requests, as it can return feature attributions with minimal latency. The path count parameter controls the number of samples used to estimate the Shapley value, and a lower value means faster computation. Integrated Gradients is another explanation method that computes the average gradient along the path from a baseline input to the actual input. Integrated Gradients is more accurate than Sampled Shapley, but also more computationally intensive. Therefore, it is not recommended for online prediction requests, especially with a high path count. Prediction drift is the change in the distribution of feature values or labels over time. It can affect the performance and accuracy of the model, and may require retraining or redeploying the model. Vertex AI Model Monitoring allows you to monitor prediction drift on your deployed models and endpoints, and set up alerts and notifications when the drift exceeds a certain threshold. You can specify an email address to receive the notifications, and use the information to retrigger the training pipeline and deploy an updated version of your model. This is the most direct and convenient way to achieve your goal. Training-serving skew is the difference between the data used for training the model and the data used for serving the model. It can also affect the performance and accuracy of the model, and may indicate data quality issues or model staleness. Vertex AI Model Monitoring allows you to monitor training-serving skew on your deployed models and endpoints, and set up alerts and notifications when the skew exceeds a certain threshold. However, this is not relevant to the question, as the question is about the feature attributions of the model, not the data distribution.Reference:
Vertex AI: Explanation methods
Vertex AI: Configuring explanations
Vertex AI: Monitoring prediction drift
Vertex AI: Monitoring training-serving skew
Sommer
3 months agoTamie
3 months agoMindy
4 months agoShonda
4 months agoTomas
4 months agoJulene
4 months agoSharen
4 months agoElinore
5 months agoMarta
5 months agoEstrella
5 months agoAnnalee
5 months agoClorinda
5 months agoChan
5 months agoDenna
5 months agoEsteban
5 months agoOrville
5 months agoCarlota
5 months agoNoah
5 months agoElliott
5 months agoLawanda
10 months agoVivan
10 months agoMari
9 months agoLorita
9 months agoHyun
9 months agoOmega
9 months agoIzetta
10 months agoSantos
9 months agoDelpha
9 months agoClorinda
9 months agoAntonette
9 months agoHubert
9 months agoVincenza
10 months agoWhitley
10 months agoDarrel
9 months agoFernanda
9 months agoBarabara
10 months agoBasilia
10 months agoEllen
11 months agoStephaine
10 months agoPercy
10 months agoVirgina
11 months agoJohna
11 months agoSocorro
11 months ago