You are developing an image recognition model using PyTorch based on ResNet50 architecture. Your code is working fine on your local laptop on a small subsample. Your full dataset has 200k labeled images You want to quickly scale your training workload while minimizing cost. You plan to use 4 V100 GPUs. What should you do? (Choose Correct Answer and Give Reference and Explanation)
Traffic splitting is a feature of Vertex AI that allows you to distribute the prediction requests among multiple models or model versions within the same endpoint. You can specify the percentage of traffic that each model or model version receives, and change it at any time. Traffic splitting can help you test the new model in production without creating a new endpoint or a separate service. You can deploy the new model to the existing Vertex AI endpoint, and use traffic splitting to send 5% of production traffic to the new model. You can monitor the end-user metrics, such as listening time, to compare the performance of the new model and the previous model. If the end-user metrics improve between models over time, you can gradually increase the percentage of production traffic sent to the new model. This solution can help you test the new model in production while minimizing complexity and cost.Reference:
Deploying models to endpoints | Vertex AI
Beatriz
5 months agoKallie
5 months agoAileen
6 months agoShanda
6 months agoMinna
6 months agoHershel
6 months agoJody
7 months agoDaren
7 months agoLeonard
7 months agoKenneth
7 months agoArdella
7 months agoMona
8 months agoBlair
8 months agoNorah
1 year agoHyun
12 months agoNicholle
1 year agoRonny
1 year agoRhea
1 year agoCarmelina
12 months agoEva
1 year agoJessenia
1 year agoLashandra
1 year agoErnestine
1 year agoTheodora
11 months agoMelvin
12 months agoBev
12 months agoAsha
1 year agoLauran
1 year agoLayla
1 year agoArlette
1 year agoJesusita
1 year agoPaul
1 year agoRobt
1 year ago