Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Exam Professional Machine Learning Engineer Topic 1 Question 98 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam
Question #: 98
Topic #: 1
[All Professional Machine Learning Engineer Questions]

You trained a model on data stored in a Cloud Storage bucket. The model needs to be retrained frequently in Vertex AI Training using the latest data in the bucket. Data preprocessing is required prior to retraining. You want to build a simple and efficient near-real-time ML pipeline in Vertex AI that will preprocess the data when new data arrives in the bucket. What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: B

Cloud Run can be triggered on new data arrivals, which makes it ideal for near-real-time processing. The function then initiates the Vertex AI Pipeline for preprocessing and storing features in Vertex AI Feature Store, aligning with the retraining needs. Cloud Scheduler (Option A) is suitable for scheduled jobs, not event-driven triggers. Dataflow (Option C) is better suited for batch processing or ETL rather than ML preprocessing pipelines.


Contribute your Thoughts:

Tula
2 days ago
I agree with Jackie. Storing the processed features in Vertex AI Feature Store seems like a good idea for efficiency.
upvoted 0 times
...
Nieves
4 days ago
Option B looks like the most efficient solution. Triggering a pipeline when new data arrives in the bucket is a great way to keep the model up-to-date in near-real-time.
upvoted 0 times
...
Jackie
5 days ago
I think option A is the best choice. It allows us to create a pipeline using Vertex AI SDK and schedule it with Cloud Scheduler.
upvoted 0 times
...

Save Cancel