You trained a model on data stored in a Cloud Storage bucket. The model needs to be retrained frequently in Vertex AI Training using the latest data in the bucket. Data preprocessing is required prior to retraining. You want to build a simple and efficient near-real-time ML pipeline in Vertex AI that will preprocess the data when new data arrives in the bucket. What should you do?
A Cloud Run function can be triggered when new data arrives in the bucket, which makes it well suited to near-real-time processing. The function then starts the Vertex AI pipeline, which preprocesses the data and stores the resulting features in Vertex AI Feature Store, ready for retraining. Cloud Scheduler (Option A) runs jobs on a fixed schedule rather than in response to events. Dataflow (Option C) is better suited to batch processing or ETL than to ML preprocessing pipelines.
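The event-driven flow described above could be sketched roughly as follows. This is a minimal illustration, not a reference implementation: the project ID, region, bucket paths, pipeline template, and the `input_uri` parameter name are all hypothetical placeholders, and it assumes a compiled pipeline definition already exists in Cloud Storage. When deployed, the entry point would also need to be registered as a CloudEvent handler (for example with the `functions_framework.cloud_event` decorator) and bound to the bucket's "object finalized" event.

```python
from typing import Any, Dict

# Hypothetical placeholders; replace with values from your own project.
PROJECT_ID = "my-project"
REGION = "us-central1"
PIPELINE_TEMPLATE = "gs://my-bucket/pipelines/preprocess_pipeline.json"
PIPELINE_ROOT = "gs://my-bucket/pipeline-root"


def build_pipeline_params(event_data: Dict[str, Any]) -> Dict[str, str]:
    """Map a Cloud Storage event payload to pipeline parameters.

    The payload of an 'object finalized' event carries the bucket and
    object name. 'input_uri' is an assumed parameter name for the pipeline.
    """
    bucket = event_data["bucket"]
    name = event_data["name"]
    return {"input_uri": f"gs://{bucket}/{name}"}


def on_new_data(cloud_event) -> None:
    """Entry point: submit the preprocessing pipeline for the new object."""
    # Imported lazily so the helper above works without the SDK installed.
    from google.cloud import aiplatform

    aiplatform.init(project=PROJECT_ID, location=REGION)
    job = aiplatform.PipelineJob(
        display_name="preprocess-on-arrival",
        template_path=PIPELINE_TEMPLATE,
        pipeline_root=PIPELINE_ROOT,
        parameter_values=build_pipeline_params(cloud_event.data),
    )
    # submit() starts the run and returns without waiting for completion,
    # so the function finishes quickly after each upload.
    job.submit()
```

Keeping the function limited to submitting the pipeline, with the preprocessing itself inside the pipeline, is what keeps this design simple: the function stays short-lived and the heavy work runs in Vertex AI.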