Welcome to Pass4Success

Google Professional Data Engineer Exam - Topic 4 Question 13 Discussion

Actual exam question for Google's Professional Data Engineer exam
Question #: 13
Topic #: 4

You work for an advertising company, and you've developed a Spark ML model to predict click-through rates at advertisement blocks. You've been developing everything at your on-premises data center, and now your company is migrating to Google Cloud. Your data will be migrated to BigQuery. You periodically retrain your Spark ML models, so you need to migrate existing training pipelines to Google Cloud. What should you do?

Suggested Answer: A

Contribute your Thoughts:

Rosendo
4 months ago
Can we really trust BigQuery for all our data needs?
upvoted 0 times
...
Elenora
4 months ago
Definitely go with option C, it makes the most sense for Spark.
upvoted 0 times
...
Vicente
4 months ago
Surprised that Cloud ML Engine isn't the go-to choice for this.
upvoted 0 times
...
Clare
4 months ago
I disagree, rewriting everything in TensorFlow seems like a lot of work!
upvoted 0 times
...
Georgeanna
5 months ago
I think using Cloud Dataproc is the best option here.
upvoted 0 times
...
Tonja
5 months ago
I have a feeling option A might not be the right fit since Cloud ML Engine is more for TensorFlow models. I think we need to stick with Spark for our existing models.
upvoted 0 times
...
Annmarie
5 months ago
I remember practicing a question about migrating models, and I feel like option D could be a good choice since it allows us to keep using Spark. But isn't there a cost concern with spinning up a cluster?
upvoted 0 times
...
Refugia
5 months ago
I think option C sounds familiar since we discussed using Cloud Dataproc for Spark jobs in class. But I'm not entirely sure if reading directly from BigQuery is the best approach.
upvoted 0 times
...
Alonso
5 months ago
I’m a bit confused about option B. I know TensorFlow is powerful, but rewriting everything seems like a lot of work. Would it really be worth it?
upvoted 0 times
...
Filiberto
5 months ago
Hmm, this is a tricky one. I think the key is figuring out what's causing the issue with the minutes option not being visible to the user. Maybe it has something to do with the permissions or the license type.
upvoted 0 times
...
Alba
5 months ago
This seems like a straightforward question about the UiPath Robotic Enterprise Framework. I'm pretty confident I can figure this out.
upvoted 0 times
...
Marica
5 months ago
I practiced a similar question on Control Plane Policing last week. If I recall correctly, the right configuration typically specifies the class-map correctly.
upvoted 0 times
...
Yong
5 months ago
This looks like a data classification matrix, so I'll go with option C.
upvoted 0 times
...
