Google Professional Data Engineer Exam - Topic 1 Question 37 Discussion

Actual exam question for Google's Professional Data Engineer exam
Question #: 37
Topic #: 1

You are migrating your data warehouse to Google Cloud and decommissioning your on-premises data center. Because this is a priority for your company, you know that bandwidth will be made available for the initial data load to the cloud. The files being transferred are not large in number, but each file is 90 GB. Additionally, you want your transactional systems to continually update the warehouse on Google Cloud in real time. What tools should you use to migrate the data and ensure that it continues to write to your warehouse?

Suggested Answer: C
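
The question stresses that bandwidth will be available and that each file is 90 GB, so a rough transfer-time estimate helps show why an online transfer is plausible for the initial load. The sketch below is only back-of-the-envelope arithmetic: the link speeds and the 80% efficiency factor are assumptions for illustration, not figures from the question, and `transfer_hours` is a hypothetical helper.

```python
def transfer_hours(file_gb: float, link_gbps: float, efficiency: float = 0.8) -> float:
    """Estimate hours to move one file over a network link.

    file_gb    -- file size in decimal gigabytes (90 in the question)
    link_gbps  -- nominal link speed in gigabits per second (assumed)
    efficiency -- fraction of the link actually usable (assumed 0.8)
    """
    if link_gbps <= 0 or not 0 < efficiency <= 1:
        raise ValueError("link_gbps must be > 0 and efficiency in (0, 1]")
    bits = file_gb * 8e9                       # gigabytes -> bits
    seconds = bits / (link_gbps * 1e9 * efficiency)
    return seconds / 3600


if __name__ == "__main__":
    # Assumed link speeds, purely illustrative.
    for gbps in (0.1, 1, 10):
        print(f"{gbps:>5} Gbps: {transfer_hours(90, gbps):.2f} h per 90 GB file")
```

Under these assumptions, one 90 GB file takes about 15 minutes on a 1 Gbps link, so a small number of such files is well within reach of a network transfer. The estimate says nothing about the real-time update half of the question.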

Contribute your Thoughts:

Benedict
4 months ago
A is solid, but I’m surprised people are still considering gsutil for this!
upvoted 0 times
...
Fletcher
4 months ago
Wait, can Pub/Sub really handle that much data in real-time?
upvoted 0 times
...
Dorothy
4 months ago
gsutil for both? Seems a bit outdated for real-time updates.
upvoted 0 times
...
Charisse
4 months ago
I disagree, B seems more efficient with BigQuery.
upvoted 0 times
...
Artie
5 months ago
I think option A is the best choice for migration and updates.
upvoted 0 times
...
Sharmaine
5 months ago
I vaguely remember that BigQuery Data Transfer Service could be useful for migrations, but I’m not sure how it fits with the real-time aspect.
upvoted 0 times
...
Dortha
5 months ago
I feel like gsutil is more for simple file transfers, and I don't think it handles real-time updates well, but I'm not completely confident about that.
upvoted 0 times
...
Basilia
5 months ago
I remember we discussed using Storage Transfer Service for large file migrations in class, but I'm not entirely sure if it's the best option here.
upvoted 0 times
...
Hollis
5 months ago
I think we practiced a similar question where Pub/Sub was mentioned for real-time updates, but I can't recall if it was paired with Dataflow or something else.
upvoted 0 times
...
Dorothy
5 months ago
Okay, I've got this. RDDs are partitioned, fault-tolerant, and efficient for large-scale data processing. I'm pretty sure those are the key characteristics, so I'll select those options.
upvoted 0 times
...
Annabelle
5 months ago
Hmm, I'm not sure. Orchestrator might be a good option too, since it can automate the deployment process and integrate with various cloud providers.
upvoted 0 times
...
Janey
5 months ago
I remember studying about preventive care services, but I'm not sure if health club memberships are covered.
upvoted 0 times
...
Darell
5 months ago
Okay, let me think this through. I know "faithful representation" means the information accurately reflects the real-world economic phenomena. So I just need to determine which of these is not part of that.
upvoted 0 times
...
