U.S. Independence Day Deal! Unlock 25% OFF Today – Limited-Time Offer - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Associate Data Practitioner Exam - Topic 2 Question 25 Discussion

Your company is building a near real-time streaming pipeline to process JSON telemetry data from small appliances. You need to process messages arriving at a Pub/Sub topic, capitalize letters in the serial number field, and write results to BigQuery. You want to use a managed service and write a minimal amount of code for underlying transformations. What should you do?
C) Use the ''Pub/Sub to BigQuery'' Dataflow template with a UDF, and write the results to BigQuery.
A) Use a Pub/Sub to BigQuery subscription, write results directly to BigQuery, and schedule a transformation query to run every five minutes.
B) Use a Pub/Sub to Cloud Storage subscription, write a Cloud Run service that is triggered when objects arrive in the bucket, performs the transformations, and writes the results to BigQuery.
D) Use a Pub/Sub push subscription, write a Cloud Run service that accepts the messages, performs the transformations, and writes the results to BigQuery.

Google Associate Data Practitioner Exam - Topic 2 Question 25 Discussion

Actual exam question for Google's Associate Data Practitioner exam
Question #: 25
Topic #: 2
[All Associate Data Practitioner Questions]

Your company is building a near real-time streaming pipeline to process JSON telemetry data from small appliances. You need to process messages arriving at a Pub/Sub topic, capitalize letters in the serial number field, and write results to BigQuery. You want to use a managed service and write a minimal amount of code for underlying transformations. What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: C

Using the 'Pub/Sub to BigQuery' Dataflow template with a UDF (User-Defined Function) is the optimal choice because it combines near real-time processing, minimal code for transformations, and scalability. The UDF allows for efficient implementation of custom transformations, such as capitalizing letters in the serial number field, while Dataflow handles the rest of the managed pipeline seamlessly.


Contribute your Thoughts:

0/2000 characters
Brittani
1 month ago
I think option C sounds familiar since we practiced using Dataflow templates in class, but I'm not entirely sure about the UDF part.
upvoted 0 times
...

Save Cancel