New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Professional Data Engineer Exam - Topic 5 Question 49 Discussion

Actual exam question for Google's Professional Data Engineer exam
Question #: 49
Topic #: 5
[All Professional Data Engineer Questions]

You are creating a new pipeline in Google Cloud to stream IoT data from Cloud Pub/Sub through Cloud Dataflow to BigQuery. While previewing the data, you notice that roughly 2% of the data appears to be corrupt. You need to modify the Cloud Dataflow pipeline to filter out this corrupt dat

a. What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: B

Contribute your Thoughts:

0/2000 characters
Marge
3 months ago
Definitely B! It's the most efficient way to handle this.
upvoted 0 times
...
Stanford
3 months ago
A won't really help with filtering, just checking.
upvoted 0 times
...
Taryn
3 months ago
Wait, why not just fix the corrupt data instead of filtering it out?
upvoted 0 times
...
Rory
4 months ago
I think C could work too, but B seems more straightforward.
upvoted 0 times
...
Kimbery
4 months ago
B is the best option to filter out corrupt data.
upvoted 0 times
...
Son
4 months ago
Partitioning sounds like it could work, but I feel like it might be overkill for just filtering out 2% of corrupt data.
upvoted 0 times
...
Filiberto
4 months ago
I practiced a similar question where we had to clean data in a pipeline. I think using a ParDo transform was the solution there too.
upvoted 0 times
...
Jacqueline
4 months ago
I'm not entirely sure, but I think a SideInput might complicate things. We just need to filter out the bad data directly.
upvoted 0 times
...
Brittni
5 months ago
I remember something about using ParDo for filtering data in Dataflow. It seems like the right approach to discard corrupt elements.
upvoted 0 times
...
Mirta
5 months ago
I agree, the Process Builder seems like the way to go here. I'd create a single Process that checks the criteria and then takes the appropriate actions based on the results.
upvoted 0 times
...
Maybelle
5 months ago
Hmm, I'm a bit confused on this one. I'll have to think it through carefully.
upvoted 0 times
...
Jerrod
5 months ago
Okay, I think I've got this. Based on the question, the Azure Sentinel Contributor role seems like the best fit. It allows the security analyst to edit the queries of custom workbooks, which is exactly what the requirement is asking for. And it follows the principle of least privilege since it's a more specific role than something like Security Administrator.
upvoted 0 times
...

Save Cancel