Google Professional Data Engineer Exam - Topic 5 Question 49 Discussion
You are creating a new pipeline in Google Cloud to stream IoT data from Cloud Pub/Sub through Cloud Dataflow to BigQuery. While previewing the data, you notice that roughly 2% of the data appears to be corrupt. You need to modify the Cloud Dataflow pipeline to filter out this corrupt data. What should you do?
B) Add a ParDo transform in Cloud Dataflow to discard corrupt elements.
A) Add a SideInput that returns a Boolean if the element is corrupt.
C) Add a Partition transform in Cloud Dataflow to separate valid data from corrupt data.
D) Add a GroupByKey transform in Cloud Dataflow to group all of the valid data together and discard the rest.
Marge
7 months agoStanford
7 months agoTaryn
7 months agoRory
7 months agoKimbery
7 months agoSon
7 months agoFiliberto
8 months agoJacqueline
8 months agoBrittni
8 months agoMirta
8 months agoMaybelle
8 months agoJerrod
8 months ago