Google Professional Data Engineer Exam - Topic 1 Question 84 Discussion

Actual exam question for Google's Professional Data Engineer exam
Question #: 84
Topic #: 1
[All Professional Data Engineer Questions]

You work for a large real estate firm and are preparing 6 TB of home sales data to be used for machine learning. You will use SQL to transform the data and use BigQuery ML to create a machine learning model. You plan to use the model for predictions against a raw dataset that has not been transformed. How should you set up your workflow in order to prevent skew at prediction time?

Suggested Answer: A

See https://cloud.google.com/bigquery-ml/docs/bigqueryml-transform. Using the TRANSFORM clause, you can specify all preprocessing during model creation. The preprocessing is then automatically applied during the prediction and evaluation phases of machine learning.
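As a rough sketch of the suggested answer (the dataset, table, column, and model names below are made up for illustration), the TRANSFORM clause bakes the preprocessing into the model itself, so predictions on raw, untransformed rows pick up the same transformations automatically:

```sql
-- Hypothetical dataset/column names; illustrates the TRANSFORM clause pattern.
-- Preprocessing declared here is stored with the model.
CREATE OR REPLACE MODEL `mydataset.home_price_model`
  TRANSFORM(
    ML.STANDARD_SCALER(square_feet) OVER () AS square_feet_scaled,
    ML.QUANTILE_BUCKETIZE(year_built, 4) OVER () AS year_built_bucket,
    label
  )
  OPTIONS (model_type = 'linear_reg', input_label_cols = ['label'])
AS
SELECT square_feet, year_built, sale_price AS label
FROM `mydataset.home_sales`;

-- At prediction time, pass the raw columns with no manual preprocessing;
-- BigQuery ML reapplies the stored TRANSFORM, preventing training/serving skew.
SELECT *
FROM ML.PREDICT(
  MODEL `mydataset.home_price_model`,
  (SELECT square_feet, year_built FROM `mydataset.new_listings`));
```

This is why option A avoids skew: because the transformations live inside the model, ML.PREDICT and ML.EVALUATE need no transformations specified on the raw input.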


Contribute your Thoughts:

Janna
3 months ago
D is solid too, but Dataflow might be overkill for this.
upvoted 0 times
...
Dortha
3 months ago
C seems a bit off, why not just preprocess everything upfront?
upvoted 0 times
...
Sylvie
3 months ago
Wait, can you really skip transformations at prediction time? Sounds risky.
upvoted 0 times
...
Myrtie
4 months ago
Definitely agree with B, saves a lot of hassle later!
upvoted 0 times
...
Carlee
4 months ago
I think option B makes the most sense for consistent preprocessing.
upvoted 0 times
...
Lenna
4 months ago
I feel like option D could be a good choice since Dataflow is powerful for preprocessing, but I’m unsure if it aligns with the requirement to avoid transformations at prediction time.
upvoted 0 times
...
Barb
4 months ago
I vaguely recall that using a saved query for transformations could help maintain consistency, but I'm not entirely confident about how that fits with the ML.EVALUATE clause.
upvoted 0 times
...
Kanisha
4 months ago
I think we practiced a similar question where we had to decide between transforming data before or after model training. I feel like option B makes sense because it emphasizes transforming the raw data before predictions.
upvoted 0 times
...
Tuyet
5 months ago
I remember we discussed the importance of preprocessing in our last class, but I'm not sure if using the TRANSFORM clause is the best approach for predictions.
upvoted 0 times
...
Alesia
5 months ago
Definitely don't want to skip the preprocessing steps at prediction time. That would definitely lead to skew. I'd go with option A or C to make sure the transformations are applied consistently.
upvoted 0 times
...
Marleen
5 months ago
I'm a bit confused about the different options here. I'm not sure if I should use the TRANSFORM clause, a saved query, or a view to handle the preprocessing.
upvoted 0 times
...
Leatha
5 months ago
Okay, let me see if I've got this right. We need to make sure the preprocessing steps used to create the model are the same as what's applied to the raw input data at prediction time, right?
upvoted 0 times
...
Katie
5 months ago
Hmm, this is a tricky one. I'll need to think carefully about how to set up the workflow to avoid skew at prediction time.
upvoted 0 times
...
Bea
5 months ago
I think option B might be the way to go - using the TRANSFORM clause to define the preprocessing steps, and then applying the same transformations to the raw input data before making predictions. That way, we can ensure consistency.
upvoted 0 times
...
Janine
5 months ago
Okay, I've got this. The three methods to filter Microsoft 365 roadmap items are Cloud instance, Region, and Licensing type. I'm feeling good about this one.
upvoted 0 times
...
Willis
5 months ago
This is a tricky one. I'm not sure if I fully understand the differences between the WebLogic Web Services versions or the resource adapter class file support. I'll need to think this through step-by-step.
upvoted 0 times
...
Henriette
5 months ago
Based on my understanding, the Diagnostic and Tuning Packs are advanced tools that require a higher-level Oracle license. So the answer must be C, Oracle Enterprise Edition.
upvoted 0 times
...
Tonette
2 years ago
Whoa, hold up, this question's got me feeling like a real estate mogul! I'm gonna go with Option B and keep my data transformations consistent. Gotta stay on top of that skew, am I right?
upvoted 0 times
...
Teddy
2 years ago
Option A looks like the real estate agent's choice - let BigQuery do all the heavy lifting! But hey, if it works, it works, right?
upvoted 0 times
Ozell
2 years ago
C) Use a BigQuery view to define your preprocessing logic. When creating your model, use the view as your model training data. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any transformations on the raw input data.
upvoted 0 times
...
Michell
2 years ago
B) When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. Before requesting predictions, use a saved query to transform your raw input data, and then use ML.EVALUATE.
upvoted 0 times
...
Omega
2 years ago
A) When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any transformations on the raw input data.
upvoted 0 times
...
...
Tonette
2 years ago
Option D, for sure. Preprocessing everything in Dataflow and then letting BigQuery handle the predictions? That's the kind of workflow that keeps things clean and streamlined.
upvoted 0 times
Bette
2 years ago
I agree, using Dataflow for preprocessing and BigQuery for predictions seems like a solid workflow.
upvoted 0 times
...
Alyssa
2 years ago
Option D sounds like the best approach. Dataflow can handle the preprocessing efficiently.
upvoted 0 times
...
Twila
2 years ago
D) Preprocess all data using Dataflow. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any further transformations on the input data.
upvoted 0 times
...
...
Hannah
2 years ago
I'm all about Option C. Using a view for the model training data and then just evaluating the raw input at prediction time? Now that's what I call efficiency.
upvoted 0 times
Kassandra
2 years ago
Definitely, it simplifies the process and reduces the risk of skew at prediction time.
upvoted 0 times
...
Vanna
2 years ago
I agree, it seems like a more efficient workflow. Just evaluate the raw input at prediction time.
upvoted 0 times
...
Ozell
2 years ago
Option C sounds like the way to go. Using a view for training data is a smart move.
upvoted 0 times
...
...
Janella
2 years ago
Option B is the way to go, my dude. Gotta make sure that the preprocessing steps are the same for both training and prediction to avoid that pesky skew.
upvoted 0 times
Derrick
2 years ago
B) Option B sounds solid. Consistency in preprocessing is key to avoiding skew in predictions.
upvoted 0 times
...
Paz
2 years ago
A) When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. Before requesting predictions, use a saved query to transform your raw input data, and then use ML.EVALUATE.
upvoted 0 times
...
Annalee
2 years ago
B) Option B is definitely the best choice. Consistency in preprocessing steps is key to avoiding skew in predictions.
upvoted 0 times
...
Annabelle
2 years ago
Yeah, you're right. Consistency in preprocessing is key to accurate predictions.
upvoted 0 times
...
Glynda
2 years ago
A) When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. Before requesting predictions, use a saved query to transform your raw input data, and then use ML.EVALUATE.
upvoted 0 times
...
Denae
2 years ago
Option B is the way to go, my dude. Gotta make sure that the preprocessing steps are the same for both training and prediction to avoid that pesky skew.
upvoted 0 times
...
...
