New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Professional Data Engineer Exam - Topic 1 Question 54 Discussion

Actual exam question for Google's Professional Data Engineer exam
Question #: 54
Topic #: 1
[All Professional Data Engineer Questions]

You are building a data pipeline on Google Cloud. You need to prepare data using a casual method for a

machine-learning process. You want to support a logistic regression model. You also need to monitor and

adjust for null values, which must remain real-valued and cannot be removed. What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: C

Contribute your Thoughts:

0/2000 characters
Frederica
4 months ago
Option C is interesting, but I prefer Dataflow for handling nulls.
upvoted 0 times
...
James
4 months ago
I agree with Ludivina, option B is definitely the way to go!
upvoted 0 times
...
Nan
4 months ago
Wait, can you really convert nulls to 'none' in a logistic regression model? Seems off.
upvoted 0 times
...
Ludivina
4 months ago
I think option B is the best choice! Converting nulls to 0 is a common practice.
upvoted 0 times
...
Fletcher
4 months ago
Option A sounds good, but converting nulls to 'none' might not work for real-valued data.
upvoted 0 times
...
Emmett
5 months ago
I vaguely remember that we shouldn't convert nulls to 'none' or 0 for logistic regression. I think we should keep them as real values, but I can't remember the exact method.
upvoted 0 times
...
Jeffrey
5 months ago
I feel like using Cloud Dataflow might be more appropriate for this scenario, but I'm uncertain about whether a custom script is necessary for handling nulls.
upvoted 0 times
...
Melda
5 months ago
I think we practiced a similar question where we used Cloud Dataprep to handle nulls, but I can't recall if converting them to 0 was the best option.
upvoted 0 times
...
Tanja
5 months ago
I remember we discussed how null values should be handled carefully, but I'm not sure if converting them to 'none' is the right approach for a logistic regression model.
upvoted 0 times
...
Glennis
5 months ago
Hmm, I'm not sure about this one. Bridging the vNIC sounds like the right approach, but I want to double-check that in my notes.
upvoted 0 times
...
Berry
5 months ago
Okay, let me see here. I believe the key is to run an automatic schedule request to generate the counts first, before entering any unscheduled entries.
upvoted 0 times
...
Graham
5 months ago
I think the key here is understanding the typical challenges faced by this specific customer group. Option A sounds like a safe bet, but I want to make sure I'm not missing something more specific to their situation. I'll read through the options again carefully.
upvoted 0 times
...

Save Cancel