MultipleChoice
26 of 55. A data scientist at an e-commerce company is working with user data obtained from its subscriber database and has stored the data in a DataFrame df_user.
Before further processing, the data scientist wants to create another DataFrame df_user_non_pii and store only the non-PII columns. The PII columns in df_user are name, email, and birthdate.
Which code snippet can be used to meet this requirement?
A.
df_user_non_pii = df_user.drop("name", "email", "birthdate")
B.
df_user_non_pii = df_user.dropFields("name", "email", "birthdate")
C.
df_user_non_pii = df_user.select("name", "email", "birthdate")
D.
df_user_non_pii = df_user.remove("name", "email", "birthdate")
OptionsMultipleChoice
What is the benefit of using Pandas on Spark for data transformations?
Options:
Options