1 of 55. A data scientist wants to ingest a directory full of plain text files so that each record in the output DataFrame contains the entire contents of a single file and the full path of the file the text was read from.
The first attempt does read the text files, but each record contains only a single line of text. That code is shown below:
from pyspark.sql.functions import input_file_name

txt_path = "/datasets/raw_txt/*"
df = spark.read.text(txt_path)                      # one row per line by default
df = df.withColumn("file_path", input_file_name())  # add the full source path
Which code change produces a DataFrame that meets the data scientist's requirements?
By default, the spark.read.text() method reads a text file one line per record. This means that each line in a text file becomes one row in the resulting DataFrame.
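The difference between the two reading modes can be illustrated outside Spark with plain Python (a hypothetical sketch using temporary files, not Spark itself): reading line by line yields one record per line, while reading each file whole yields one record per file, which mirrors what wholetext=True does.

```python
import os
import tempfile

# Create two small sample files (hypothetical data for illustration).
tmp = tempfile.mkdtemp()
for name, body in [("a.txt", "line1\nline2"), ("b.txt", "only line")]:
    with open(os.path.join(tmp, name), "w") as f:
        f.write(body)

paths = sorted(os.path.join(tmp, n) for n in os.listdir(tmp))

# Default-style reading: one record per LINE across all files.
per_line = [line for p in paths for line in open(p).read().splitlines()]

# wholetext-style reading: one record per FILE, tagged with its path.
per_file = [(open(p).read(), p) for p in paths]

print(len(per_line))  # 3 records: two lines from a.txt, one from b.txt
print(len(per_file))  # 2 records: one per file
```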
To read each file as a single record, Apache Spark provides the wholetext option, which, when set to True, causes Spark to treat the entire contents of each file as one string, producing one row per file.
Correct usage:
df = spark.read.option('wholetext', True).text(txt_path)
This way, each record in the DataFrame will contain the full content of one file instead of one line per record.
To also include the file path, the function input_file_name() can be used to create an additional column that stores the complete path of the file being read:
from pyspark.sql.functions import input_file_name

df = (spark.read.option('wholetext', True).text(txt_path)
      .withColumn('file_path', input_file_name()))
This approach satisfies both requirements from the question:
Each record holds the entire contents of a file.
Each record also contains the file path from which the text was read.
Why the other options are incorrect:
B or D (lineSep) -- The lineSep option only defines the delimiter between lines. It does not combine the entire file content into a single record.
C (wholetext=False) -- This is the default behavior, which still reads one record per line rather than per file.
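The lineSep point can be made concrete with a plain-Python sketch (illustrative only, not Spark): changing the delimiter only changes where records are split, it never merges a file into one record.

```python
# A custom "line separator" of '||' still splits the text into multiple
# records -- it does not collapse everything into a single record.
text = "rec1||rec2||rec3"
records = text.split("||")  # analogous to setting lineSep='||'
print(len(records))         # 3 records, not 1
```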
Reference (Databricks Apache Spark 3.5 -- Python / Study Guide):
PySpark API Reference: DataFrameReader.text --- describes the wholetext option.
PySpark Functions: input_file_name() --- adds a column with the source file path.
Databricks Certified Associate Developer for Apache Spark Exam Guide (June 2025): Section ''Using Spark DataFrame APIs'' --- covers reading files and handling DataFrames.