Databricks Certified Associate Developer for Apache Spark 3.5 Exam - Topic 6 Question 12 Discussion

Question

Databricks Certified Associate Developer for Apache Spark 3.5 Exam - Topic 6 Question 12 Discussion

A data scientist wants each record in the DataFrame to contain:The first attempt at the code does read the text files but each record contains a single line. This code is shown below:The entire contents of a fileThe full file pathThe issue: reading line-by-line rather than full text per file.Code:corpus = spark.read.text("/datasets/raw_txt/*") \.select('*', '_metadata.file_path')Which change will ensure one record per file?Options:

B) Add the option lineSep='\n' to the text() function

C) Add the option wholetext=False to the text() function

D) Add the option lineSep=', ' to the text() function

Accepted Answer

A) Add the option wholetext=True to the text() function

Databricks Certified Associate Developer for Apache Spark 3.5 Exam - Topic 6 Question 12 Discussion

Databricks Certified Associate Developer for Apache Spark 3.5 Exam - Topic 6 Question 12 Discussion

Contribute your Thoughts:

Lina

Lawrence

Dorethea

Miesha

Halina

Tequila

Roselle