A Data Engineer is building a simple data pipeline using Lakeflow Declarative Pipelines (LDP) in Databricks to ingest customer data. The raw customer data is stored in a cloud storage location in JSON format. The task is to create a declarative pipeline that reads the raw JSON data and writes it into a Delta table for further processing.
Which code snippet correctly ingests the raw JSON data and creates a Delta table using LDP?
A.
import dlt
@dlt.table
def raw_customers():
    return spark.read.format("csv").load("s3://my-bucket/raw-customers/")
B.
import dlt
@dlt.table
def raw_customers():
    return spark.read.json("s3://my-bucket/raw-customers/")
C.
import dlt
@dlt.table
def raw_customers():
    return spark.read.format("parquet").load("s3://my-bucket/raw-customers/")
D.
import dlt
@dlt.view
def raw_customers():
    return spark.format.json("s3://my-bucket/raw-customers/")
The correct answer is B. In Lakeflow Declarative Pipelines (LDP), a table is defined with the @dlt.table decorator, which persists the function's output as a managed Delta table. For ingesting raw JSON data, spark.read.json() or the equivalent spark.read.format("json").load() is the standard approach: it reads the JSON files from the source location, and LDP writes the result to Delta format automatically. Option A reads the files as CSV and option C reads them as Parquet, so neither matches the JSON source. Option D uses @dlt.view, which does not persist a Delta table, and calls spark.format.json(), which is not a valid Spark API.
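As background, spark.read.json() by default expects newline-delimited JSON (one object per line; the multiLine option handles pretty-printed files). Since the dlt snippets above only run inside a Databricks pipeline, here is a minimal pure-Python sketch of that JSON Lines layout, using a hypothetical read_json_lines helper:

```python
import json
import tempfile

def read_json_lines(path):
    """Parse a newline-delimited JSON (JSON Lines) file, the default
    layout spark.read.json expects, into a list of dicts."""
    records = []
    with open(path) as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records

# Demo: two customer records, one JSON object per line
with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as f:
    f.write('{"id": 1, "name": "Ada"}\n{"id": 2, "name": "Grace"}\n')
    demo_path = f.name

print(read_json_lines(demo_path))
```

Spark applies the same per-line parsing in parallel across files and infers a schema from the parsed records.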
Reference Source: Databricks Lakeflow Declarative Pipelines Developer Guide -- ''Create tables from raw JSON and Delta sources.''