Actual Dumps for Databricks Databricks Certified Data Engineer Associate Exam 2026

Question No: 1

MultipleChoice

A new data engineering team team has been assigned to an ELT project. The new data engineering team will need full privileges on the table sales to fully manage the project.

Which command can be used to grant full permissions on the database to the new data engineering team?

Options

Agrant all privileges on table sales TO team;

BGRANT SELECT ON TABLE sales TO team;

CGRANT SELECT CREATE MODIFY ON TABLE sales TO team;

DGRANT ALL PRIVILEGES ON TABLE team TO sales;

Question No: 2

MultipleChoice

A data engineer runs a statement every day to copy the previous day's sales into the table transactions. Each day's sales are in their own file in the location "/transactions/raw".

Today, the data engineer runs the following command to complete this task:

After running the command today, the data engineer notices that the number of records in table transactions has not changed.

Which option best describes why the statement might not have copied any new records into the table?

Options

AThe format of the files to be copied were not included with the FORMAT_OPTIONS keyword.

BThe names of the files to be copied were not included with the FILES keyword.

CThe previous day's file has already been copied into the table.

DThe PARQUET file format does not support COPY INTO.

EThe COPY INTO statement requires the table to be refreshed to view the copied rows.

Question No: 3

MultipleChoice

A data engineer wants to schedule their Databricks SQL dashboard to refresh every hour, but they only want the associated SQL endpoint to be running when It is necessary. The dashboard has multiple queries on multiple datasets associated with it. The data that feeds the dashboard is automatically processed using a Databricks Job.

Which approach can the data engineer use to minimize the total running time of the SQL endpoint used in the refresh schedule of their dashboard?

Options

AO They can reduce the cluster size of the SQL endpoint.

BQ They can turn on the Auto Stop feature for the SQL endpoint.

CO They can set up the dashboard's SQL endpoint to be serverless.

D0 They can ensure the dashboard's SQL endpoint matches each of the queries' SQL endpoints.

Question No: 4

MultipleChoice

A data engineering team has two tables. The first table march_transactions is a collection of all retail transactions in the month of March. The second table april_transactions is a collection of all retail transactions in the month of April. There are no duplicate records between the tables.

Which of the following commands should be run to create a new table all_transactions that contains all records from march_transactions and april_transactions without duplicate records?

Options

ACREATE TABLE all_transactions AS
SELECT * FROM march_transactions
INNER JOIN SELECT * FROM april_transactions;

BCREATE TABLE all_transactions AS
SELECT * FROM march_transactions
UNION SELECT * FROM april_transactions;

CCREATE TABLE all_transactions AS
SELECT * FROM march_transactions
OUTER JOIN SELECT * FROM april_transactions;

DCREATE TABLE all_transactions AS
SELECT * FROM march_transactions
INTERSECT SELECT * from april_transactions;

ECREATE TABLE all_transactions AS
SELECT * FROM march_transactions
MERGE SELECT * FROM april_transactions;

Question No: 5

MultipleChoice

A data analyst has developed a query that runs against Delta table. They want help from the data engineering team to implement a series of tests to ensure the data returned by the query is clean. However, the data engineering team uses Python for its tests rather than SQL.

Which of the following operations could the data engineering team use to run the query and operate with the results in PySpark?

Options

ASELECT * FROM sales

Bspark.delta.table

Cspark.sql

DThere is no way to share data between PySpark and SQL.

Espark.table

Question No: 6

MultipleChoice

A data engineering team has two tables. The first table march_transactions is a collection of all retail transactions in the month of March. The second table april_transactions is a collection of all retail transactions in the month of April. There are no duplicate records between the tables.

Which of the following commands should be run to create a new table all_transactions that contains all records from march_transactions and april_transactions without duplicate records?

A.