Question 44 of 55. A data engineer is working on a real-time analytics pipeline using Spark Structured Streaming. They want the system to process incoming data in micro-batches at a fixed interval of 5 seconds.
Which code snippet fulfills this requirement?
A.
query = df.writeStream \
.outputMode("append") \
.trigger(processingTime="5 seconds") \
.start()
B.
query = df.writeStream \
.outputMode("append") \
.trigger(continuous="5 seconds") \
.start()
C.
query = df.writeStream \
.outputMode("append") \
.trigger(once=True) \
.start()
D.
query = df.writeStream \
.outputMode("append") \
.start()
To process data in fixed micro-batch intervals, use the .trigger(processingTime='interval') option in Structured Streaming.
Correct usage:
query = df.writeStream \
    .outputMode("append") \
    .trigger(processingTime="5 seconds") \
    .start()
This instructs Spark to start a new micro-batch every 5 seconds, processing whatever data has arrived since the previous batch.
Why the other options are incorrect:
B: continuous="5 seconds" selects continuous processing mode, a different low-latency execution model; the interval there is a checkpoint interval, not a micro-batch interval.
C: once=True processes all available data in a single micro-batch and then stops, so it behaves like a one-off batch job rather than a recurring stream.
D: With no trigger specified, the default kicks off a new micro-batch as soon as the previous one finishes, not on a fixed schedule.
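Conceptually, a processing-time trigger fires micro-batches on a fixed wall-clock schedule: wait until the next trigger point, process what has arrived, then advance the schedule by the interval. A minimal Python sketch of that pacing loop (a toy stand-in for illustration only, not Spark internals; the name run_microbatches is invented here):

```python
import time

def run_microbatches(batches, interval_s):
    """Toy pacing loop mimicking trigger(processingTime=...).

    Starts one "micro-batch" every interval_s seconds; if a batch
    finishes early, we sleep until the next trigger point.
    """
    results = []
    next_fire = time.monotonic()
    for batch in batches:
        delay = next_fire - time.monotonic()
        if delay > 0:
            time.sleep(delay)        # wait for the next trigger point
        results.append(sum(batch))   # stand-in for processing the batch
        next_fire += interval_s      # fixed-interval schedule
    return results
```

Spark's scheduler does the equivalent bookkeeping for you: with trigger(processingTime="5 seconds"), each micro-batch is planned at 5-second boundaries regardless of how quickly the previous one completed.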
References: PySpark Structured Streaming Programming Guide, "Triggers" (processingTime, once, continuous); Databricks Exam Guide (June 2025), section "Structured Streaming" (controlling streaming triggers and batch intervals).