Which query is performing a streaming hop from raw data to a Bronze table?
A)
B)
C)
D)
The query performing a streaming hop from raw data to a Bronze table is identified by using the Spark streaming read capability and then writing to a Bronze table. Let's analyze the options:
Option A: Utilizes .writeStream but performs a complete aggregation which is more characteristic of a roll-up into a summarized table rather than a hop into a Bronze table.
Option B: Also uses .writeStream but calculates an average, which again does not typically represent the raw to Bronze transformation, which usually involves minimal transformations.
Option C: This uses a basic .write with .mode('append') which is not a streaming operation, and hence not suitable for real-time streaming data transformation to a Bronze table.
Option D: It employs spark.readStream.load() to ingest raw data as a stream and then writes it out with .writeStream, which is a typical pattern for streaming data into a Bronze table where raw data is captured in real-time and minimal transformation is applied. This approach aligns with the concept of a Bronze table in a modern data architecture, where raw data is ingested continuously and stored in a more accessible format.
Reference: Databricks documentation on Structured Streaming: Structured Streaming in Databricks
Samira
4 months agoLatanya
4 months agoKristel
4 months agoYun
4 months agoTina
4 months agoKaran
3 months agoGabriele
3 months agoCarylon
3 months agoBrittani
4 months agoLai
4 months agoVirgie
4 months agoElouise
4 months agoYuki
4 months agoTyra
4 months agoLaurel
3 months agoFranchesca
4 months agoFlorinda
4 months agoErasmo
4 months ago