A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The micro-batches are triggered every minute.
A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available in the gold-level tables.
Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?
The caution to share is cost. A Structured Streaming pipeline that populates gold-level tables every minute already needs compute running around the clock for ingestion, processing, and writing. Refreshing the dashboard within one minute of new data adds to this, because the compute that serves the dashboard queries must also stay up almost continuously instead of shutting down between refreshes. The resulting bill can be significant, especially when data volume and velocity are high. The data analyst should therefore raise this with the project stakeholders before setting up the dashboard, so that they can weigh the desired refresh rate against the available budget. The other options are not valid cautions because:
B) The gold-level tables are assumed to be appropriately clean for business reporting, as they are the final output of the data engineering pipeline. If the data quality is not satisfactory, the issue should be addressed at the source or silver level, not at the gold level.
C) The streaming data is an appropriate data source for a dashboard, as it can provide near real-time insights and analytics for the business users. Structured Streaming supports various sources and sinks for streaming data, including Delta Lake, which can enable both batch and streaming queries on the same data.
D) The streaming cluster is fault tolerant, as Structured Streaming provides end-to-end exactly-once fault-tolerance guarantees through checkpointing and write-ahead logs. If a query fails, it can be restarted from the last checkpoint and resume processing.
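To make the cost caution concrete, here is a rough back-of-the-envelope sketch. It is not a Databricks pricing formula. The function names, the 30-second query time, the 10-minute idle shutdown, and the rate of 4.0 cost units per hour are all hypothetical values chosen for illustration. It assumes a simplified model in which the query compute stays up while a refresh runs, then idles for a fixed timeout before stopping unless the next refresh arrives first.

```python
def billed_minutes_per_day(refresh_interval_min, query_min, auto_stop_min):
    """Estimate the minutes per day that dashboard query compute stays running.

    Simplified, hypothetical model: every refresh keeps the compute up for
    the query duration plus the idle timeout, capped at the gap until the
    next scheduled refresh.
    """
    minutes_per_day = 24 * 60
    refreshes = minutes_per_day // refresh_interval_min
    # Uptime caused by one refresh cannot exceed the time until the next one.
    uptime_per_refresh = min(refresh_interval_min, query_min + auto_stop_min)
    return refreshes * uptime_per_refresh


def daily_cost(refresh_interval_min, query_min, auto_stop_min, rate_per_hour):
    """Convert billed minutes into a daily cost at a hypothetical hourly rate."""
    minutes = billed_minutes_per_day(refresh_interval_min, query_min, auto_stop_min)
    return minutes / 60 * rate_per_hour


if __name__ == "__main__":
    # All inputs below are illustrative assumptions, not measured values.
    for interval in (1, 60):
        cost = daily_cost(interval, query_min=0.5, auto_stop_min=10, rate_per_hour=4.0)
        print(f"refresh every {interval} min: about {cost:.1f} cost units per day")
```

Under these assumptions a one-minute refresh never lets the compute stop, so it is billed for all 1,440 minutes of the day, while an hourly refresh is billed for roughly a sixth of that. Real numbers depend on the actual query time, idle timeout, and pricing.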