You are building a new application that you need to collect data from in a scalable way. Data arrives continuously from the application throughout the day, and you expect to generate approximately 150 GB of JSON data per day by the end of the year. Your requirements are:
Decoupling producer from consumer
Space and cost-efficient storage of the raw ingested data, which is to be stored indefinitely
Near real-time SQL query
Maintain at least 2 years of historical data, which will be queried with SQ
Which pipeline should you use to meet these requirements?
Cruz
4 months agoShayne
4 months agoFiliberto
4 months agoDelisa
4 months agoLavonna
5 months agoGermaine
5 months agoCasie
5 months agoLenna
5 months agoGail
5 months agoBilly
5 months agoPatria
5 months agoMitsue
5 months agoSharen
5 months agoStaci
5 months ago