Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam MLS-C01 Topic 4 Question 101 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 101
Topic #: 4
[All MLS-C01 Questions]

A company ingests machine learning (ML) data from web advertising clicks into an Amazon S3 data lake. Click data is added to an Amazon Kinesis data stream by using the Kinesis Producer Library (KPL). The data is loaded into the S3 data lake from the data stream by using an Amazon Kinesis Data Firehose delivery stream. As the data volume increases, an ML specialist notices that the rate of data ingested into Amazon S3 is relatively constant. There also is an increasing backlog of data for Kinesis Data Streams and Kinesis Data Firehose to ingest.

Which next step is MOST likely to improve the data ingestion rate into Amazon S3?

Show Suggested Answer Hide Answer
Suggested Answer: D

The best visualization for this task is to create a bar plot, faceted by year, of average sales for each region and add a horizontal line in each facet to represent average sales. This way, the data scientist can easily compare the yearly average sales for each region with the overall average sales and see the trends over time. The bar plot also allows the data scientist to see the relative performance of each region within each year and across years. The other options are less effective because they either do not show the yearly trends, do not show the overall average sales, or do not group the data by region.

References:

pandas.DataFrame.groupby --- pandas 2.1.4 documentation

pandas.DataFrame.plot.bar --- pandas 2.1.4 documentation

Matplotlib - Bar Plot - Online Tutorials Library


Contribute your Thoughts:

Stefany
5 days ago
I'm not sure about decreasing the retention period - that might cause us to lose valuable data. Increasing the number of prefixes or adding more consumers could be better options.
upvoted 0 times
...
Melita
7 days ago
Hmm, increasing the number of shards for the data stream seems like the most logical choice here. More shards should help distribute the load and improve the ingestion rate.
upvoted 0 times
...
Matthew
8 days ago
I'm not sure about that. Maybe adding more consumers using the Kinesis Client Library could also help speed up the process.
upvoted 0 times
...
Izetta
9 days ago
I agree with Pearlie. More shards would allow for parallel processing and faster data ingestion.
upvoted 0 times
...
Pearlie
13 days ago
I think increasing the number of shards for the data stream could help improve the ingestion rate into Amazon S3.
upvoted 0 times
...

Save Cancel