Independence Day Deal! Unlock 25% OFF Today – Limited-Time Offer - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam MLS-C01 Topic 4 Question 101 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 101
Topic #: 4
[All MLS-C01 Questions]

A company ingests machine learning (ML) data from web advertising clicks into an Amazon S3 data lake. Click data is added to an Amazon Kinesis data stream by using the Kinesis Producer Library (KPL). The data is loaded into the S3 data lake from the data stream by using an Amazon Kinesis Data Firehose delivery stream. As the data volume increases, an ML specialist notices that the rate of data ingested into Amazon S3 is relatively constant. There also is an increasing backlog of data for Kinesis Data Streams and Kinesis Data Firehose to ingest.

Which next step is MOST likely to improve the data ingestion rate into Amazon S3?

Show Suggested Answer Hide Answer
Suggested Answer: D

The best visualization for this task is to create a bar plot, faceted by year, of average sales for each region and add a horizontal line in each facet to represent average sales. This way, the data scientist can easily compare the yearly average sales for each region with the overall average sales and see the trends over time. The bar plot also allows the data scientist to see the relative performance of each region within each year and across years. The other options are less effective because they either do not show the yearly trends, do not show the overall average sales, or do not group the data by region.

References:

pandas.DataFrame.groupby --- pandas 2.1.4 documentation

pandas.DataFrame.plot.bar --- pandas 2.1.4 documentation

Matplotlib - Bar Plot - Online Tutorials Library


Contribute your Thoughts:

Eulah
2 months ago
Wait, wait, wait... Did someone say 'backlog'? That's like a data traffic jam! We need to get those bits moving, pronto. Crank up those shards, my dudes!
upvoted 0 times
Carylon
10 days ago
C: Let's do it, more shards it is!
upvoted 0 times
...
Novella
1 months ago
B: Agreed, that should help clear up the backlog and improve the data ingestion rate.
upvoted 0 times
...
Elden
1 months ago
A: Yeah, we definitely need to increase the number of shards for the data stream.
upvoted 0 times
...
...
Johanna
2 months ago
Aha, I see what they're getting at. Increasing the number of shards is the way to go. It's like turbocharging your data pipeline - more horsepower to handle that growing backlog!
upvoted 0 times
Shawana
15 days ago
D: Definitely, more shards means more capacity to handle the data.
upvoted 0 times
...
Ressie
19 days ago
C: I agree, it's like giving a boost to the system.
upvoted 0 times
...
Shonda
1 months ago
B: Yeah, that makes sense. It will help with the data ingestion rate.
upvoted 0 times
...
Ira
1 months ago
A: I think increasing the number of shards is the best option.
upvoted 0 times
...
...
Jennifer
2 months ago
Whoa, hold up! Adding more consumers using the Kinesis Client Library? That's a bold move, my friend. I'd be a little worried about the complexity and overhead that could bring.
upvoted 0 times
Vernell
25 days ago
B: Yeah, that could help distribute the workload better and improve the ingestion rate.
upvoted 0 times
...
Amber
27 days ago
A: I think increasing the number of shards for the data stream might be a better option.
upvoted 0 times
...
...
Stefany
2 months ago
I'm not sure about decreasing the retention period - that might cause us to lose valuable data. Increasing the number of prefixes or adding more consumers could be better options.
upvoted 0 times
...
Melita
2 months ago
Hmm, increasing the number of shards for the data stream seems like the most logical choice here. More shards should help distribute the load and improve the ingestion rate.
upvoted 0 times
Lisha
20 days ago
D: Decreasing the retention period for the data stream might also help with the backlog issue.
upvoted 0 times
...
Andrew
23 days ago
C: I think increasing the number of S3 prefixes for the delivery stream could also improve the ingestion rate.
upvoted 0 times
...
Tandra
25 days ago
B: But wouldn't adding more consumers using the Kinesis Client Library also help distribute the workload?
upvoted 0 times
...
Filiberto
1 months ago
A: I agree, increasing the number of shards for the data stream should help with the data ingestion rate.
upvoted 0 times
...
...
Matthew
2 months ago
I'm not sure about that. Maybe adding more consumers using the Kinesis Client Library could also help speed up the process.
upvoted 0 times
...
Izetta
2 months ago
I agree with Pearlie. More shards would allow for parallel processing and faster data ingestion.
upvoted 0 times
...
Pearlie
2 months ago
I think increasing the number of shards for the data stream could help improve the ingestion rate into Amazon S3.
upvoted 0 times
...

Save Cancel