Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam MLS-C01 Topic 2 Question 89 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 89
Topic #: 2
[All MLS-C01 Questions]

A data scientist is building a forecasting model for a retail company by using the most recent 5 years of sales records that are stored in a data warehouse. The dataset contains sales records for each of the company's stores across five commercial regions The data scientist creates a working dataset with StorelD. Region. Date, and Sales Amount as columns. The data scientist wants to analyze yearly average sales for each region. The scientist also wants to compare how each region performed compared to average sales across all commercial regions.

Which visualization will help the data scientist better understand the data trend?

Show Suggested Answer Hide Answer
Suggested Answer: D

The best visualization for this task is to create a bar plot, faceted by year, of average sales for each region and add a horizontal line in each facet to represent average sales. This way, the data scientist can easily compare the yearly average sales for each region with the overall average sales and see the trends over time. The bar plot also allows the data scientist to see the relative performance of each region within each year and across years. The other options are less effective because they either do not show the yearly trends, do not show the overall average sales, or do not group the data by region.

References:

pandas.DataFrame.groupby --- pandas 2.1.4 documentation

pandas.DataFrame.plot.bar --- pandas 2.1.4 documentation

Matplotlib - Bar Plot - Online Tutorials Library


Contribute your Thoughts:

Audra
8 hours ago
You know, I was leaning towards option C at first, but now I'm not so sure. Creating an aggregated dataset by region might be a bit too high-level. The data scientist might want to see the store-level data as well, to get a better sense of the variation within each region.
upvoted 0 times
...
Marjory
9 hours ago
I'm a little torn between options C and D, to be honest. I like the idea of the simple bar plot in option C, but I think the faceted layout in option D will give the data scientist a more comprehensive view of the data. Plus, adding that horizontal line to represent the overall average is a nice touch.
upvoted 0 times
...
Antonette
1 days ago
Haha, you know what they say - a picture is worth a thousand sales numbers! But in all seriousness, I think option D is the way to go. The data scientist will be able to see at a glance which regions are performing above or below the company average. Plus, the faceted layout will make it easy to spot any year-over-year changes.
upvoted 0 times
...
France
2 days ago
Ooh, that's an interesting idea! A line plot could definitely work too. Though I do think the bar plot options might be a bit more visually appealing, especially if they use some nice colors to differentiate the regions.
upvoted 0 times
...
Rima
2 days ago
I agree, option D is definitely the way to go. Having the faceted bar plot will make it easy to spot any trends or outliers in the regional sales data. Plus, adding that horizontal line to show the overall average is a nice touch that will really help the data scientist benchmark each region's performance.
upvoted 0 times
...
Maddie
3 days ago
Ha, I can just imagine the data scientist staring at a bunch of bar plots, trying to make sense of it all. Maybe they should just go with a nice, simple line plot? That way they can see the trends over time for each region more clearly.
upvoted 0 times
...
Xochitl
3 days ago
This is a great question! It really tests our understanding of data visualization techniques and how to effectively analyze sales data. I think option D is the best choice here. Creating a bar plot faceted by year, with average sales for each region and a horizontal line to represent the overall average, will give the data scientist a clear visual of how each region is performing compared to the company-wide average.
upvoted 0 times
...
Lucia
4 days ago
Hmm, I'm not sure. Option B also seems like it could work, with a bar plot colored by region and faceted by year. That might make it easier to compare each region's performance year-over-year. I'd have to think about the pros and cons of each approach.
upvoted 0 times
...
Eleni
6 days ago
I agree, this is a good question. I'm leaning towards option D. Creating a bar plot faceted by year, with each region's average sales, and adding a horizontal line to represent the overall average sales. That seems like it would give the data scientist a clear picture of how each region is performing compared to the average.
upvoted 0 times
...
Farrah
8 days ago
I think this is a great question that really tests our ability to analyze data and choose the right visualization. The data scientist wants to understand yearly average sales for each region and how each region compares to the overall average. I think the key is to choose a visualization that makes those insights clear and easy to interpret.
upvoted 0 times
...

Save Cancel