Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam MLS-C01 Topic 5 Question 74 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 74
Topic #: 5
[All MLS-C01 Questions]

A finance company has collected stock return data for 5.000 publicly traded companies. A financial analyst has a dataset that contains 2.000 attributes for each company. The financial analyst wants to use Amazon SageMaker to identify the top 15 attributes that are most valuable to predict future stock returns.

Which solution will meet these requirements with the LEAST operational overhead?

Show Suggested Answer Hide Answer

Contribute your Thoughts:

Loren
18 days ago
Ah, the age-old question: linear or random forest? It's like choosing between a cup of coffee or a Red Bull - both will get the job done, just depends how much of a caffeine kick you want.
upvoted 0 times
Fallon
2 days ago
Linear regression is simpler and easier to interpret, but random forest may capture more complex relationships in the data.
upvoted 0 times
...
...
Hector
23 days ago
Options A and B are like the finance equivalent of rock-paper-scissors. I'd go with whichever one makes the most intuitive sense to me.
upvoted 0 times
...
Nieves
24 days ago
Option D with Amazon SageMaker Autopilot sounds like the easiest solution, but I'm curious to see how the Clarify report compares to the other methods. Might be worth exploring if I'm feeling lazy.
upvoted 0 times
Corinne
1 days ago
Option D with Amazon SageMaker Autopilot is definitely the way to go. It's the easiest and most efficient method.
upvoted 0 times
...
...
Erin
27 days ago
Option C looks interesting, but I'm not familiar with the quick model visualization in SageMaker. I'd need to do some research to see how reliable the feature importance scores are for this use case.
upvoted 0 times
Margart
21 days ago
Option A seems like a solid choice. Using linear regression to rank coefficient values could help identify the top attributes.
upvoted 0 times
...
...
Avery
1 months ago
I'm leaning towards Option B. Random forests can handle nonlinear relationships and the Gini importance scores should give me a good sense of the top predictive features.
upvoted 0 times
Stevie
9 days ago
Gini importance scores can definitely help identify the top predictive features.
upvoted 0 times
...
Lucille
14 days ago
Option B sounds like a good choice. Random forests are great for handling nonlinear relationships.
upvoted 0 times
...
...
Holley
1 months ago
Option A seems like a good starting point. Linear regression is a classic approach for this type of problem, and ranking the coefficient values is a straightforward way to identify the most important features.
upvoted 0 times
...
Corrie
2 months ago
That's a valid point, Eun. But I still think the automation and detailed report from Autopilot in option D would save time and effort.
upvoted 0 times
...
Eun
2 months ago
I disagree, I believe option A is the way to go. Using linear regression and ranking coefficient values seems like a straightforward approach.
upvoted 0 times
...
Corrie
2 months ago
I think option D is the best choice because Autopilot can automate the process and provide a detailed report on the most predictive features.
upvoted 0 times
...

Save Cancel