Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon Exam MLS-C01 Topic 4 Question 80 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 80
Topic #: 4
[All MLS-C01 Questions]

A Data Scientist is working on an application that performs sentiment analysis. The validation accuracy is poor and the Data Scientist thinks that the cause may be a rich vocabulary and a low average frequency of words in the dataset

Which tool should be used to improve the validation accuracy?

Show Suggested Answer Hide Answer
Suggested Answer: A

Contribute your Thoughts:

Jamal
5 days ago
I prefer using NLTK for stemming and stop word removal to improve accuracy.
upvoted 0 times
...
Chauncey
5 days ago
C) Natural Language Toolkit (NLTK) stemming and stop word removal seems like the best option to handle the issue of rich vocabulary and low average word frequency. Removing common words and reducing words to their base form can help improve the model's performance.
upvoted 0 times
...
Kris
8 days ago
I agree with Florinda, TF-IDF can help with the low frequency of words issue.
upvoted 0 times
...
Florinda
17 days ago
I think we should use Scikit-learn TF-IDF vectorizers.
upvoted 0 times
...

Save Cancel