New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon MLS-C01 Exam - Topic 3 Question 110 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 110
Topic #: 3
[All MLS-C01 Questions]

A company stores its documents in Amazon S3 with no predefined product categories. A data scientist needs to build a machine learning model to categorize the documents for all the company's products.

Which solution will meet these requirements with the MOST operational efficiency?

Show Suggested Answer Hide Answer
Suggested Answer: C

Amazon SageMaker's Neural Topic Model (NTM) is designed to uncover underlying topics within text data by clustering documents based on topic similarity. For document categorization, NTM can identify product categories by analyzing and grouping the documents, making it an efficient choice for unsupervised learning where predefined categories do not exist.


Contribute your Thoughts:

0/2000 characters
Temeka
3 months ago
I vote for C, neural models are the future of categorization!
upvoted 0 times
...
Linwood
3 months ago
A custom clustering model sounds complicated, is it really worth it?
upvoted 0 times
...
Wei
3 months ago
Wait, can NTM really handle such diverse documents? Sounds risky!
upvoted 0 times
...
Polly
4 months ago
I disagree, I think D would work better for text-heavy data.
upvoted 0 times
...
Lenny
4 months ago
Option B seems the most efficient for categorizing documents quickly.
upvoted 0 times
...
Bernardine
4 months ago
I vaguely remember the Blazing Text model being mentioned as a fast option for text classification, but I’m unsure how it compares to the NTM.
upvoted 0 times
...
Melissa
4 months ago
I feel like the Neural Topic Model could be a good fit since it’s designed for document categorization, but I’m not entirely confident about its operational efficiency compared to the others.
upvoted 0 times
...
Carey
4 months ago
I think option B sounds familiar; we practiced k-means clustering in our last session, but I can't recall if it was specifically for text data.
upvoted 0 times
...
Solange
5 months ago
I remember we discussed clustering models in class, but I'm not sure if building a custom one is the most efficient way to categorize documents.
upvoted 0 times
...
Alayna
5 months ago
I'm leaning towards option A or C. The custom clustering model or the Neural Topic Model both seem like they could handle the unstructured document data well. I'd want to dig into the details of each approach a bit more to decide.
upvoted 0 times
...
Shayne
5 months ago
The Blazing Text model in option D could be a good fit too. I'm familiar with that from some previous work, and it's designed specifically for text classification tasks.
upvoted 0 times
...
Luis
5 months ago
The Neural Topic Model in option C looks interesting. I've heard good things about that approach for text categorization. Might be worth exploring that further.
upvoted 0 times
...
Taryn
5 months ago
I'm a bit unsure about the k-means model in option B. Tokenizing the data and transforming it to tabular format seems like it could be tricky. Not sure if that's the most efficient approach.
upvoted 0 times
...
Elvera
5 months ago
I think the custom clustering model in option A sounds like the most robust and flexible solution. Building a Docker image and using SageMaker gives us a lot of control and customization.
upvoted 0 times
...
Rodrigo
1 year ago
Wait, we can't use the 'most' efficient solution? I thought that was the whole point of the question. *scratches head*
upvoted 0 times
Arletta
12 months ago
B: B) Tokenize the data and transform the data into tabular data. Train an Amazon SageMaker k-means model to generate the product categories.
upvoted 0 times
...
Bernardo
12 months ago
A: A) Build a custom clustering model. Create a Dockerfile and build a Docker image. Register the Docker image in Amazon Elastic Container Registry (Amazon ECR). Use the custom image in Amazon SageMaker to generate a trained model.
upvoted 0 times
...
...
Rosamond
1 year ago
I'm not sure, option C also seems like a good choice with the Neural Topic Model. It could provide accurate product categories.
upvoted 0 times
...
Silvana
1 year ago
I agree with Mariann. Building a custom clustering model and using Docker image in Amazon SageMaker sounds efficient.
upvoted 0 times
...
Launa
1 year ago
Option C with the Neural Topic Model could be interesting, but I'd want to understand the trade-offs compared to k-means.
upvoted 0 times
Cecil
1 year ago
C: Maybe we should do a deeper dive into both options before making a decision.
upvoted 0 times
...
Hoa
1 year ago
B: I agree, but we should definitely consider the trade-offs compared to k-means.
upvoted 0 times
...
Jerry
1 year ago
A: Option C with the Neural Topic Model could be interesting.
upvoted 0 times
...
...
Chun
1 year ago
Option D with Blazing Text seems intriguing, but I'm not sure it's the 'most' operationally efficient choice here.
upvoted 0 times
...
Eun
1 year ago
I'm not a fan of the custom clustering model in Option A. Too much overhead for this use case.
upvoted 0 times
...
Wilbert
1 year ago
Option B looks like the way to go. Tokenizing and using SageMaker's k-means is a straightforward solution.
upvoted 0 times
Willard
1 year ago
I think it will save a lot of time and effort compared to building a custom clustering model.
upvoted 0 times
...
Jeannetta
1 year ago
It's definitely a practical approach to categorize the documents efficiently.
upvoted 0 times
...
Vincenza
1 year ago
I agree, using k-means for clustering is a good choice for this task.
upvoted 0 times
...
Catherin
1 year ago
Option B looks like the way to go. Tokenizing and using SageMaker's k-means is a straightforward solution.
upvoted 0 times
...
...
Mariann
1 year ago
I think option A is the best choice for operational efficiency.
upvoted 0 times
...

Save Cancel