Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon MLS-C01 Exam - Topic 2 Question 130 Discussion

[Modeling]A Machine Learning Specialist is working with a large cybersecurily company that manages security events in real time for companies around the world The cybersecurity company wants to design a solution that will allow it to use machine learning to score malicious events as anomalies on the data as it is being ingested The company also wants be able to save the results in its data lake for later processing and analysisWhat is the MOST efficient way to accomplish these tasks'?
A) Ingest the data using Amazon Kinesis Data Firehose, and use Amazon Kinesis Data Analytics Random Cut Forest (RCF) for anomaly detection Then use Kinesis Data Firehose to stream the results to Amazon S3
B) Ingest the data into Apache Spark Streaming using Amazon EMR. and use Spark MLlib with k-means to perform anomaly detection Then store the results in an Apache Hadoop Distributed File System (HDFS) using Amazon EMR with a replication factor of three as the data lake
C) Ingest the data and store it in Amazon S3 Use AWS Batch along with the AWS Deep Learning AMIs to train a k-means model using TensorFlow on the data in Amazon S3.
D) Ingest the data and store it in Amazon S3. Have an AWS Glue job that is triggered on demand transform the new data Then use the built-in Random Cut Forest (RCF) model within Amazon SageMaker to detect anomalies in the data

Amazon MLS-C01 Exam - Topic 2 Question 130 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 130
Topic #: 2
[All MLS-C01 Questions]

[Modeling]

A Machine Learning Specialist is working with a large cybersecurily company that manages security events in real time for companies around the world The cybersecurity company wants to design a solution that will allow it to use machine learning to score malicious events as anomalies on the data as it is being ingested The company also wants be able to save the results in its data lake for later processing and analysis

What is the MOST efficient way to accomplish these tasks'?

Show Suggested Answer Hide Answer
Suggested Answer: A

Contribute your Thoughts:

0/2000 characters
Phillip
1 month ago
I feel like option D might be the most comprehensive since it involves AWS Glue and SageMaker, but I can't recall if that setup is the most efficient.
upvoted 0 times
...
Sharika
1 month ago
I remember practicing with Spark Streaming and HDFS, so option B seems appealing, but I’m not confident if k-means is the best choice for anomaly detection.
upvoted 0 times
...
Hester
1 month ago
I think option A sounds familiar because we discussed Kinesis Data Firehose in class, but I'm not entirely sure about the Random Cut Forest part.
upvoted 0 times
...

Save Cancel