New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Amazon BDS-C00 Exam - Topic 6 Question 98 Discussion

Actual exam question for Amazon's BDS-C00 exam
Question #: 98
Topic #: 6
[All BDS-C00 Questions]

A social media customer has data from different data sources including RDS running MySQL, RedShift, and Hive on EMR. To support better analysis, the customer needs to be able to analyze data from different data sources and to combine the results.

What is the most cost-effective solution to meet these requirements?

Show Suggested Answer Hide Answer
Suggested Answer: C

Contribute your Thoughts:

0/2000 characters
William
3 months ago
I like option C for its visualization capabilities!
upvoted 0 times
...
Dorethea
3 months ago
Option D seems too complicated for what they need.
upvoted 0 times
...
Sheron
3 months ago
Wait, can Presto really handle all those connections efficiently?
upvoted 0 times
...
Huey
4 months ago
I disagree, I think option A is more straightforward.
upvoted 0 times
...
Angelo
4 months ago
Option B sounds like a solid choice for combining data sources.
upvoted 0 times
...
Yvonne
4 months ago
Writing a program on EC2 to aggregate results sounds like a lot of work, and I’m not sure if it’s efficient compared to the other solutions.
upvoted 0 times
...
Cathrine
4 months ago
I think spinning up an Elasticsearch cluster could be overkill for this scenario, especially since we have cheaper options available.
upvoted 0 times
...
Belen
4 months ago
The Redshift COPY command seems familiar, but I feel like loading everything into S3 first might add unnecessary costs.
upvoted 0 times
...
Dulce
5 months ago
I remember we discussed using Presto for querying multiple data sources in one go, but I'm not entirely sure if it's the most cost-effective option.
upvoted 0 times
...
Renea
5 months ago
Option A with loading all the data to S3 and then using Redshift seems like a good approach, but I'm worried about the data transfer costs. I'll need to do some calculations to see if that's really the most cost-effective solution.
upvoted 0 times
...
Frederic
5 months ago
Hmm, I'm a bit unsure about this one. The question mentions cost-effectiveness, so I'm not sure if option D with a separate EC2 instance is the best approach. I'll have to think this through carefully.
upvoted 0 times
...
Alberto
5 months ago
This looks like a pretty straightforward question about integrating data from different sources. I think option B with Presto is the most cost-effective solution here.
upvoted 0 times
...
William
5 months ago
I like the idea of using Presto in option B to query the different data sources in a single query. That seems like a really elegant solution, and it should be pretty cost-effective too. I'm leaning towards that one.
upvoted 0 times
...
Florinda
5 months ago
I'm a bit unsure about this one. The options seem pretty technical, so I'll need to think it through step-by-step to make sure I select the right answer.
upvoted 0 times
...
Bethanie
5 months ago
Hmm, the code seems to be trying to read a file even if it doesn't exist. I'll need to focus on that and make sure the indentation is correct.
upvoted 0 times
...
Verdell
5 months ago
I remember going over Auto Scaling and how it adjusts resources, but I'm a bit unclear about the “Elastic Self-Health” feature. Is that actually a thing?
upvoted 0 times
...
Ocie
10 months ago
I wonder if the customer has considered using a crystal ball to query the data. It's probably more reliable than some of these fancy-schmancy data solutions.
upvoted 0 times
Maybelle
8 months ago
C: D) Write a program running on a separate EC2 instance to run queries to three different systems. Aggregate the results after getting the responses from all three systems.
upvoted 0 times
...
Evangelina
8 months ago
B: B) Install Presto on the EMR cluster where Hive sits. Configure MySQL and PostgreSQL connector to select from different data sources in a single query.
upvoted 0 times
...
Yesenia
8 months ago
A: A) Load all data from a different database/warehouse to S3. Use Redshift COPY command to copy data to Redshift for analysis.
upvoted 0 times
...
...
Howard
10 months ago
I like the simplicity of option A. Just load everything to S3 and use Redshift to analyze it all. Sure, it might require a bit more data movement, but it's a tried-and-true approach. Plus, I bet I could write a script to automate the whole process and save the customer some time.
upvoted 0 times
Doyle
9 months ago
Definitely, option A seems like the most cost-effective solution for analyzing data from different sources.
upvoted 0 times
...
Paul
9 months ago
I agree, simplicity is key. Plus, automating the process with a script would definitely save time for the customer.
upvoted 0 times
...
Lashunda
9 months ago
Option A sounds like a solid plan. It's a straightforward process to load data to S3 and analyze it with Redshift.
upvoted 0 times
...
...
Isadora
10 months ago
Ooh, option C sounds interesting! Elasticsearch and Kibana could give us some really slick data visualization capabilities. But I'm not sure if it's the most cost-effective solution, especially with the need to set up a whole new cluster.
upvoted 0 times
Alida
9 months ago
D) Write a program running on a separate EC2 instance to run queries to three different systems. Aggregate the results after getting the responses from all three systems.
upvoted 0 times
...
Shawnna
10 months ago
C) Spin up an Elasticsearch cluster. Load data from all three data sources and use Kibana to analyze.
upvoted 0 times
...
King
10 months ago
B) Install Presto on the EMR cluster where Hive sits. Configure MySQL and PostgreSQL connector to select from different data sources in a single query.
upvoted 0 times
...
Kristel
10 months ago
A) Load all data from a different database/warehouse to S3. Use Redshift COPY command to copy data to Redshift for analysis.
upvoted 0 times
...
...
Cherry
10 months ago
I'm leaning towards option D. Sure, it might require a bit more work, but having a dedicated program to query the different systems and aggregate the results could give us more control and flexibility. Plus, it's always fun to write your own code, am I right?
upvoted 0 times
Tiera
10 months ago
I agree, having a dedicated program to query and aggregate data from different systems could be beneficial in the long run.
upvoted 0 times
...
Hyman
10 months ago
Option D sounds like a good choice. Writing our own program could give us more control over the process.
upvoted 0 times
...
...
Sommer
10 months ago
Hmm, I think option B is the way to go. Presto can query across different data sources without the need for data movement. Plus, it's probably the most cost-effective solution since we don't have to spin up any additional resources like EC2 or Elasticsearch.
upvoted 0 times
...
Huey
10 months ago
I personally think option C could be a good solution. Setting up an Elasticsearch cluster and using Kibana for analysis might provide a more flexible approach.
upvoted 0 times
...
Markus
11 months ago
I disagree, I believe option A is more cost-effective. Loading data to S3 and using Redshift COPY command for analysis seems simpler and efficient.
upvoted 0 times
...
Ivette
11 months ago
I think option B is the best solution. Installing Presto on the EMR cluster seems like a cost-effective way to analyze data from different sources.
upvoted 0 times
...

Save Cancel