Amazon-DEA-C01 Exam - Topic 1 Question 19 Discussion

Actual exam question for Amazon's Amazon-DEA-C01 exam
Question #: 19
Topic #: 1

A data engineer needs to securely transfer 5 TB of data from an on-premises data center to an Amazon S3 bucket. Approximately 5% of the data changes every day. Updates to the data need to be regularly propagated to the S3 bucket. The data includes files that are in multiple formats. The data engineer needs to automate the transfer process and must schedule the process to run periodically.

Which AWS service should the data engineer use to transfer the data in the MOST operationally efficient way?

Suggested Answer: A

AWS DataSync is an online data movement and discovery service that simplifies and accelerates data migrations to AWS and moves data to and from on-premises storage, edge locations, other cloud providers, and AWS Storage services [1]. DataSync can copy data to and from various sources and targets, including Amazon S3, and it copies files regardless of their format. It also supports incremental transfers: it detects and copies only the data that has changed, which reduces the amount of data transferred and shortens each run. DataSync tasks can be scheduled to run periodically, and their progress and status can be monitored through Amazon CloudWatch metrics and events [1].
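As a rough illustration of the scheduling and incremental-copy behavior described above, the sketch below builds the parameters for a DataSync `CreateTask` call with boto3. The task name, the cron expression (daily at 02:00, assumed to be UTC), and the choice of options are assumptions made for this example and do not come from the question. The location ARNs are expected to belong to locations that already exist.

```python
from typing import Any, Dict


def build_datasync_task_request(
    source_location_arn: str,
    destination_location_arn: str,
    schedule_expression: str = "cron(0 2 * * ? *)",  # assumed: once a day at 02:00
) -> Dict[str, Any]:
    """Build keyword arguments for the DataSync CreateTask API call."""
    return {
        "SourceLocationArn": source_location_arn,
        "DestinationLocationArn": destination_location_arn,
        "Name": "onprem-to-s3-daily",  # hypothetical task name
        "Options": {
            # Copy only data that differs between source and destination.
            "TransferMode": "CHANGED",
            # Verify only the files that were transferred in this run.
            "VerifyMode": "ONLY_FILES_TRANSFERRED",
        },
        # DataSync starts the task on this schedule.
        "Schedule": {"ScheduleExpression": schedule_expression},
    }


def create_scheduled_task(request: Dict[str, Any]) -> str:
    """Create the task in AWS and return its ARN (needs boto3 and credentials)."""
    import boto3  # imported lazily so building the request needs no AWS SDK

    client = boto3.client("datasync")
    return client.create_task(**request)["TaskArn"]
```

Calling `build_datasync_task_request(...)` has no side effects, so the parameters can be inspected or logged before `create_scheduled_task` sends them to AWS.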

AWS DataSync is the most operationally efficient way to transfer the data in this scenario because it meets all the requirements with a managed, scalable service. AWS Glue, AWS Direct Connect, and Amazon S3 Transfer Acceleration each fall short by comparison. AWS Glue is a serverless ETL service that can extract, transform, and load data from various sources to various targets, including Amazon S3 [2]. However, AWS Glue is designed for transforming data rather than for bulk file transfer, and it has quotas and limits that constrain how much it can process [3]. It also does not offer file-level incremental synchronization from on-premises storage in the way DataSync does, so using it to keep the bucket up to date would be inefficient and costly.
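The scenario's own numbers show why copying only the changes matters. The sketch below compares a full 5 TB copy with a daily 5% delta. The 1 Gbps link speed and the 80% usable-bandwidth figure are assumptions chosen for illustration and are not part of the question.

```python
TB = 1000**4  # decimal terabyte, in bytes


def daily_change_bytes(total_bytes: float, change_fraction: float) -> float:
    """Bytes that change per day for a given data set size and change rate."""
    return total_bytes * change_fraction


def transfer_hours(num_bytes: float, link_mbps: float, utilization: float = 0.8) -> float:
    """Hours needed to move num_bytes over a link of link_mbps megabits per second.

    utilization is the assumed fraction of the link actually available.
    """
    bits = num_bytes * 8
    effective_bits_per_second = link_mbps * 1e6 * utilization
    return bits / effective_bits_per_second / 3600


total = 5 * TB
delta = daily_change_bytes(total, 0.05)  # about 250 GB per day

full_copy_hours = transfer_hours(total, link_mbps=1000)
incremental_hours = transfer_hours(delta, link_mbps=1000)
print(f"full copy: {full_copy_hours:.1f} h, daily delta: {incremental_hours:.1f} h")
```

Under these assumptions a full copy takes about 13.9 hours, while the daily delta takes about 0.7 hours, which is a twentyfold difference for every scheduled run.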

AWS Direct Connect is a service that establishes a dedicated network connection between your on-premises data center and AWS, bypassing the public internet and improving the bandwidth and performance of the data transfer. However, AWS Direct Connect is not a data transfer service by itself, as it requires additional services or tools to copy the data, such as AWS DataSync, AWS Storage Gateway, or AWS CLI. AWS Direct Connect also has some hardware and location requirements, and charges you for the port hours and data transfer out of AWS.

Amazon S3 Transfer Acceleration is a feature that enables faster data transfers to Amazon S3 over long distances, using the AWS edge locations and optimized network paths. However, Amazon S3 Transfer Acceleration is not a data transfer service by itself, as it requires additional services or tools to copy the data, such as AWS CLI, AWS SDK, or third-party software. Amazon S3 Transfer Acceleration also charges you for the data transferred over the accelerated endpoints, and does not guarantee a performance improvement for every transfer, as it depends on various factors such as the network conditions, the distance, and the object size.

References:

1. AWS DataSync
2. AWS Glue
3. AWS Glue quotas and limits
4. AWS Direct Connect
5. Data transfer options for AWS Direct Connect
6. Amazon S3 Transfer Acceleration
7. Using Amazon S3 Transfer Acceleration


Contribute your Thoughts:

Jaleesa
3 months ago
I’m surprised they didn’t mention using Snowball for this!
Rosalia
3 months ago
Wait, can S3 Transfer Acceleration handle 5 TB efficiently?
Mirta
3 months ago
Direct Connect seems overkill for just transferring files.
Oretha
4 months ago
I think AWS Glue could work too, but not as efficiently.
Ashley
4 months ago
AWS DataSync is perfect for this!
Joni
4 months ago
Amazon S3 Transfer Acceleration sounds familiar, but I feel like it’s more about speeding up transfers rather than automating them.
Dino
4 months ago
I practiced a similar question where AWS Direct Connect was mentioned, but I don't think it's the best for this specific scenario.
Bok
4 months ago
I'm not entirely sure, but I remember something about AWS Glue being more for data transformation rather than transfer.
Val
5 months ago
I think AWS DataSync might be the right choice since it automates data transfer and handles changes efficiently.
Gabriele
5 months ago
I'm a little confused by all the options. What's the difference between AWS DataSync and the other choices like Direct Connect or Glue? I'll need to research the capabilities of each service more to figure out the best fit.
Cathrine
5 months ago
Based on the details provided, AWS DataSync seems like the clear choice here. It can handle large data volumes, support multiple file formats, and automate the transfer process with scheduling. That's exactly what this data engineer needs.
Tashia
5 months ago
Hmm, I'm a bit unsure about this one. There are a few different AWS services that could potentially work, like AWS Glue or S3 Transfer Acceleration. I'll need to carefully review the requirements to determine the best option.
Reiko
5 months ago
This seems like a pretty straightforward data transfer problem. I think AWS DataSync would be the most operationally efficient solution since it's designed for this kind of secure, automated data transfer.
Johnson
7 months ago
AWS Direct Connect could work, but it seems overkill for just 5 TB of data. Plus, the automated updates would be a pain to manage with that approach.
Ria
7 months ago
That's a good point, Miesha. Amazon S3 Transfer Acceleration could be a good choice for faster transfer speeds.
Graciela
7 months ago
AWS Glue might be a good option if the data is in multiple formats, but I'm not sure it's the most operationally efficient for this use case.
Miesha
7 months ago
I would go with Amazon S3 Transfer Acceleration. It can speed up the transfer of data to S3.
Ernie
7 months ago
I think AWS DataSync is the way to go here. It's designed specifically for high-speed, secure data transfers, and it can handle the automatic updates too.
Tawna
6 months ago
I think AWS Glue could also work since it's used for data integration and ETL processes.
Brittani
6 months ago
I agree, AWS DataSync seems like the best option for this scenario.
Ria
7 months ago
AWS DataSync is designed for transferring large amounts of data efficiently and can handle data changes well.
Delisa
7 months ago
Why do you think that, Ria?
Ria
7 months ago
I think the data engineer should use AWS DataSync.
