A data engineer is using an AWS Glue ETL job to remove outdated customer records from a table that contains customer account information. The data engineer is using the following SQL command:
MERGE INTO accounts t USING monthly_accounts_update s
ON t.customer = s.customer
WHEN MATCHED THEN DELETE
What will happen when the data engineer runs the SQL command?
In AWS Glue's SQL implementation (Spark SQL-compatible), the MERGE INTO statement supports conditional actions.
The clause WHEN MATCHED THEN DELETE deletes matching records from the target table (accounts) where the join condition is true.
''A MERGE INTO statement can perform updates, inserts, or deletes based on the match condition between source and target tables.''
-- Ace the AWS Certified Data Engineer - Associate Certification - version 2 - apple.pdf
A company is planning to upgrade its Amazon Elastic Block Store (Amazon EBS) General Purpose SSD storage from gp2 to gp3. The company wants to prevent any interruptions in its Amazon EC2 instances that will cause data loss during the migration to the upgraded storage.
Which solution will meet these requirements with the LEAST operational overhead?
Changing the volume type of the existing gp2 volumes to gp3 is the easiest and fastest way to migrate to the new storage type without any downtime or data loss. You can use the AWS Management Console, the AWS CLI, or the Amazon EC2 API to modify the volume type, size, IOPS, and throughput of your gp2 volumes. The modification takes effect immediately, and you can monitor the progress of the modification using CloudWatch. The other options are either more complex or require additional steps, such as creating snapshots, transferring data, or attaching new volumes, which can increase the operational overhead and the risk of errors.Reference:
Migrating Amazon EBS volumes from gp2 to gp3 and save up to 20% on costs(Section: How to migrate from gp2 to gp3)
Switching from gp2 Volumes to gp3 Volumes to Lower AWS EBS Costs(Section: How to Switch from GP2 Volumes to GP3 Volumes)
Modifying the volume type, IOPS, or size of an EBS volume - Amazon Elastic Compute Cloud(Section: Modifying the volume type)
A company processes 500 GB of audience and advertising data daily, storing CSV files in Amazon S3 with schemas registered in AWS Glue Data Catalog. They need to convert these files to Apache Parquet format and store them in an S3 bucket.
The solution requires a long-running workflow with 15 GiB memory capacity to process the data concurrently, followed by a correlation process that begins only after the first two processes complete.
AWS Glue Workflows can coordinate multiple ETL jobs and triggers. They support parallel execution and sequential dependencies, which is ideal for concurrent data processing followed by correlation steps, all with minimal operational overhead.
''Use AWS Glue Workflows to orchestrate multiple ETL jobs in sequence or in parallel, supporting conditional triggers and dependency management.''
-- Ace the AWS Certified Data Engineer - Associate Certification - version 2 - apple.pdf
A company has an on-premises PostgreSQL database that contains customer data. The company wants to migrate the customer data to an Amazon Redshift data warehouse. The company has established a VPN connection between the on-premises database and AWS.
The on-premises database is continuously updated. The company must ensure that the data in Amazon Redshift is updated as quickly as possible.
Which solution will meet these requirements?
Option B is the only solution that supports near real-time updates from a continuously changing source to Amazon Redshift. The requirement says the on-premises PostgreSQL database is ''continuously updated'' and the target must be updated ''as quickly as possible.'' Nightly full backups or nightly full loads (Options A and D) inherently introduce at least a daily lag, which violates the freshness requirement. Similarly, exporting with pg_dump and reloading with COPY (Option C) is a batch approach and does not provide continuous change propagation.
The study material explicitly positions AWS Database Migration Service (DMS) for database migrations and highlights that it supports both full-load and change data capture (CDC), and that CDC enables continuous replication so ongoing changes can be applied after the initial load.
A company is building a new application that ingests CSV files into Amazon Redshift. The company has developed the frontend for the application.
The files are stored in an Amazon S3 bucket. Files are no larger than 5 MB.
A data engineer is developing the extract, transform, and load (ETL) pipeline for the CSV files. The data engineer configured a Redshift cluster and an AWS Lambda function that copies the data out of the files into the Redshift cluster.
Which additional steps should the data engineer perform to meet these requirements?
Option A is the most direct and operationally efficient way to trigger the existing Lambda-based load into Amazon Redshift whenever a new CSV file is uploaded to Amazon S3. The key requirement is event-driven automation on ''new object created'' events, and the study material explicitly describes using Amazon EventBridge to react to S3 uploads by creating a rule on the default event bus and routing matching events to a target application.
Compared with queue-based designs (Options B and D), Option A reduces components: there is no need to manage an SQS queue, batching/visibility timeout behavior, or retry semantics at the consumer layer. This is especially appropriate because the files are small ( 5 MB) and the Lambda function is already implemented to perform the copy/load step. Using DMS (Option C) is not designed for ''S3 object arrival Lambda Redshift'' event triggering; it introduces unnecessary services and operational work for a simple file-ingestion trigger.
Betty Rogers
10 days agoAmy Thompson
16 days agoAngela Wilson
16 days agoBetty Phillips
16 days agoElizabeth Peterson
16 days agoDeborah Green
16 days agoHarold Rodriguez
24 days agoBarbara Lee
27 days agoRyan Rogers
1 month agoStephen Sanchez
1 month agoBrian Taylor
1 month agoLinda Johnson
1 month agoJoseph Cook
28 days agoGeorge Nguyen
24 days agoDonna Lewis
22 days agoSharen
2 months agoEveline
2 months agoAbraham
3 months agoTelma
3 months agoLashawn
3 months agoLavelle
3 months agoSanjuana
4 months agoLatricia
4 months agoCorazon
4 months agoLaine
4 months agoCecily
5 months agoMarsha
5 months agoFelix
5 months agoCammy
5 months agoFloyd
6 months agoKarma
6 months agoDenise
6 months agoKenneth
6 months agoZona
7 months agoCyril
7 months agoRyann
7 months agoSue
7 months agoPamela
8 months agoTammara
8 months agoChantell
8 months agoJame
8 months agoStanton
9 months agoMargot
9 months agoStefanie
9 months agoNakisha
9 months agoCarlene
9 months agoBea
9 months agoTomoko
11 months agoCarlota
11 months agoJustine
12 months agoDallas
1 year agoHui
1 year agoLeonor
1 year agoRosenda
1 year agoDiego
1 year agoElbert
1 year agoJohnetta
1 year agoFletcher
1 year agoAndra
1 year agoKaitlyn
1 year agoCecilia
1 year agoMarquetta
1 year agoWade
1 year agoGlory
1 year agoTatum
1 year agoMelodie
1 year agoVicki
1 year agoGaston
1 year agoPedro
1 year agoTanesha
1 year agoFredric
2 years agoGlenn
2 years agoEliseo
2 years agoShawna
2 years agoEloisa
2 years agoDaron
2 years agoLashonda
2 years agoEdgar
2 years agoRessie
2 years agoIlene
2 years agoKarina
2 years ago