Microsoft Exam DP-600 Topic 1 Question 8 Discussion

Actual exam question for Microsoft's DP-600 exam

Question #: 8
Topic #: 1

You are analyzing customer purchases in a Fabric notebook by using PySpanc You have the following DataFrames:

You need to join the DataFrames on the customer_id column. The solution must minimize data shuffling. You write the following code.

Which code should you run to populate the results DataFrame?

AOption A

BOption B

COption C

DOption D

Show Suggested Answer

Suggested Answer: A

by Estrella at Mar 21, 2024, 02:51 AM

Limited Time Offer

25%

Off

Get Premium DP-600 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Beatriz

1 years ago

Interesting, why do you think code C could be a good choice?

upvoted 0 times

...

Shasta

1 years ago

I think code C may also be a good option to consider.

upvoted 0 times

...

Sharita

1 years ago

Code A minimizes data shuffling more effectively than code B.

upvoted 0 times

...

Beatriz

1 years ago

Why do you think code A is better?

upvoted 0 times

...

Sharita

1 years ago

I disagree, I believe code A is the better choice.

upvoted 0 times

...

Beatriz

1 years ago

I think the best option is to run code B.

upvoted 0 times

...

Louann

1 years ago

Haha, good one! I'll make sure to keep my notes in a Fabric notebook, just like the one in the question. Definitely the most efficient way to study for this exam.

upvoted 0 times

...

Jean

1 years ago

Exactly! I'm going with option C. It just makes the most sense for this scenario. Now, if only the exam had a question about the best way to store my exam notes... *wink wink*

upvoted 0 times

Peter

1 years ago

Going with option C, thanks for the input.

upvoted 0 times

...

Rodolfo

1 years ago

Option C it is, no need to overthink it.

upvoted 0 times

...

Corinne

1 years ago

Agreed, option C it is.

upvoted 0 times

...

Xuan

1 years ago

Let's all choose option C then.

upvoted 0 times

...

Son

1 years ago

Yeah, option C minimizes data shuffling.

upvoted 0 times

...

Lettie

1 years ago

Option C seems to be the best choice here.

upvoted 0 times

...

Alise

1 years ago

I think I will go with option C as well.

upvoted 0 times

...

Ooh, good catch! Option C does seem like the winner here. Broadcast joins are great for minimizing data movement, especially when one of the DataFrames is small enough to fit in memory on each partition.

upvoted 0 times

...