Microsoft Exam DP-300 Topic 5 Question 113 Discussion

Actual exam question for Microsoft's DP-300 exam
Question #: 113
Topic #: 5

You are designing a date dimension table in an Azure Synapse Analytics dedicated SQL pool. The date dimension table will be used by all the fact tables.

Which distribution type should you recommend to minimize data movement?

Suggested Answer: B

A replicated table has a full copy of the table available on every Compute node. Queries run fast on replicated tables since joins on replicated tables don't require data movement. Replication requires extra storage, though, and isn't practical for large tables.
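To make the data-movement argument concrete, here is a minimal Python sketch of a toy model. It is not Synapse itself: the four-node layout, the modulo "hash" in `node_for`, and all names (`replicated`, `hash_distributed`, `rows_needing_movement`) are hypothetical and chosen only for illustration. It assumes the fact table is hash-distributed on a customer key and counts how many fact rows cannot find their matching date row on their own node.

```python
# Toy model only: 4 hypothetical compute nodes and a modulo stand-in for hashing.
NODES = 4
DATE_KEYS = [20240101, 20240102, 20240103, 20240104]


def node_for(key: int) -> int:
    """Stand-in for a hash function that maps a key to a node."""
    return key % NODES


def replicated(keys):
    """Every node holds a full copy of the dimension keys."""
    return {n: set(keys) for n in range(NODES)}


def hash_distributed(keys):
    """Each dimension key lives on exactly one node."""
    placement = {n: set() for n in range(NODES)}
    for k in keys:
        placement[node_for(k)].add(k)
    return placement


def rows_needing_movement(fact_rows, dim_placement):
    """Count fact rows whose date key is not present on the fact row's node."""
    moved = 0
    for customer_id, date_key in fact_rows:
        fact_node = node_for(customer_id)  # fact table distributed on customer_id
        if date_key not in dim_placement[fact_node]:
            moved += 1
    return moved


# 8 hypothetical customers, each with one fact row per date.
fact_rows = [(c, d) for c in range(8) for d in DATE_KEYS]

moved_replicated = rows_needing_movement(fact_rows, replicated(DATE_KEYS))
moved_hashed = rows_needing_movement(fact_rows, hash_distributed(DATE_KEYS))

print("rows moved, replicated dimension:", moved_replicated)
print("rows moved, hash-distributed dimension:", moved_hashed)
```

In this toy model the replicated dimension needs no row movement for the join, while the hash-distributed dimension does, because the fact table is distributed on a different column than the join key.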

Incorrect Answers:

C: A round-robin distributed table distributes table rows evenly across all distributions. The assignment of rows to distributions is random. Unlike hash-distributed tables, rows with equal values are not guaranteed to be assigned to the same distribution.

As a result, the system sometimes needs to invoke a data movement operation to better organize your data before it can resolve a query.
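The difference between round-robin and hash placement can be sketched in a few lines of Python. This is a simplified illustration, not how the engine works internally: the four distributions, the in-order cycling used as a stand-in for round-robin assignment, the modulo "hash", and the function names are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical number of distributions for the toy example.
DISTRIBUTIONS = 4


def round_robin_placement(keys):
    """Assign rows to distributions in turn, ignoring the key value."""
    placement = defaultdict(set)
    for i, key in enumerate(keys):
        placement[key].add(i % DISTRIBUTIONS)
    return placement


def hash_placement(keys):
    """Assign rows by key value, so equal keys share one distribution."""
    placement = defaultdict(set)
    for key in keys:
        placement[key].add(key % DISTRIBUTIONS)  # modulo as a stand-in hash
    return placement


# Eight rows with only two distinct key values.
row_keys = [7, 7, 7, 7, 9, 9, 9, 9]

rr = round_robin_placement(row_keys)
hashed = hash_placement(row_keys)

for key in sorted(rr):
    print(key, "round-robin:", sorted(rr[key]), "hash:", sorted(hashed[key]))
```

Under round-robin, rows with the same key end up spread over several distributions, so a join or aggregation on that key may need the rows brought together first. Under hash placement, each key maps to a single distribution.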

Reference: https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-distribute

Contribute your Thoughts:

Laquanda
19 days ago
HASH distribution, definitely. Anything to avoid the dreaded 'data movement' in my reports. I'm not trying to be the laughingstock of the data team!
upvoted 0 times
...
Tabetha
21 days ago
Hmm, HASH distribution seems like the logical choice. Can't go wrong with that. Unless you want to be the one explaining all the extra data movement to the boss.
upvoted 0 times
...
Lashon
1 month ago
HASH distribution is the way to go. I don't want to be the one responsible for excessive data movement in the data warehouse!
upvoted 0 times
Antonio
15 days ago
HASH distribution is definitely the best choice for minimizing data movement.
upvoted 0 times
...
...
Lillian
1 month ago
I'm not sure, but I think REPLICATE distribution could also be a good option to consider.
upvoted 0 times
...
Jesusa
1 month ago
HASH distribution sounds like the way to go here. It will ensure that related data is collocated on the same compute node, reducing data movement.
upvoted 0 times
Sueann
1 month ago
I agree, using HASH distribution will definitely help with minimizing data movement.
upvoted 0 times
...
...
Lorrie
2 months ago
I think the correct answer is HASH distribution. It should minimize data movement across the compute nodes in the dedicated SQL pool.
upvoted 0 times
Mike
26 days ago
Yes, HASH distribution ensures that related data is stored together, reducing the need to move data around.
upvoted 0 times
...
Toi
29 days ago
I agree, HASH distribution is the way to go for minimizing data movement.
upvoted 0 times
...
...
Margret
2 months ago
I agree with Socorro. HASH distribution will help optimize performance for all fact tables.
upvoted 0 times
...
Socorro
2 months ago
I think we should recommend HASH distribution to minimize data movement.
upvoted 0 times
...