Microsoft Exam DP-500 Topic 2 Question 45 Discussion

Actual exam question for Microsoft's DP-500 exam

Question #: 45
Topic #: 2

You use Azure Synapse Analytics and Apache Spark notebooks to You need to use PySpark to gain access to the visual libraries. Which Python libraries should you use?

ASeaborn only

BMatplotlib and Seaborn

CMatplotlib only

DMatplotlib and TensorFlow

ETensorFlow only

FSeaborn and TensorFlow

Show Suggested Answer

Suggested Answer: B

pandas.DataFrame.corr computes pairwise correlation of columns, excluding NA/null values.

Incorrect:

* freqItems

pyspark.sql.DataFrame.freqItems

Finding frequent items for columns, possibly with false positives. Using the frequent element count algorithm described in https://doi.org/10.1145/762471.762473, proposed by Karp, Schenker, and Papadimitriou.'

* summary is used for index.

* There is no panda method for rollup. Rollup would not be correct anyway.

by Susy at Jul 07, 2024, 04:01 AM

Limited Time Offer

25%

Off

Get Premium DP-500 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Roselle

1 years ago

I believe freqItems is used for finding frequent items, not data distribution statistics. So, D) describe is the correct answer.

upvoted 0 times

...

Vonda

1 years ago

I'm not sure, but I think A) freqItems might also be used for data distribution statistics.

upvoted 0 times

...

Huey

1 years ago

The 'describe' method is the way to go! It's like a magic trick - you wave your DataFrame at it, and *poof*, you've got a beautiful table of distribution stats. Saves you from having to do all that number-crunching yourself.

upvoted 0 times

...

Rosendo

1 years ago

Ah, the 'describe' method - the data analyst's best friend! It's like having a personal genie that can summarize your data in a snap. Beats trying to do it all by hand, that's for sure.

upvoted 0 times

Johnathon

11 months ago

'describe' is my go-to method for getting a quick summary of the DataFrame.

upvoted 0 times

...

Arminda

11 months ago

I prefer using 'describe' as well, it gives a quick snapshot of the data distribution.

upvoted 0 times

...

Nina

11 months ago

I agree, 'describe' is definitely a time-saver when it comes to getting an overview of the data.

upvoted 0 times

...

Diane

11 months ago

D) describe

upvoted 0 times

...

Lezlie

11 months ago

Yes, 'describe' is definitely the way to go. It gives you all the key statistics you need at a glance.

upvoted 0 times

...

Gilbert

11 months ago

D) describe

upvoted 0 times

...

Jaime

12 months ago

C) sample

upvoted 0 times

...

Amber

1 years ago

B) corr

upvoted 0 times

...

Devorah

1 years ago

A) freqItems

upvoted 0 times

...

Whitney

1 years ago

I agree with Alecia, describe method gives statistical summary of the DataFrame.

upvoted 0 times

...

Lourdes

1 years ago

Definitely 'describe'! It's the perfect tool for getting a quick overview of your data. Plus, it's way easier than trying to do all that manually. Who's got time for that?

upvoted 0 times

Nadine

1 years ago

Agreed, it's definitely the easiest option.

upvoted 0 times

...

Glory

1 years ago

I think 'describe' is the way to go.

upvoted 0 times

...

Alecia

1 years ago

I think the answer is D) describe.

upvoted 0 times

...

Pamella

1 years ago

Hmm, I think the 'describe' method is the way to go. It's like the Swiss Army knife of data analysis - it gives you a nice summary of the distribution, including measures like mean, standard deviation, and percentiles.

upvoted 0 times