New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Databricks Machine Learning Professional Exam - Topic 1 Question 3 Discussion

Actual exam question for Databricks's Databricks Machine Learning Professional exam
Question #: 3
Topic #: 1
[All Databricks Machine Learning Professional Questions]

A machine learning engineer has developed a random forest model using scikit-learn, logged the model using MLflow as random_forest_model, and stored its run ID in the run_id Python variable. They now want to deploy that model by performing batch inference on a Spark DataFrame spark_df.

Which of the following code blocks can they use to create a function called predict that they can use to complete the task?

A)

B)

It is not possible to deploy a scikit-learn model on a Spark DataFrame.

C)

D)

E)

Show Suggested Answer Hide Answer
Suggested Answer: D

Contribute your Thoughts:

0/2000 characters
Myra
3 months ago
Not sure about D, looks a bit complicated for this task.
upvoted 0 times
...
Leeann
3 months ago
I prefer Option C, seems more straightforward.
upvoted 0 times
...
Val
3 months ago
Surprised to see B as an option, that can't be right!
upvoted 0 times
...
Joni
4 months ago
I think Option B is incorrect, scikit-learn models can work with Spark.
upvoted 0 times
...
Alva
4 months ago
Option A looks solid for batch inference!
upvoted 0 times
...
Kirk
4 months ago
I’m a bit confused about the images in options C, D, and E. I wish I could recall what they contained!
upvoted 0 times
...
Antonio
4 months ago
I practiced a similar question where we had to load a model from MLflow. I feel like option A might be the right choice here.
upvoted 0 times
...
Audrie
4 months ago
I think option B is wrong because we can deploy scikit-learn models on Spark DataFrames, right?
upvoted 0 times
...
Royal
5 months ago
I remember we discussed how to deploy scikit-learn models with Spark, but I’m not sure if it was specifically about random forests.
upvoted 0 times
...
Celestine
5 months ago
I'm leaning towards Option D, but I want to make sure I understand the process of converting the scikit-learn model to a Spark UDF. That might be the key to solving this problem.
upvoted 0 times
...
Denny
5 months ago
I'm a bit confused here. Option B says it's not possible, but the question seems to imply that we can deploy the scikit-learn model on Spark. I'll need to think this through carefully.
upvoted 0 times
...
Kristal
5 months ago
Okay, let's see. Option A looks promising, but I'm not sure if that's the right way to do it. I'll need to double-check the syntax and make sure it's compatible with the given setup.
upvoted 0 times
...
Gail
5 months ago
Hmm, this looks like a tricky one. I'll need to carefully read through the options and think about how to deploy a scikit-learn model on a Spark DataFrame.
upvoted 0 times
...
Renay
5 months ago
Aha, Option C looks like it might be the solution! I'll need to verify that the MLflow integration and the Spark DataFrame usage are correct, but this seems like the most straightforward approach.
upvoted 0 times
...
Daron
5 months ago
I'm not sure about this one. I'll need to eliminate the options that don't sound right and then make an educated guess.
upvoted 0 times
...
Shawna
5 months ago
I'm leaning towards option B being the wrong one. My notes mentioned that HDHPs might actually have lower premiums compared to traditional plans!
upvoted 0 times
...
Nobuko
5 months ago
Okay, I think I've got this. Assigning to the OU is the recommended method because it saves the admin a ton of time and effort compared to manually adding the app to each user.
upvoted 0 times
...

Save Cancel