Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Professional Data Engineer Exam - Topic 3 Question 71 Discussion

You are developing an application that uses a recommendation engine on Google Cloud. Your solution should display new videos to customers based on past views. Your solution needs to generate labels for the entities in videos that the customer has viewed. Your design must be able to provide very fast filtering suggestions based on data from other customer preferences on several TB of data. What should you do?
C) Build an application that calls the Cloud Video Intelligence API to generate labels. Store data in Cloud Bigtable, and filter the predicted labels to match the user's viewing history to generate preferences.
A) Build and train a complex classification model with Spark MLlib to generate labels and filter the results. Deploy the models using Cloud Dataproc. Call the model from your application.
B) Build and train a classification model with Spark MLlib to generate labels. Build and train a second classification model with Spark MLlib to filter results to match customer preferences. Deploy the models using Cloud Dataproc. Call the models from your application.
D) Build an application that calls the Cloud Video Intelligence API to generate labels. Store data in Cloud SQL, and join and filter the predicted labels to match the user's viewing history to generate preferences.

Google Professional Data Engineer Exam - Topic 3 Question 71 Discussion

Actual exam question for Google's Professional Data Engineer exam
Question #: 71
Topic #: 3
[All Professional Data Engineer Questions]

You are developing an application that uses a recommendation engine on Google Cloud. Your solution should display new videos to customers based on past views. Your solution needs to generate labels for the entities in videos that the customer has viewed. Your design must be able to provide very fast filtering suggestions based on data from other customer preferences on several TB of dat

a. What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: C

Contribute your Thoughts:

0/2000 characters
Lettie
6 months ago
Not sure if Cloud Bigtable is the best choice for this use case.
upvoted 0 times
...
Tresa
7 months ago
A is solid, but I prefer the simplicity of C.
upvoted 0 times
...
Micaela
7 months ago
Surprised that using Cloud SQL is even an option here!
upvoted 0 times
...
Noel
7 months ago
I think B is overkill, two models just for filtering?
upvoted 0 times
...
Jody
7 months ago
Option C seems the most efficient for quick labeling!
upvoted 0 times
...
Avery
7 months ago
I’m a bit confused about whether to use one model or two for filtering. I think having two models could complicate things, but it might be necessary for accuracy.
upvoted 0 times
...
Dierdre
8 months ago
I practiced a similar question where we had to choose between different storage options. I feel like Cloud Bigtable is better for handling large datasets like this one.
upvoted 0 times
...
Izetta
8 months ago
I'm not entirely sure, but I think using Cloud SQL might slow down the filtering process compared to Cloud Bigtable.
upvoted 0 times
...
Detra
8 months ago
I remember we discussed using the Cloud Video Intelligence API in class. It seems like a good fit for generating labels quickly.
upvoted 0 times
...
Pura
8 months ago
Whoa, this is a lot to take in. I'm not sure I have the experience to build a full recommendation engine from scratch, especially with all the different Google Cloud services involved. Maybe I'd start by trying to understand the overall architecture and see if I can find any sample code or tutorials to get me started.
upvoted 0 times
...
Anna
8 months ago
Okay, I think I've got a plan. I'd start by exploring the Cloud Video Intelligence API and seeing how I can use that to generate the video labels. Then I'd look into Cloud Bigtable as a fast, scalable way to store and filter the data based on customer preferences. I'm feeling pretty confident I can tackle this if I break it down step-by-step.
upvoted 0 times
...
Michel
8 months ago
Hmm, this is a tough one. I'm not super familiar with Spark MLlib or Cloud Dataproc, but I think option C might be the way to go. Using the Cloud Video Intelligence API to generate the labels and then storing and filtering the data in Cloud Bigtable sounds like a more straightforward approach.
upvoted 0 times
...
Dorian
8 months ago
This looks like a complex problem that requires a lot of different components to be integrated. I'm not sure if I have the skills to build a full solution from scratch, but I think I could at least try to break it down and tackle it piece by piece.
upvoted 0 times
...
Johnathon
8 months ago
I'm a bit confused on this one. I know there are a few different objects we could use, but I'm not totally sure which one is the best fit for modeling a client. I'll need to double-check the documentation to make sure I get this right.
upvoted 0 times
...
Eliseo
8 months ago
Hmm, I'm a bit confused by this question. It mentions a "series of questions" and that I won't be able to go back and review previous questions. That seems a bit tricky. I'll need to be really careful to understand each question before answering.
upvoted 0 times
...
Mel
8 months ago
I think you can't place hosted elements in Elevation Views, but I could be wrong. It's kind of tricky.
upvoted 0 times
...
Rima
8 months ago
Okay, let's see. The key here is to determine the appropriate action based on the risk exposure. I think I have a strategy to work through this step-by-step.
upvoted 0 times
...
Audra
1 year ago
As a video enthusiast, I'd love to see a recommendation engine that can keep up with my binge-watching habits! Option C sounds like it would be a 'reel' winner.
upvoted 0 times
Kristel
11 months ago
I think Option C is the way to go for developing an application with a recommendation engine on Google Cloud.
upvoted 0 times
...
Taryn
11 months ago
It would definitely help in providing fast filtering suggestions based on past views.
upvoted 0 times
...
Fausto
11 months ago
I agree, using the Cloud Video Intelligence API to generate labels and storing data in Cloud Bigtable seems like a solid solution.
upvoted 0 times
...
Louvenia
12 months ago
Option C sounds like a great choice for generating labels and filtering suggestions based on customer preferences.
upvoted 0 times
...
...
Stephanie
1 year ago
Hmm, I'm not sure I'd want to build a complex Spark MLlib model just for this task. The Cloud Video Intelligence API in Option C or D looks like a more efficient solution to me.
upvoted 0 times
Donette
12 months ago
Yeah, I think using the Cloud Video Intelligence API in Option C or D would be the way to go for fast filtering suggestions.
upvoted 0 times
...
Salome
12 months ago
I agree, Option D also seems like a good option. Storing data in Cloud SQL could be useful.
upvoted 0 times
...
Ocie
1 year ago
Yeah, building and training multiple models in Option B might be overkill. The Cloud Video Intelligence API sounds simpler.
upvoted 0 times
...
Quentin
1 year ago
Option C sounds like a good choice. Using the Cloud Video Intelligence API seems efficient.
upvoted 0 times
...
Frederic
1 year ago
I agree, Option D also seems like a good option. Storing data in Cloud SQL could be useful.
upvoted 0 times
...
Talia
1 year ago
Option C sounds like a good choice. Using the Cloud Video Intelligence API seems efficient.
upvoted 0 times
...
...
Fernanda
1 year ago
Option D seems like a good choice if you're already familiar with Cloud SQL. The simplicity of storing data in a relational database and joining the labels with user history could be a straightforward approach.
upvoted 0 times
Tonette
1 year ago
That's true, combining the API for labels with Cloud SQL for filtering could be a powerful combination.
upvoted 0 times
...
Rashida
1 year ago
But wouldn't using the Cloud Video Intelligence API to generate labels be more accurate?
upvoted 0 times
...
Nina
1 year ago
I agree, using Cloud SQL for storing data and joining labels with user history sounds efficient.
upvoted 0 times
...
Kenia
1 year ago
Option D seems like a good choice if you're already familiar with Cloud SQL.
upvoted 0 times
...
...
Joana
1 year ago
I'm leaning towards Option B. Having two separate models, one for generating labels and one for filtering, could provide more flexibility and control over the recommendation process.
upvoted 0 times
Bernardine
1 year ago
I agree, Option B seems like it would give us more control over the recommendation process.
upvoted 0 times
...
Frederica
1 year ago
Option B sounds like a good choice. Having separate models for generating labels and filtering could be beneficial.
upvoted 0 times
...
...
Patrick
1 year ago
That's a good point, but I still think option A is more straightforward and easier to implement.
upvoted 0 times
...
Lynelle
1 year ago
I disagree, I believe option B is better. Having two classification models for labeling and filtering can provide more accurate results.
upvoted 0 times
...
Galen
1 year ago
Option C looks like the most efficient and scalable solution. Using the Cloud Video Intelligence API and Cloud Bigtable seems like a great way to handle the large amounts of data and provide fast filtering capabilities.
upvoted 0 times
Mary
1 year ago
Yes, using Cloud Bigtable for storing data and filtering predicted labels based on user viewing history is a smart choice for this recommendation engine application.
upvoted 0 times
...
Arthur
1 year ago
It's important to have fast filtering suggestions based on customer preferences, and Cloud Bigtable can definitely help with that.
upvoted 0 times
...
Jeffrey
1 year ago
I agree, Cloud Video Intelligence API can help generate accurate labels for the videos, and Cloud Bigtable can handle the massive amount of data efficiently.
upvoted 0 times
...
Colene
1 year ago
Option C looks like the most efficient and scalable solution. Using the Cloud Video Intelligence API and Cloud Bigtable seems like a great way to handle the large amounts of data and provide fast filtering capabilities.
upvoted 0 times
...
...
Patrick
1 year ago
I think option A is the best choice. Using Spark MLlib for classification and Cloud Dataproc for deployment seems efficient.
upvoted 0 times
...

Save Cancel