Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Professional Machine Learning Engineer Exam - Topic 8 Question 70 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam
Question #: 70
Topic #: 8
[All Professional Machine Learning Engineer Questions]

You are creating a social media app where pet owners can post images of their pets. You have one million user uploaded images with hashtags. You want to build a comprehensive system that recommends images to users that are similar in appearance to their own uploaded images.

What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: D

The best option to build a comprehensive system that recommends images to users that are similar in appearance to their own uploaded images is to download a pretrained convolutional neural network (CNN), and use the model to generate embeddings of the input images. Embeddings are low-dimensional representations of high-dimensional data that capture the essential features and semantics of the data. By using a pretrained CNN, you can leverage the knowledge learned from large-scale image datasets, such as ImageNet, and apply it to your own domain. A pretrained CNN can be used as a feature extractor, where the output of the last hidden layer (or any intermediate layer) is taken as the embedding vector for the input image. You can then measure the similarity between embeddings using a distance metric, such as cosine similarity or Euclidean distance, and recommend images that have the highest similarity scores to the user's uploaded image. Option A is incorrect because downloading a pretrained CNN and fine-tuning the model to predict hashtags based on the input images may not capture the visual similarity of the images, as hashtags may not reflect the appearance of the images accurately. For example, two images of different breeds of dogs may have the same hashtag #dog, but they may not look similar to each other. Moreover, fine-tuning the model may require additional data and computational resources, and it may not generalize well to new images that have different or missing hashtags. Option B is incorrect because retrieving image labels and dominant colors from the input images using the Vision API may not capture the visual similarity of the images, as labels and colors may not reflect the fine-grained details of the images. For example, two images of the same breed of dog may have different labels and colors depending on the background, lighting, and angle of the image. Moreover, using the Vision API may incur additional costs and latency, and it may not be able to handle custom or domain-specific labels. Option C is incorrect because using the provided hashtags to create a collaborative filtering algorithm may not capture the visual similarity of the images, as collaborative filtering relies on the ratings or preferences of users, not the features of the images. For example, two images of different animals may have similar ratings or preferences from users, but they may not look similar to each other. Moreover, collaborative filtering may suffer from the cold start problem, where new images or users that have no ratings or preferences cannot be recommended.Reference:

Image similarity search with TensorFlow

Image embeddings documentation

Pretrained models documentation

Similarity metrics documentation


Contribute your Thoughts:

0/2000 characters
Torie
4 months ago
C is a classic approach, but it might miss out on visual similarities!
upvoted 0 times
...
Monroe
4 months ago
B is interesting, but I wonder if it’s enough without deep learning.
upvoted 0 times
...
Vonda
4 months ago
Wait, can we really trust a model to get it right? Seems risky.
upvoted 0 times
...
Jennie
4 months ago
I think A could work too, but not as well as D.
upvoted 0 times
...
Alonzo
5 months ago
Option D sounds like the best choice for accurate recommendations!
upvoted 0 times
...
Lanie
5 months ago
I feel like generating embeddings from a CNN is the most effective way to measure image similarity, but I need to double-check how that works.
upvoted 0 times
...
Micheal
5 months ago
I practiced a similar question about collaborative filtering, but I’m not convinced it would capture the visual similarity we need here.
upvoted 0 times
...
Kristofer
5 months ago
I think using the Vision API to get image labels and colors could work, but I wonder if that’s enough for good recommendations.
upvoted 0 times
...
Deandrea
5 months ago
I remember we discussed using pretrained CNNs in class, but I'm not sure if fine-tuning for hashtags is the best approach.
upvoted 0 times
...
Trevor
5 months ago
I'm feeling pretty confident about this one. I think option D is the way to go - using image embeddings to measure similarity is a classic approach for this type of problem. As long as I can find a good pretrained model, I should be able to get a solid recommendation system up and running.
upvoted 0 times
...
Theron
5 months ago
Option A looks like a good place to start. Fine-tuning a pretrained CNN to predict hashtags could be a nice way to leverage existing models and the hashtag data we have. I'd want to experiment with different CNN architectures and hyperparameters to see what works best.
upvoted 0 times
...
Vanna
5 months ago
Hmm, I'm a bit unsure about this one. I'm not super familiar with image recommendation systems, so I'll have to think it through carefully. Maybe I'll start by looking into the different approaches and see which one seems most promising given the details provided.
upvoted 0 times
...
Carmen
5 months ago
This seems like a pretty straightforward image recommendation problem. I think I'd go with option D - using a pretrained CNN to generate image embeddings and then measuring similarity to make recommendations. That way I can leverage transfer learning and avoid having to train a whole model from scratch.
upvoted 0 times
...
Josphine
5 months ago
Hmm, I'm a bit unsure about this one. The options seem to cover different areas of the system, so I'll need to think carefully about which one specifically refers to the accounts receivable setup.
upvoted 0 times
...
Alesia
6 months ago
Okay, I know virtual machines and containers are both virtualization technologies, but I'm not sure how Kubernetes fits into this. I'll have to review my notes on 5G infrastructure.
upvoted 0 times
...
Mila
6 months ago
I'm not entirely sure about this one. I'll need to review the Kafka producer documentation to make sure I understand the potential exceptions.
upvoted 0 times
...
Gayla
2 years ago
Yeah, that's a good point, Shaun. The Vision API route could be a nice balance between complexity and performance. Plus, it would allow us to leverage both the visual features and the hashtag information, which seems like a more well-rounded approach.
upvoted 0 times
Lawanda
2 years ago
D) Download a pretrained convolutional neural network, and use the model to generate embeddings of the input images. Measure similarity between embeddings to make recommendations.
upvoted 0 times
...
Patti
2 years ago
B) Retrieve image labels and dominant colors from the input images using the Vision API. Use these properties and the hashtags to make recommendations.
upvoted 0 times
...
Shawna
2 years ago
A) Download a pretrained convolutional neural network, and fine-tune the model to predict hashtags based on the input images. Use the predicted hashtags to make recommendations.
upvoted 0 times
...
...
Shaun
2 years ago
I'm not sure about that. Wouldn't using the Vision API to retrieve labels and colors be a more straightforward solution? It seems like it would be simpler to implement and might work just as well, especially if the hashtags are reliable.
upvoted 0 times
...
Leah
2 years ago
I agree with Ellsworth. Using a pre-trained model to extract features and compute image similarity seems like the most robust approach. The hashtags can be a useful additional signal, but they shouldn't be the only factor in the recommendation system.
upvoted 0 times
...
Ellsworth
2 years ago
Hmm, this is an interesting question. I think the best approach would be to use a pre-trained convolutional neural network and generate image embeddings to make recommendations. That way, we can capture the visual similarity between the images without relying solely on the hashtags, which might not always be accurate or comprehensive.
upvoted 0 times
...

Save Cancel