Google Professional Machine Learning Engineer Exam - Topic 5 Question 97 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam

Question #: 97
Topic #: 5

[All Professional Machine Learning Engineer Questions]

You need to train a natural language model to perform text classification on product descriptions that contain millions of examples and 100,000 unique words. You want to preprocess the words individually so that they can be fed into a recurrent neural network. What should you do?

ACreate a hot-encoding of words, and feed the encodings into your model.

BIdentify word embeddings from a pre-trained model, and use the embeddings in your model.

CSort the words by frequency of occurrence, and use the frequencies as the encodings in your model.

DAssign a numerical value to each word from 1 to 100,000 and feed the values as inputs in your model.

Show Suggested Answer

Suggested Answer: D

The best option to build a comprehensive system that recommends images to users that are similar in appearance to their own uploaded images is to download a pretrained convolutional neural network (CNN), and use the model to generate embeddings of the input images. Embeddings are low-dimensional representations of high-dimensional data that capture the essential features and semantics of the data. By using a pretrained CNN, you can leverage the knowledge learned from large-scale image datasets, such as ImageNet, and apply it to your own domain. A pretrained CNN can be used as a feature extractor, where the output of the last hidden layer (or any intermediate layer) is taken as the embedding vector for the input image. You can then measure the similarity between embeddings using a distance metric, such as cosine similarity or Euclidean distance, and recommend images that have the highest similarity scores to the user's uploaded image. Option A is incorrect because downloading a pretrained CNN and fine-tuning the model to predict hashtags based on the input images may not capture the visual similarity of the images, as hashtags may not reflect the appearance of the images accurately. For example, two images of different breeds of dogs may have the same hashtag #dog, but they may not look similar to each other. Moreover, fine-tuning the model may require additional data and computational resources, and it may not generalize well to new images that have different or missing hashtags. Option B is incorrect because retrieving image labels and dominant colors from the input images using the Vision API may not capture the visual similarity of the images, as labels and colors may not reflect the fine-grained details of the images. For example, two images of the same breed of dog may have different labels and colors depending on the background, lighting, and angle of the image. Moreover, using the Vision API may incur additional costs and latency, and it may not be able to handle custom or domain-specific labels. Option C is incorrect because using the provided hashtags to create a collaborative filtering algorithm may not capture the visual similarity of the images, as collaborative filtering relies on the ratings or preferences of users, not the features of the images. For example, two images of different animals may have similar ratings or preferences from users, but they may not look similar to each other. Moreover, collaborative filtering may suffer from the cold start problem, where new images or users that have no ratings or preferences cannot be recommended.Reference:

Image similarity search with TensorFlow

Image embeddings documentation

Pretrained models documentation

Similarity metrics documentation

by Cristy at Feb 08, 2025, 09:01 AM

Limited Time Offer

25%

Off

Get Premium Professional Machine Learning Engineer Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Celeste

6 months ago

Sorting by frequency? That doesn't seem like a solid approach.

upvoted 0 times

...

Kenneth

6 months ago

Totally agree with B! Word embeddings are the best choice here.

upvoted 0 times

...

Fidelia

6 months ago

Wait, can you really just assign numbers to words? Sounds odd.

upvoted 0 times

...

Caprice

7 months ago

Hot-encoding? That's super inefficient for this scale.

upvoted 0 times

...

Ligia

7 months ago

B is definitely the way to go! Pre-trained embeddings save time.

upvoted 0 times

...

Yen

7 months ago

Assigning numerical values sounds too simplistic; I feel like it wouldn't capture the relationships between words well enough.

upvoted 0 times

...

Twanna

7 months ago

I practiced a similar question where we had to choose between embeddings and one-hot encoding, and embeddings won out for performance.

upvoted 0 times

...

Marjory

7 months ago

I'm not entirely sure, but I think hot-encoding might create a huge sparse matrix, which could be inefficient for our model.

upvoted 0 times

...

Frederic

8 months ago

I remember we discussed word embeddings in class, and they seem to be really effective for capturing semantic meaning.

upvoted 0 times

...

Shayne

8 months ago

Whoa, this is a lot of data to work with. I'm a little overwhelmed, to be honest. I think I'd start by trying option D, just to get something simple up and running. Assigning numerical values to the words seems like a good starting point, even if it's not the most sophisticated approach.

upvoted 0 times

...

Aretha

8 months ago

Okay, I've got this! The answer is definitely B. Using pre-trained word embeddings is the way to go for this kind of problem. It will give us a compact, meaningful representation of the words that the neural network can work with. I feel pretty confident about this one.

upvoted 0 times

...

Refugia

8 months ago

Hmm, this is a tricky one. I'm not sure if a hot-encoding would be the best approach given the large vocabulary size. And the other options seem a bit too simplistic. I'll need to think this through carefully and maybe review some of the NLP techniques we covered in class.

upvoted 0 times

...

Providencia

8 months ago

This looks like a classic natural language processing problem. I think the key is to find a way to represent the words in a numerical format that the neural network can work with. Option B seems like the best approach - using pre-trained word embeddings should give us a good starting point.

upvoted 0 times

...

Stephane

1 year ago

I wonder if they'll accept 'leet speak' as a valid encoding? Gotta keep up with the cool kids, you know? Seriously though, I'm going with the word embeddings. Seems like the most robust approach.

upvoted 0 times

Sabrina

11 months ago

Yeah, word embeddings seem like a reliable option for handling such a large dataset.

upvoted 0 times

...

Kristal

11 months ago

I'm not sure about 'leet speak', but word embeddings sound like a solid choice for this task.

upvoted 0 times

...

Cary

11 months ago

I think word embeddings are the way to go too. They capture the meaning of words better.

upvoted 0 times

...

Emmanuel

1 year ago

Wow, a whole 100,000 unique words? That's a lot of vocab to memorize. I'm just hoping I can find a model that's already been trained on this kind of data. Word embeddings it is!

upvoted 0 times

Kirk

12 months ago

Wow, that's a smart choice! Word embeddings will definitely help with handling such a large vocabulary.

upvoted 0 times

...

Marta

1 year ago

B) Identify word embeddings from a pre-trained model, and use the embeddings in your model.

upvoted 0 times

...

Edwin

1 year ago

A) Create a hot-encoding of words, and feed the encodings into your model.

upvoted 0 times

...

Cecil

1 year ago

A numerical value from 1 to 100,000 for each word? What is this, a game of Scrabble? I'll pass on that one, thanks. Gotta be the word embeddings for me.

upvoted 0 times

Honey

12 months ago

Yeah, assigning numerical values to words sounds like a headache. Word embeddings all the way.

upvoted 0 times

...

Argelia

1 year ago

Sorting words by frequency could be useful, but embeddings are more efficient.

upvoted 0 times

...

Frederica

1 year ago

Hot-encoding seems like a lot of work for that many unique words.

upvoted 0 times

...

Sylvie

1 year ago

User 2: Definitely, hot-encoding sounds like a lot of work.

upvoted 0 times

...

Lilli

1 year ago

I agree, word embeddings are the way to go for this task.

upvoted 0 times

...

Francesco

1 year ago

User 1: I agree, word embeddings are the way to go.

upvoted 0 times

...

Hildred

1 year ago

Hmm, sorting by frequency and using the rankings as encodings? That's an interesting approach, but I'm not sure it'll capture the semantic relationships between the words. I think I'll go with the word embeddings too.

upvoted 0 times

...

Serina

1 year ago

But what about sorting words by frequency? Wouldn't that help with efficiency?

upvoted 0 times

...

Merissa

1 year ago

Ooh, this is a tricky one! I definitely don't want to try hot-encoding all those words - that sounds like a memory nightmare. Guess I'll have to go with option B and use some pre-trained word embeddings. Probably save me a ton of time and headaches.

upvoted 0 times