New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Microsoft DP-700 Exam - Topic 2 Question 7 Discussion

Actual exam question for Microsoft's DP-700 exam
Question #: 7
Topic #: 2
[All DP-700 Questions]

You have an Azure event hub. Each event contains the following fields:

BikepointID

Street

Neighbourhood

Latitude

Longitude

No_Bikes

No_Empty_Docks

You need to ingest the events. The solution must only retain events that have a Neighbourhood value of Chelsea, and then store the retained events in a Fabric lakehouse.

What should you use?

Show Suggested Answer Hide Answer
Suggested Answer: D

To compute the standard deviation of the temperature from the thermal sensor data, you would use the Aggregate transform operator in Eventstream1. The Aggregate operator allows you to apply functions like sum, average, count, and statistical functions like standard deviation across a group of rows or events. This operator is ideal for operations that require summarizing or computing statistics over a dataset, such as calculating the standard deviation.


Contribute your Thoughts:

0/2000 characters
Helene
3 months ago
No way, D is definitely the way to go!
upvoted 0 times
...
Aileen
3 months ago
I’m surprised they want to filter by Neighbourhood like that.
upvoted 0 times
...
Shalon
3 months ago
Wait, why not just use a streaming dataset?
upvoted 0 times
...
Kenda
4 months ago
A KQL queryset would work too, right?
upvoted 0 times
...
Annice
4 months ago
I think D is the best choice for this.
upvoted 0 times
...
Isadora
4 months ago
I feel like the eventstream option might not be enough for the filtering requirement. Spark seems more appropriate for this scenario.
upvoted 0 times
...
Muriel
4 months ago
I practiced a similar question where we had to filter events, and I think a streaming dataset could be the right choice, but I’m not confident.
upvoted 0 times
...
Sage
5 months ago
I'm not entirely sure, but I remember something about KQL being useful for querying data. Would that work here?
upvoted 0 times
...
Lindsey
5 months ago
I think we might need to use Apache Spark Structured Streaming for this, since it can handle real-time data and filtering.
upvoted 0 times
...
Lai
5 months ago
This seems pretty straightforward to me. I'd just use a KQL queryset to filter the events with a Neighbourhood value of Chelsea, and then store the results in the Fabric lakehouse. KQL is a powerful query language for working with Azure data sources, so I think that would be the most efficient approach here.
upvoted 0 times
...
Blondell
5 months ago
Okay, I've got a strategy for this. I'd use Apache Spark Structured Streaming to ingest the events from the Azure event hub. That way, I can apply the Neighbourhood filter in real-time as the data is coming in, and then write the filtered events directly to the Fabric lakehouse. Spark Streaming should give me the performance and scalability I need for this kind of streaming data processing.
upvoted 0 times
...
Jannette
5 months ago
Hmm, I'm a bit unsure about this one. I'm not super familiar with Azure event hubs and Fabric lakehouses. I think I'd need to do some more research on the specific technologies involved before deciding on the best approach. Maybe I'll try to find some examples online to get a better understanding.
upvoted 0 times
...
Cecil
5 months ago
This looks like a classic data ingestion and filtering problem. I'd start by using a streaming dataset to ingest the events from the Azure event hub, and then apply a filter to only retain the events with a Neighbourhood value of Chelsea before storing them in the Fabric lakehouse.
upvoted 0 times
...
Glenn
10 months ago
Haha, I bet the answer is just to use a KQL queryset and then manually filter out the events. That's probably the most 'Azure' solution, right?
upvoted 0 times
Beata
8 months ago
User 4: Just filter out the events with Neighbourhood value of Chelsea.
upvoted 0 times
...
Robt
9 months ago
User 3: It's definitely the most Azure way to do it.
upvoted 0 times
...
Hildred
9 months ago
User 2: Yeah, that sounds like the right choice.
upvoted 0 times
...
Kristofer
9 months ago
User 1: I think the answer is to use a KQL queryset.
upvoted 0 times
...
Rikki
9 months ago
Hayley: Yeah, it's a more efficient way to handle the data ingestion process.
upvoted 0 times
...
Hayley
10 months ago
User 2: That's correct. Using KQL will make it easier to retain only the events you need for the Fabric lakehouse.
upvoted 0 times
...
Kiley
10 months ago
User 1: Actually, the best option is to use a KQL queryset to filter out the events with Neighbourhood value of Chelsea.
upvoted 0 times
...
...
Lillian
10 months ago
I'm leaning towards option C, a streaming dataset. That seems like it would be a good fit for this use case, and it might be a bit simpler to set up than the Spark solution.
upvoted 0 times
...
Dick
11 months ago
Hmm, I'm not sure about that. Wouldn't an eventstream be a more straightforward solution? It's designed for ingesting and processing streaming data like this.
upvoted 0 times
Rhea
10 months ago
I agree, using a KQL queryset would allow us to easily retain events with a Neighbourhood value of Chelsea before storing them in a Fabric lakehouse.
upvoted 0 times
...
Theola
10 months ago
I think a KQL queryset would be more appropriate for filtering events based on specific criteria like Neighbourhood.
upvoted 0 times
...
...
Gracia
11 months ago
I think option D, Apache Spark Structured Streaming, is the way to go. It can handle the streaming data and filter out the events based on the Neighbourhood value.
upvoted 0 times
Dorthy
10 months ago
I'm not sure about the others, but Apache Spark Structured Streaming sounds like the best choice for this task.
upvoted 0 times
...
Jamal
10 months ago
I would go with Apache Spark Structured Streaming as well, it seems like the most suitable option for this scenario.
upvoted 0 times
...
Rozella
10 months ago
I think using a KQL queryset might also work well for filtering out events based on the Neighbourhood value.
upvoted 0 times
...
Amie
10 months ago
I agree, Apache Spark Structured Streaming is a powerful tool for handling streaming data.
upvoted 0 times
...
...
Celestina
11 months ago
I'm not sure, but I think Apache Spark Structured Streaming could also work for this scenario.
upvoted 0 times
...
Nobuko
11 months ago
I agree with Leanora, KQL would be the best option to filter events by Neighbourhood.
upvoted 0 times
...
Leanora
11 months ago
I think we should use a KQL queryset for this.
upvoted 0 times
...

Save Cancel