A data scientist is developing a model to predict the outcome of a vote for a national mascot. The choice is between tigers and lions. The full data set represents feedback from individuals representing 17 professions and 12 different locations. The following rank aggregation represents 80% of the data set:

Which of the following is the most likely concern about the model's ability to predict the outcome of the vote?
The aggregated feedback covers only 80% of the data set, mostly from a few professions and locations, so the model has not "seen" the remaining 20% or the underrepresented groups. The most likely concern is therefore out-of-sample performance: how well the model predicts votes from respondents it was not built on.
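The out-of-sample concern can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the votes are synthetic, the preference rule tied to profession is invented, and the "model" is just a per-profession majority vote, not the data scientist's actual model. The point is only that accuracy is measured separately on the 80% used for fitting and on the 20% held out.

```python
import random
from collections import Counter

def make_synthetic_votes(n=1000, seed=0):
    """Generate hypothetical (profession, location, vote) rows. Not real data."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        profession = rng.randrange(17)   # 17 professions, as in the question
        location = rng.randrange(12)     # 12 locations, as in the question
        # Invented rule: lower-numbered professions lean toward tigers.
        p_tiger = 0.7 if profession < 9 else 0.3
        vote = "tigers" if rng.random() < p_tiger else "lions"
        rows.append((profession, location, vote))
    return rows

def fit_majority_by_profession(train):
    """Learn the most common vote per profession, plus an overall fallback."""
    counts = {}
    for profession, _, vote in train:
        counts.setdefault(profession, Counter())[vote] += 1
    fallback = Counter(vote for _, _, vote in train).most_common(1)[0][0]
    model = {p: c.most_common(1)[0][0] for p, c in counts.items()}
    return model, fallback

def accuracy(model, fallback, rows):
    """Fraction of rows whose vote matches the model's prediction."""
    correct = sum(model.get(p, fallback) == vote for p, _, vote in rows)
    return correct / len(rows)

rows = make_synthetic_votes()
random.Random(1).shuffle(rows)

split = int(0.8 * len(rows))             # 80% seen, 20% unseen
train, test = rows[:split], rows[split:]

model, fallback = fit_majority_by_profession(train)
in_sample = accuracy(model, fallback, train)
out_of_sample = accuracy(model, fallback, test)

print(f"in-sample accuracy:     {in_sample:.3f}")
print(f"out-of-sample accuracy: {out_of_sample:.3f}")
```

The random split here is the friendly case. If the held-out 20% came mostly from professions or locations that are rare in the 80%, as the answer suggests, the out-of-sample figure would be the one to watch.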