Okay, I see the data imbalance is a key factor here. I'm not sure if RMSE is the right choice, since that's more for regression tasks. I'm leaning towards F1 score or a weighted F-score to handle the imbalance.
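To make the point about imbalance concrete, here is a rough sketch in plain Python. The labels and predictions are made up for illustration, and "weighted F-score" is read here as F-beta (recall weighted beta times as much as precision), which is only one possible reading. The helper names `accuracy` and `f_beta` are my own, not from any library.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f_beta(y_true, y_pred, beta=1.0, positive=1):
    """F-beta for the positive class; beta=1 gives the usual F1 score."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    b2 = beta * beta
    denom = (1 + b2) * tp + b2 * fn + fp
    # With no true positives, false positives, or false negatives, report 0.0
    return 0.0 if denom == 0 else (1 + b2) * tp / denom


# Hypothetical imbalanced data: 95 negatives followed by 5 positives
y_true = [0] * 95 + [1] * 5

# A useless model that always predicts the majority class
always_negative = [0] * 100

# A model that raises 3 false alarms and catches 4 of the 5 positives
some_model = [0] * 92 + [1] * 3 + [1] * 4 + [0] * 1

print(accuracy(y_true, always_negative), f_beta(y_true, always_negative))
print(accuracy(y_true, some_model), f_beta(y_true, some_model))
print(f_beta(y_true, some_model, beta=2.0))
```

On this toy data the always-negative model scores 0.95 accuracy but an F1 of 0, while the second model reaches an F1 of about 0.67. That gap is why an F-score tends to be more informative than accuracy (or a regression metric like RMSE) when the positive class is rare.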
I believe Key, Value, and TimeStamp are included because they are essential for identifying and storing data in HBase. Key Type is not necessary for the KeyValue format.