You have trained a DNN regressor with TensorFlow to predict housing prices using a set of predictive features. Your default precision is tf.float64, and you use a standard TensorFlow estimator;
estimator tf.estimator.DNNRegressor(
feature_columns[YOUR_LIST_OF_FEATURES],
hidden_units-[1024, 512, 256],
dropoutNone)
Your model performs well, but Just before deploying it to production, you discover that your current serving latency is 10ms @ 90 percentile and you currently serve on CPUs. Your production requirements expect a model latency of 8ms @ 90 percentile. You are willing to accept a small decrease in performance in order to reach the latency requirement Therefore your plan is to improve latency while evaluating how much the model's prediction decreases. What should you first try to quickly lower the serving latency?
Hortencia
6 months agoAudria
6 months agoLashandra
6 months agoJacquelyne
7 months agoMattie
7 months agoCasie
7 months agoReuben
7 months agoElden
7 months agoLettie
8 months agoLouvenia
8 months agoFreeman
8 months agoMargart
8 months agoTwanna
8 months agoSimona
8 months agoOllie
8 months agoLauran
8 months agoTheodora
1 year agoZita
1 year agoMargarita
11 months agoKyoko
12 months agoTenesha
12 months agoPamella
1 year agoEmilio
11 months agoRosio
11 months agoMarilynn
12 months agoCatarina
1 year agoAide
12 months agoJunita
12 months agoNu
1 year agoNgoc
1 year agoAliza
1 year agoLoreen
1 year ago