You've migrated a Hadoop job from an on-premises cluster to Dataproc and Good Storage. Your Spark job is a complex analytical workload fiat consists of many shuffling operations, and initial data are parquet toes (on average 200-400 MB size each) You see some degradation in performance after the migration to Dataproc so you'd like to optimize for it. Your organization is very cost-sensitive so you'd Idee to continue using Dataproc on preemptibles (with 2 non-preemptibles workers only) for this workload. What should you do?
Junita
7 months agoFausto
7 months agoEdgar
7 months agoYuonne
8 months agoMohammad
8 months agoJosephine
8 months agoVincenza
8 months agoKris
8 months agoMarkus
8 months agoLatia
8 months agoEileen
8 months agoLing
8 months agoJanessa
8 months ago