You are developing a recommendation engine for an online clothing store. The historical customer transaction data is stored in BigQuery and Cloud Storage. You need to perform exploratory data analysis (EDA), preprocessing and model training. You plan to rerun these EDA, preprocessing, and training steps as you experiment with different types of algorithms. You want to minimize the cost and development effort of running these steps as you experiment. How should you configure the environment?
Cost-effectiveness:User-managed notebooks in Vertex AI Workbench allow you to leverage pre-configured virtual machines with reasonable resource allocation, keeping costs lower compared to options involving managed notebooks or Dataproc clusters.
Development flexibility:User-managed notebooks offer full control over the environment, allowing you to install additional libraries or dependencies needed for your specific EDA, preprocessing, and model training tasks. This flexibility is crucial while experimenting with different algorithms.
BigQuery integration:The %%bigquery magic commands provide seamless integration with BigQuery within the Jupyter Notebook environment. This enables efficient querying and exploration of customer transaction data stored in BigQuery directly from the notebook, streamlining the workflow.
Other options and why they are not the best fit:
B) Managed notebook:While managed notebooks offer an easier setup, they might have limited customization options, potentially hindering your ability to install specific libraries or tools.
C) Dataproc Hub:Dataproc Hub focuses on running large-scale distributed workloads, and it might be overkill for your scenario involving exploratory analysis and experimentation with different algorithms. Additionally, it could incur higher costs compared to a user-managed notebook.
D) Dataproc cluster with spark-bigquery-connector:Similar to option C, using a Dataproc cluster with the spark-bigquery-connector would be more complex and potentially more expensive than using %%bigquery magic commands within a user-managed notebook for accessing BigQuery data.
https://cloud.google.com/vertex-ai/docs/workbench/instances/bigquery
Tamie
6 days agoDoyle
12 days agoAbraham
18 days agoAlbina
23 days agoVesta
28 days agoSalena
1 month agoAaron
1 month agoNoemi
1 month agoAfton
1 month agoTheresia
1 month agoEvelynn
2 months agoLasandra
1 year agoMicah
1 year agoMalcom
1 year agoDenae
1 year agoAshlee
1 year agoSherell
1 year agoDeeann
1 year agoBernadine
1 year agoCharlene
1 year agoCandra
1 year agoHolley
1 year agoDelisa
1 year agoVirgie
1 year agoRessie
1 year agoEmerson
1 year agoDorcas
1 year agoChristoper
1 year agoGraciela
1 year agoNoel
1 year agoRashad
1 year agoKimi
1 year agoBettyann
1 year ago