An organization is developing a feature repository and is electing to one-hot encode all categorical feature variables. A data scientist suggests that the categorical feature variables should not be one-hot encoded within the feature repository.
Which of the following explanations justifies this suggestion?
In Spark ML, a transformer is an algorithm that transforms one DataFrame into another. Its transform() method takes a DataFrame as input and returns a new DataFrame, typically with one or more columns appended, such as an assembled feature vector or predictions. Feature transformers in Spark MLlib include VectorAssembler, Tokenizer, and Binarizer. StringIndexer and StandardScaler are estimators rather than transformers: they must first be fit on data, and the fitted models they return (StringIndexerModel and StandardScalerModel) are the transformers.
Databricks documentation on transformers: Transformers in Spark ML
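The transformer contract described above (DataFrame in, new DataFrame out, usually with a column added) can be illustrated with a minimal plain-Python sketch. This is an analogy only and does not use Spark: rows are modeled as a list of dicts, and the class name SimpleAssembler and the sample columns are hypothetical. In PySpark, the comparable call would be VectorAssembler(inputCols=[...], outputCol="features").transform(df).

```python
from typing import Any, Dict, List

# A "DataFrame" is modeled here as a list of dict rows (an assumption
# made for illustration; this is not the Spark API).
Row = Dict[str, Any]


class SimpleAssembler:
    """Hypothetical stand-in for a transformer such as VectorAssembler."""

    def __init__(self, input_cols: List[str], output_col: str) -> None:
        self.input_cols = input_cols
        self.output_col = output_col

    def transform(self, rows: List[Row]) -> List[Row]:
        # Build new rows rather than mutating the input, mirroring how
        # transform() returns a new DataFrame with an appended column.
        return [
            {**row, self.output_col: [float(row[c]) for c in self.input_cols]}
            for row in rows
        ]


# Made-up sample data for the sketch.
data = [{"age": 31, "income": 52000}, {"age": 45, "income": 61000}]

assembler = SimpleAssembler(input_cols=["age", "income"], output_col="features")
assembled = assembler.transform(data)
```

The input rows are left untouched and the output carries every original column plus the new one, which is the behavior that makes transformers safe to chain in a pipeline.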