An organization is developing a feature repository and is electing to one-hot encode all categorical feature variables. A data scientist suggests that the categorical feature variables should not be one-hot encoded within the feature repository.
Which of the following explanations justifies this suggestion?
In Spark ML, a transformer is an algorithm that converts one DataFrame into another: it takes a DataFrame as input and returns a new DataFrame as output. The transformation can involve adding new columns, replacing existing ones, or applying feature transformations. VectorAssembler is an example of a feature transformer in Spark MLlib. StringIndexer and StandardScaler are usually listed alongside it, but strictly they are estimators: each must first be fit on a DataFrame, and the fitted StringIndexerModel or StandardScalerModel is the transformer.
Reference: Databricks documentation on transformers, "Transformers in Spark ML".