An organization is developing a feature repository and is electing to one-hot encode all categorical feature variables. A data scientist suggests that the categorical feature variables should not be one-hot encoded within the feature repository.
Which of the following explanations justifies this suggestion?
In Spark ML, a transformer is an algorithm that converts one DataFrame into another. Its transform() method takes a DataFrame as input and returns a new DataFrame, typically with one or more columns appended, such as a feature vector or a column of predictions; the input DataFrame itself is not changed. Examples of feature transformers in Spark ML include VectorAssembler, Tokenizer, and Binarizer. StringIndexer and StandardScaler are, strictly speaking, estimators: they must first be fit on data, and the fitted models they produce (StringIndexerModel and StandardScalerModel) are the transformers.
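The contract described above can be illustrated with a minimal plain-Python sketch. This is an analogy only, not Spark code: the class LengthTransformer, the Row and Table aliases, and the column names are all hypothetical, and a list of dictionaries stands in for a DataFrame.

```python
from typing import Dict, List

# Stand-ins for a DataFrame: a table is a list of rows, a row is a dict.
Row = Dict[str, object]
Table = List[Row]


class LengthTransformer:
    """Hypothetical transformer that appends the length of a string column."""

    def __init__(self, input_col: str, output_col: str) -> None:
        self.input_col = input_col
        self.output_col = output_col

    def transform(self, table: Table) -> Table:
        # Build new rows rather than mutating the input, mirroring how a
        # Spark transformer returns a new DataFrame.
        return [
            {**row, self.output_col: len(str(row[self.input_col]))}
            for row in table
        ]


rows = [{"city": "Oslo"}, {"city": "Lima"}, {"city": "Nairobi"}]
result = LengthTransformer("city", "city_len").transform(rows)
```

Here result holds the original column plus the new city_len column, while rows is left untouched. In Spark ML the same pattern would be a call to transform() on a DataFrame.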
Databricks documentation on transformers: Transformers in Spark ML