An organization is developing a feature repository and is electing to one-hot encode all categorical feature variables. A data scientist suggests that the categorical feature variables should not be one-hot encoded within the feature repository.
Which of the following explanations justifies this suggestion?
In Spark ML, a transformer is an algorithm that can transform one DataFrame into another DataFrame. It takes a DataFrame as input and produces a new DataFrame as output. This transformation can involve adding new columns, modifying existing ones, or applying feature transformations. Examples of transformers in Spark MLlib include feature transformers like StringIndexer, VectorAssembler, and StandardScaler.
Databricks documentation on transformers: Transformers in Spark ML
Marshall
3 months agoRicarda
3 months agoDorthy
3 months agoAlex
4 months agoAsuncion
4 months agoTamar
4 months agoHerman
4 months agoYuonne
4 months agoLelia
5 months agoJesusita
5 months agoFlo
5 months agoLeeann
5 months agoAlida
5 months agoEmily
9 months agoFelice
9 months agoGalen
8 months agoLayla
8 months agoTony
8 months agoHermila
8 months agoIndia
10 months agoJettie
10 months agoStephanie
8 months agoMaurine
9 months agoGaston
9 months agoGlenna
10 months agoChaya
9 months agoMitsue
9 months agoFernanda
9 months agoJoaquin
11 months agoShawna
11 months agoArlette
11 months ago