How to one hot encode several categorical variables in R
Example Data
customers <- data.frame(
id=c(10, 20, 30, 40, 50),
gender=factor(c('male', 'female', 'female', 'male', 'female')),
mood=factor(c('happy', 'sad', 'happy', 'sad','happy')),
outcome=c(1, 1, 0, 0, 0))
Hint: You can use
Code
library(data.table)
library(mltools)
customers_1h <- one_hot(as.data.table(customers))
Result
> customers_1h
id gender_female gender_male mood_happy mood_sad outcome
1: 10 0 1 1 0 1
2: 20 1 0 0 1 1
3: 30 1 0 1 0 0
4: 40 0 1 0 1 0
5: 50 1 0 1 0 0
Member discussion