INTRODUCTION TO FEATURE ENGINEERING
1. Feature Engineering
Definition: Process of creating new input features or modifying existing ones to improve
model performance.
Importance: Better features = better performance.
2. Feature Transformation
Normalization: Scale data to [0,1] range.
Standardization: Convert data to zero mean and unit variance.
Log/Power Transformations: Reduce skewness.
Encoding: Label encoding, one-hot encoding.
Binning: Convert continuous to categorical (e.g., age groups).
3. Feature Subset Selection
Goal: Choose the most informative features to reduce overfitting and improve accuracy.
Techniques:
o Filter Methods: Correlation, chi-square test
o Wrapper Methods: Forward selection, backward elimination
o Embedded Methods: Lasso, Ridge regression