Objective Function
How does the objective function look like?
Objective function:
Training loss: measures how well the model fit on training data
- Square loss:
- Logistic loss:
Regularization: How complicated is the model?
- norm (Ridge):
- norm (Lasso):
Objective Function | Linear model? | Loss | Regularization | |
---|---|---|---|---|
Ridge regression | β | square | ||
Lasso regression | β | square | ||
Logistic regression | β | logistic |
Why do we want to contain two component in the objective?
Optimizing training loss encourages predictive models
- Fitting well in training data at least get you close to training data which is hopefully close to the underlying distribution
Optimizing regularization encourages simple models
- Simpler models tends to have smaller variance in future predictions, making prediction stable