Learning in Logistic Regression

Learning in Logistic Regression

Logistic regression is an instance of supervised classification in which we know the correct label yy (either 0 or 1) for each observation xx.

The system produces/predicts y^\hat{y}, the estimate for the true yy. We want to learn parameters (ww and bb) that make y^\hat{y} for each training observation as close as possible to the true yy. 💪

This requires two components:

  • loss function: also called cost function, a metric measures the distance between the system output and the gold output
    • The loss function that is commonly used for logistic regression and also for neural networks is cross-entropy loss
  • Optimization algorithm for iteratively updating the weights so as to minimize this loss function