
Logistic Regression - Maximum Likelihood

Concept

https://youtu.be/vN5cNN2-HWE

https://youtu.be/BfKanl1aSG0

Introduction


Logistic regression is a method for classification: it predicts the class k with the highest Pr(Y=k | X=x).

Q) How do we estimate Pr(Y=k | X=x)?

  • Option 1: Estimate it indirectly using Bayes' rule, e.g., LDA

  • Option 2: Estimate it directly using logistic regression

Logistic Regression Model of Posterior Probability Function

From now on, we assume binary classification: Y ∈ {0, 1}, so the number of classes is K = 2.

Define the function p(x) as

p(x) = Pr(Y = 1 | X = x)

The odds and log-odds are defined as

  • Odds: p(x) / (1 - p(x))

  • Log-odds: log[ p(x) / (1 - p(x)) ]
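For example, if p(x) = 0.8, the odds are 0.8 / 0.2 = 4 and the log-odds are log 4 ≈ 1.386.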

Assume the log-odds is a linear function of the data X:

\log \dfrac{p(x)}{1 - p(x)} = \beta_0 + \beta_1 x_1 + \cdots + \beta_d x_d = {\boldsymbol{\beta}}^T \mathbf{x}
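Exponentiating both sides and solving for p(x) gives the sigmoid form that reappears in the IRLS update below:

p(x) = \dfrac{e^{{\boldsymbol{\beta}}^T \mathbf{x}}}{1 + e^{{\boldsymbol{\beta}}^T \mathbf{x}}}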

Q) How do we estimate the parameters β_i? Use maximum likelihood.

Estimate Posterior Probability Function with Likelihood Function

Given: a training dataset (z_1, y_1), …, (z_N, y_N)

The probability of the observed data is the product, over the samples with y_i = 1, of the probability that Y = 1, multiplied by the product, over the samples with y_i = 0, of the probability that Y = 0:

l(\beta) = \prod\limits_{i:y_i = 1} \Pr(Y = 1 \mid X = z_i) \prod\limits_{i:y_i = 0} \Pr(Y = 0 \mid X = z_i)

This is known as the likelihood function.

Define the function p(z_i) as

p(z_i) = Pr(Y = 1 | X = z_i)

Then the likelihood function can be expressed as

l(\beta) = \prod\limits_{i:y_i = 1} p(z_i) \prod\limits_{i:y_i = 0} \left(1 - p(z_i)\right)
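Since each y_i is either 0 or 1, the two products can be combined into a single product over all samples:

l(\beta) = \prod\limits_{i=1}^{N} p(z_i)^{y_i} \left(1 - p(z_i)\right)^{1 - y_i}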

Maximizing Likelihood

Goal: estimate the parameter β that maximizes the likelihood function.

Maximizing the likelihood is equivalent to maximizing the log-likelihood:

L(\beta) = \log l(\beta)
Substituting the combined product form gives

L(\beta) = \sum\limits_{i=1}^{N} \left[ y_i \log p(z_i) + (1 - y_i) \log\left(1 - p(z_i)\right) \right] = \sum\limits_{i=1}^{N} \left[ y_i {\boldsymbol{\beta}}^T z_i - \log\left(1 + e^{{\boldsymbol{\beta}}^T z_i}\right) \right]
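As a quick check, this log-likelihood can be computed directly. A minimal NumPy sketch (the function name and interface are illustrative, not from the original note):

```python
import numpy as np

def log_likelihood(beta, Z, y):
    """L(beta) = sum_i [ y_i * beta^T z_i - log(1 + exp(beta^T z_i)) ].

    Z : (N, d) matrix whose rows are the samples z_i (with intercept column)
    y : (N,) labels in {0, 1}
    """
    eta = Z @ beta                                    # beta^T z_i for every sample
    return np.sum(y * eta - np.logaddexp(0.0, eta))   # log(1 + e^eta), computed stably
```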

Estimating Parameter β for Maximizing Log-Likelihood

The maximizer has no closed-form solution, so β is estimated iteratively with the Newton-Raphson method:

{\boldsymbol{\beta}}_{t+1} = {\boldsymbol{\beta}}_t - \mathbf{H}^{-1} \nabla L({\boldsymbol{\beta}}_t)
The gradient and Hessian of the log-likelihood are

\nabla L({\boldsymbol{\beta}}) = \sum\limits_{i=1}^{N} z_i \left( y_i - p(z_i) \right) = \mathbf{Z}^T (\mathbf{y} - \mathbf{p})

\mathbf{H} = \nabla^2 L({\boldsymbol{\beta}}) = -\sum\limits_{i=1}^{N} z_i z_i^T \, p(z_i)\left(1 - p(z_i)\right) = -\mathbf{Z}^T \mathbf{W} \mathbf{Z}

Expressing with Matrices

\begin{array}{c} {\boldsymbol{\beta}}_{t+1} = {\boldsymbol{\beta}}_t - \mathbf{H}^{-1} \nabla L({\boldsymbol{\beta}}_t) \\ {\boldsymbol{\beta}}_{t+1} = {\boldsymbol{\beta}}_t + (\mathbf{Z}^T \mathbf{W} \mathbf{Z})^{-1} \mathbf{Z}^T (\mathbf{y} - \mathbf{p}) \\ = (\mathbf{Z}^T \mathbf{W} \mathbf{Z})^{-1} (\mathbf{Z}^T \mathbf{W}) \mathbf{v} \\ \mathbf{v} = \mathbf{Z} {\boldsymbol{\beta}}_t + \mathbf{W}^{-1} (\mathbf{y} - \mathbf{p}) \end{array}
where Z is the matrix whose rows are z_i^T, y and p are the vectors with entries y_i and p(z_i), and W is the diagonal matrix with W_ii = p(z_i)(1 - p(z_i)).

Iterative Reweighted Least Squares (IRLS)

Solve iteratively, recomputing W and p from the current β_t at each step:

\begin{array}{c} {\boldsymbol{\beta}}_{t+1} = (\mathbf{Z}^T \mathbf{W} \mathbf{Z})^{-1} (\mathbf{Z}^T \mathbf{W}) \mathbf{v} \\ \mathbf{v} = \mathbf{Z} {\boldsymbol{\beta}}_t + \mathbf{W}^{-1} (\mathbf{y} - \mathbf{p}) \\ p(z_i) = \dfrac{e^{{\boldsymbol{\beta}}^T z_i}}{1 + e^{{\boldsymbol{\beta}}^T z_i}} \end{array}
Each iteration is a weighted least-squares fit with response v and weights W, which is why the method is called iteratively reweighted least squares.
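A minimal NumPy sketch of this IRLS loop, assuming a design matrix Z with an intercept column and labels y in {0, 1} (the function name and defaults are illustrative):

```python
import numpy as np

def irls_logistic(Z, y, n_iter=25, tol=1e-8):
    """Fit logistic regression by IRLS / Newton-Raphson.

    Z : (N, d) design matrix with an intercept column of ones
    y : (N,) labels in {0, 1}
    """
    beta = np.zeros(Z.shape[1])            # start from beta = 0
    for _ in range(n_iter):
        eta = Z @ beta                     # linear predictor beta^T z_i
        p = 1.0 / (1.0 + np.exp(-eta))     # p(z_i), sigmoid form
        w = p * (1.0 - p)                  # diagonal entries of W
        grad = Z.T @ (y - p)               # gradient Z^T (y - p)
        H = Z.T @ (Z * w[:, None])         # Z^T W Z, without forming W explicitly
        step = np.linalg.solve(H, grad)    # Newton step (Z^T W Z)^{-1} Z^T (y - p)
        beta = beta + step                 # same update as the v formulation above
        if np.max(np.abs(step)) < tol:     # stop once beta has converged
            break
    return beta
```

The code applies the equivalent update beta + (Z^T W Z)^{-1} Z^T (y - p) rather than forming v explicitly; both yield the same iterates.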

Further Reading

Q) How does this compare with logistic regression formulated via the sigmoid function (see Andrew Ng's lecture)?

Note:

  • The two sigmoid forms are the same function: multiplying the numerator and denominator of 1/(1+e^{-x}) by e^x gives e^x/(1+e^x).

  • One formulation maximizes the likelihood, while the other minimizes a loss function; the two are equivalent, as shown below.
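Concretely, the loss minimized in the sigmoid formulation is the negative average log-likelihood, i.e., the cross-entropy:

J({\boldsymbol{\beta}}) = -\frac{1}{N} L({\boldsymbol{\beta}}) = -\frac{1}{N} \sum\limits_{i=1}^{N} \left[ y_i \log p(z_i) + (1 - y_i) \log\left(1 - p(z_i)\right) \right]

Maximizing L(β) and minimizing J(β) therefore give the same estimate of β.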


Example

The original note works through the iteration numerically in three steps (Step 1 through Step 3, shown in figures).
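In place of the step-by-step figures, here is a small synthetic run of the irls_logistic sketch above (the data and true coefficients are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Step 1: generate toy 1-D data from a known model, beta_true = [0.5, 2.0]
x = rng.normal(size=200)
p_true = 1.0 / (1.0 + np.exp(-(0.5 + 2.0 * x)))
y = (rng.random(200) < p_true).astype(float)
Z = np.column_stack([np.ones_like(x), x])  # design matrix with intercept column

# Step 2: run the IRLS iteration
beta_hat = irls_logistic(Z, y)

# Step 3: inspect the estimate; it should be close to [0.5, 2.0]
print(beta_hat)
```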

Next

  1. Multinomial Logistic Regression:

    MATLAB Example

  2. Fitting Data with Logistic Regression:

    MATLAB Example
