Hi guys,

I am trying to write my own stochastic gradient ascent for logistic regression in R, but it seems that I am having a convergence problem. Am I doing anything wrong, or is the data just off?

Here is my code in R:

lbw <- read.table("http://www.biostat.jhsph.edu/~ririzarr/Teaching/754/lbw.dat", header=TRUE)
attach(lbw)

lbw[1:2,]
  low age lwt race smoke ptl ht ui ftv bwt
1   0  19 182    2     0   0  0  1   0 2523
2   0  33 155    3     0   0  0  0   3 2551

#----- R implementation of logistic regression: gradient ascent ------
sigmoid <- function(z) {
  1 / (1 + exp(-z))
}

X <- cbind(age, lwt, smoke, ht, ui)
#y <- low

my_logistic <- function(X, y) {
  alpha <- 0.005
  n <- 5
  m <- 189
  max_iters <- 189          # number of obs

  ll <- 0
  X <- cbind(1, X)
  theta <- rep(0, 6)        # intercept and 5 regressors
  #theta <- c(1.39, -0.034, -0.01, 0.64, 1.89, 0.88)  # glm estimates as starting values
  theta_all <- theta

  for (i in 1:max_iters) {
    hx <- sigmoid(X %*% theta)    # matrix product
    ix <- i
    for (j in 1:6) {
      theta[j] <- theta[j] + alpha * ((y - hx)[ix]) * X[ix, j]  # stochastic gradient!
    }
    logl <- sum(y * log(hx) + (1 - y) * log(1 - hx))  # direct multiplication
    ll <- rbind(ll, logl)
    theta_all <- cbind(theta_all, theta)
  }

  par(mfrow=c(4,2))
  plot(na.omit(ll[,1]))
  lines(ll[,1])
  for (j in 1:6) {
    plot(theta_all[j,])
    lines(theta_all[j,])
  }
  #theta_all
  #ll
  cbind(ll, t(theta_all))
}

my_logistic(X, low)

=============

The parameter estimate values jump around after 130+ iterations. It does not converge even when I use the parameter estimates from glm(family=binomial) as starting values.

Help!

--
View this message in context: http://www.nabble.com/Stochastic-Gradient-Ascent-for-logistic-regression-tp23239378p23239378.html
Sent from the R help mailing list archive at Nabble.com.
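For reference, the glm() fit the poster mentions (family = binomial) can be reproduced as follows. This is a sketch, assuming lbw has been read and attach()ed as above; the variable name `fit` is illustrative.

```r
# Baseline maximum-likelihood fit via glm(), using the same five predictors.
# The stochastic gradient ascent estimates should approach these coefficients.
fit <- glm(low ~ age + lwt + smoke + ht + ui, family = binomial)
coef(fit)        # intercept and 5 slope estimates
logLik(fit)      # log-likelihood at the MLE, for comparison with ll
```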
Ravi Varadhan
2009-Apr-26 15:47 UTC
[R] Stochastic Gradient Ascent for logistic regression
Hi Tim,

There are two main problems with your implementation of the stochastic gradient algorithm:

1. You are only implementing one cycle of the algorithm, i.e. it cycles over each data point only once. You need to repeat this several times, until the parameters converge.

2. The stochastic gradient algorithm has very slow convergence. It can be really slow if the predictors are not scaled properly.

I am attaching code that takes care of (1) and (2). It gives results that are in good agreement with the glm() results. Beware that it is still very slow.

This seems like your homework assignment. If so, you should acknowledge that you got help from the R group.

Ravi.
____________________________________________________________________
Ravi Varadhan, Ph.D.
Assistant Professor, Division of Geriatric Medicine and Gerontology
School of Medicine
Johns Hopkins University
Ph. (410) 502-2619
email: rvaradhan at jhmi.edu
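The two fixes can be sketched as follows. This is a hedged reconstruction, not the attached stochGradLogistReg.txt; the function name `sgd_logistic` and the parameter `n_epochs` are illustrative.

```r
# Sketch of the two fixes: (1) many passes (epochs) over the data,
# (2) standardized predictors so the single step size alpha works for all.
sgd_logistic <- function(X, y, alpha = 0.01, n_epochs = 500) {
  X <- scale(X)                # fix (2): center and scale each predictor
  X <- cbind(1, X)             # prepend the intercept column
  theta <- rep(0, ncol(X))
  for (epoch in 1:n_epochs) {  # fix (1): cycle over the data repeatedly
    for (i in 1:nrow(X)) {
      hx_i <- 1 / (1 + exp(-sum(X[i, ] * theta)))
      # per-observation gradient ascent step on the log-likelihood
      theta <- theta + alpha * (y[i] - hx_i) * X[i, ]
    }
  }
  theta   # estimates on the standardized-predictor scale
}

# Usage (after attach(lbw)):
# sgd_logistic(cbind(age, lwt, smoke, ht, ui), low)
```

Note that because the predictors are standardized, the estimates are on the scaled-predictor scale; to compare with glm(), either fit glm() on scale()d predictors as well, or back-transform the coefficients.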
Attachment: stochGradLogistReg.txt
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20090426/2ef49cb3/attachment-0002.txt>