Displaying 20 results from an estimated 4000 matches similar to: "Large Stata file Import in R"
2009 Jun 30
1
Stata file and R Interaction :File Size Problem in Import
Hi
I am using Stata 10 and I need to import a data set in stata 10 to R, I
have saved the dataset in lower versions of Stata as well by using saveold
command in Stata.
My RAM is 4gb and the stata file is 600MB, I am getting an error message
which says :
"Error: cannot allocate vector of size 3.4 Mb
In addition: There were 50 or more warnings (use warnings() to see the
first
2009 Jun 29
1
Stata file Import and Analysis in R
Hi
I have a stata data set (.dta file) of size 600 MB, I need to import it in
R and do a 2SLS multilevel analysis on the data set. I would eb grateful if
help is provided for the first part of how to import this big file from
Stata to R and then how to open the imported Stata file in R?
Kindly help
Thanks in advance
--
Dr.Saurav Pathak
PhD, Univ.of.Florida
Mechanical Engineering
Doctoral
2009 Jul 16
1
PROBIT REGRESSION FOR GROUPED/CLUSTERED DATA
Hello all
I have been working to fix this for weeks now, It should be simple to fix.
Please help
Let me explain what I am doing, I have a data set for 65 countries over a
period of 9 years (2000-2008). Each country has on an average say 2000
interviews, so that the total set has roughly 65*9*2000 data
points/observations (of course there are missing vales as well). Now let me
explain how are the
2009 Jul 11
2
Heckman Selection Model/Inverse Mills Ratio
I have so far used the following command
glm(formula = s ~ age + gender + gemedu + gemhinc + es_gdppc +
imf_pop + estbbo_m, family = binomial(link = "probit"))
My question is
1. How do i discard the non significant selection variables (one out of the
seven variables above is non-significant) and calculate the Inverse Mills
Ratio of the significant variables
2. I need the inverse
2009 Jul 01
2
getOptions("max.print") in R
I am typing the following on the command prompt:
>variab = read.csv(file.choose(), header=T)
>variab
It lists 900,000 ( this is the total number of observations in "variab" )
minus 797124 observations and prompts the following message
[ reached getOption("max.print") -- omitted 797124 entries ]]
Is there a way to see the entire set of data, ie all of 900,000 obs, and
2013 Oct 02
1
foreign package problem with stata 13
foreign package does not support dataset saved at stata 13.
anyone knows any wayt to make it works?
[[alternative HTML version deleted]]
2009 Oct 18
2
How to create MULTILEVELS in a dataset??
Dear R users
I have a data set which has five variables. One depenedent variable y, and 4
Independent variables (education-level, householdincome, countrygdp and
countrygdpsquare). The first two are data corresponding to the individual
and the next two coorespond to the country to which the individual belongs
to. My data set does not make this distinction between individual level and
country
2009 Jul 12
2
Heckman Selection MOdel Help in R
Hi Saurav!
On Sun, Jul 12, 2009 at 6:06 PM, Pathak,
Saurav<s.pathak08 at imperial.ac.uk> wrote:
> I am new to R, I have to do a 2 step Heckman model, my selection equation is
> below which I was successful in running but I am unable to proceed further,
>
>
>
> I have so far used the following command
>
> glm(formula = s ~ age + gender + gemedu + gemhinc + es_gdppc +
2003 Jan 14
4
density plot - beginner's question
Hi,
I am trying to plot densities given on a two dimensional grid. My
data is in the an external file, and is arranged in three columns:
x, y, density
how may i get a plot of this? i would like to get (1) a three
dimensional plot and (2) a color coded two dimensional plot.
I have tried using
image(x, y, density)
but i am asked to put the data in ascending order. i am not sure
how i may
2007 Jan 24
2
modify rectangle color from image
Hi,
I need some suggestion on how I could modify the color on some
rectangle that I have created using "image".
In other words, I have a 5x5 matrix, say, m.
m <- matrix(rnorm(25), nrow=5)
I create a grid of rectangles by:
image(m)
Now I want to change the color of rectangle (3,3) to blue.
I don't know how this could be done, and searching the web has
2005 Feb 28
2
3d scatterplots of more than 1 data set
hi,
i am need to plot two or more sets of data in a 3d scatterplot,
each set with different color.
i tried Rcmdr, and the 3d scatterplot facility, based on rgl. that
is what i need. but i cannot seem to code different sets of data
differently. any help will be very helpful.
i tried scatterplot3d, but it is difficult to get the right angle in
it. i need to be able to rotate the axes, and
2011 Oct 07
1
loess question
Hi All,
I am trying to use loess to smooth a 2D image, and also obtain the
standard error for every pixel. I see that the standard error does not
make sense. For example, running the following:
library(stats)
x <- array(c(1:100), dim=c(100,100))
y <- t(x)
v <- exp(-((x-50)^2+(y-50)^2)/30^2)
s <- v*0.02
g_noise <- rnorm(10000, mean = 0, sd = s)
f <- v + g_noise
f.loess <-
2012 May 03
1
cannot calculate standard estimate with predict on loess
Hi,
For some reason I have been unable to use the predict function when I
desire the standard error to be calculated too. For example, when I try
the following:
l<- loess(d~x+y, span=span, se=TRUE)
p<- predict(l, se=TRUE)
I get the following error message:
Error in vector("double", length) : vector size cannot be NA
In addition: Warning message:
In N * M1 : NAs produced by
2003 Feb 13
2
multi-color plot
hi all,
i am trying to make multi-color plots. that is, i generally use,
plot(x, y, type="n")
text(x, y, labels=class)
here, the vector class denotes the class of each point. there are
usually 3-4 classes of points. how may i display the different
classes in different colors?
thanks for any help.
--
saurav
2002 Dec 19
1
newbie question on dist
hi,
i have just begun using R, so please bear with me.
i am trying to use cmdscale and display the result. i read the data
using read.table(), calculate the proximity matrix using dist() and
the display the result using the cmdscale(). this is very fine.
in addition, i want the display to distinguish between two classes
of records in my data. i have my data records marked as "1" or
2014 Jan 23
2
Stata support in package foreign
As you know, Stata support in 'foreign' was frozen a while back at Stata
version 12. R-core has received a request from a grad student to 'give
top priority' to supporting Stata 13. That is not going to happen, not
least because none of us has access to Stata.
However, according to Stata's documentation both Stata 12 and 13 use
format 115, so foreign::read.dta should be
2009 Dec 16
1
Stata files
I have a client who uses Stata 11.
Can anyone advise me on ways of transferring data from this version of
Stata to R?
Reading the documentation for read.dta in package foreign it seems it
only allows for versions up to Stata 10. I don't know Stata at all but a
bit of poking around on their website led me to believe Stata 11
couldn't write in Stata 10 format. However Stata 11 can
2006 Jan 16
2
importing from Stata
Hi,
I have a new job, and everyone here uses Stata. I won't give up on R,
but I must learn better how to exchange data between the two softwares.
I am now focusing on importing data from Stata to R, and I must confess
that I am a bit disappointed with the read.dta function from the foreign
package because IT typically happens that
(i) I get a big R file (for example, a 15Mb Stata file
2003 Oct 06
1
Re: Use of the Foreign package to import Stata files
Long ago (Sat, 2 Nov 2002), Bill Hart <w.hart at sbcglobal.net> wrote:
> An R newbie here. I am using R 1.6 currently and have
> (successfully, I think) installed the Foreign package.
> Tried to import a data file created with Stata 7.0
> SE. Had minor problems with syntax then R decided
> that my file was not really a Stata file. It rejected
> the file saying
2013 Oct 15
1
cluster option in stata for random intercept model in the R language?
Dear R-list,
I am currently working on a dataset with a colleague who uses stata.
We fit a random intercept model to the data (decisions clustered in participants) and get closely the same results in stata (using xtreg re) and R (using the lme4 or multilevel package).
Now in stata, there is an additional option for the regression to control for clustering; the vce(cluster clustvar) option,