Displaying 20 results from an estimated 3000 matches similar to: "How to handle large dataframes?"
2006 Jan 20
2
assign object to list
Dear all
I want to generate a list like this:
a <- data.frame(1:10)
attr(a,'myattribute') <- 'something'
b <- data.frame(11:20)
attr(b,'myattribute') <- 'anything'
mylist <- list(a,b)
Is there a way to place the dataframes into the list giving them the
attribute at the same time?
I don't want to create all the dataframes in my workspace first.
2005 Nov 29
1
Indexing variables within lapply?
Hello
I am using R 2.2.0 with Windows XP.
I've got a five element list object, each element containing two
dataframes of equivalent size.
> str(mylist)
List of 1
$ data1:List of 2
..$ data1a :`data.frame': 77 obs. of 63 variables:
.. ..$ var1 : num [1:77] 0.41375 0.00056 1.43040 1.43528 0.61730 ...
.. ..$ var2 : num [1:77] 1.154 1.686 0.673 0.800 0.760 ...
..
2005 Jul 22
3
problems with submitting an eps-file created in R
Dear all
I've got some problems submitting a manuscript, because I can't manage
creating the favourable eps-file of a graph created in R. The journal's
graphic requirements are as followed:
format: eps
width: max. 6 inches
resolution: min. 1000 dpi
supported fonts: Arial, Courier, Helvetica, Symbol, Times, Charcoal,
Chicago, Geneva, Georgia, Monaco, Zapf, New York
Itried to ways
2009 Aug 26
2
Statistical question about logistic regression simulation
Hi R help list
I'm simulating logistic regression data with a specified odds ratio
(beta) and have a problem/unexpected behaviour that occurs.
The datasets includes a lognormal exposure and diseased and healthy
subjects.
Here is my loop:
ors <- vector()
for(i in 1:200){
# First, I create a vector with a lognormally distributed exposure:
n <- 10000 # number of study subjects
2010 Feb 24
1
Remove missing observations
Hi everyone
I have the following problem: My dataframe has 3 variables: ID, Year and
and an outcome variable. The dataframe contains repeated measurements
because the subjects filled out a questionnaire every year. The time span
covers 2 years.
Now I want to check if there is a significant change in the outcome over
the 2 years with a paired wilcox.test. The problem: Not every subject
2006 Jun 14
4
write data from function into external table
Dear list,
My apologies if a solution / explanation to this already exists on the list,
but it is difficult to assign it to a certain keyword.
test<-c(1:3)
testfct <- function(x) {test[1]<-100}
test
[1] 1 2 3
testfct(1)
[1] 1 2 3
Basically, I would like to write data into an external table that the function
does not know. Why is this not working / what alternatives exist?
Thanks,
2008 Jan 07
4
is there something like or() ?
hi, this may be trivial, but we can't seem to find anything adequate,
(although there is a work around with match() ). We are looking for something
along the lines of
plot(table1[table1$var2==or("a","b","c","d"),"var1"])
would be handy, with the potential or() function leading to what
plot(table1[table1$var2=="a" |
2008 Jun 18
4
Editor for Mac OSX
Dear R-list
I am (forced) to change from Linux to Mac and am now looking for a new editor for R. I would like one that features a split window (console + editor) as well as syntax highlighting. Can anyone help? Especially the split-window feature does not seem to be easily available in the editors desribed on the R-help site, except Emacs, which I am reluctant to start using. I am on a MacBook Air
2004 Oct 20
2
common axis label in multiple plot area
Hello
A very short question: Using multiple plots with par(mfrow=c(3,3)), how
can I get R to indicate one common y- and x-axis label? I tried to use
text() in par, but this is then overwritten when I plot the graphs.
--
Sebastian Leuzinger
Institute of Botany, University of Basel
Schönbeinstr. 6 CH-4056 Basel
Ph. 0041 (0) 61 267 3511
fax 0041 (0) 61 2673504
email: Sebastian.Leuzinger@unibas.ch
2011 Mar 04
2
overleap an iteration within a for-loop when error message produced
Dear R-list member,
I'm using the function pmnorm() (-->library(mnormt)) within a for-loop.
Certain parameter values leads to an error message:
"(In sqrt(diag(S)) : NaNs produced, In sqrt(1/diag(V)) : NaNs
produced, In cov2cor(S) : diag(.) had 0 or NA entries; non-finite result
is doubtful)"
obviously because "NaNs" were produced.
Is it possible to tell R that it
2005 Jul 28
1
stl()
Hello, anyone got an idea on how to use stl() so that the remainder eventually
becomes white noise? i used stl repeatedly but there is autocorrelation in
the remainder that i can't get rid of.
os: linux suse9.3
------------------------------------------------
Sebastian Leuzinger
Institute of Botany, University of Basel
Sch??nbeinstr. 6 CH-4056 Basel
ph 0041 (0) 61 2673511
fax 0041 (0)
2005 Jan 06
1
pdbedit syntax for forcing user to change password
Hi all
Maybe a stupid question, but I'm not able to figure this out from the
manpage nor from the HOWTOs...
How can I force a user to change his password at next logon?
I tried:
# pdbedit -P "user must logon to change password" -C 0 -u username
# pdbedit -u username -P "user must logon to change password" -C 0
# pdbedit -P "user must logon to change password"
2005 Dec 13
2
interruption when pasting code into R under linux
hello,
has anyone come across the following rather mysterious problem:
when pasting large bits of code (100 and more lines) into the R console with
the central mouse button (under linux), only part of the code is pasted, and
the text interrupts somewhere arbitrarily. It does not happen when smaller
bits are pasted subsequently.
I use linux suse 9.3 with the latest version of R
2006 Jun 30
2
Passing arguments to glm()
Hi there
I want to pass arguments (i.e. the response variable and the subset
argument) in a self-made function to glm.
Here is one way I can do this:
f.myglm <- function(y,subfact,subval) {
glm(d.mydata[,y]~d.mydata[,'x1'],family=binomial,subset=d.mydata[,subfact]==subval)
}
> str(d.mydata)
`data.frame': 15806 obs. of 3 variables:
$ y : Factor w/ 2 levels
2007 Jun 22
2
extract index during execution of sapply
Hi there
During execution of sapply I want to extract the number of times the
function given to supply has been executed. I came up with:
mylist <- list(a=3,b=6,c=9)
sapply(mylist,function(x)as.numeric(gsub("[^0-9]","",deparse(substitute(x)))))
This works fine, but looks quite ugly. I'm sure that there's a more
elegant way to do this.
Any suggestion?
Christian
2005 Apr 27
3
libz library missing while installing RMySQL
Hello
Trying to install the MySQL package, I get the following error. The help
archive contains something on this issue but did not help. I work on linux
suse 9.3
Configuration error:
Could not locate the library "libz" required by MySQL.
The library libz however is not on any mirrors I checked.
------------------------------------------------
Sebastian Leuzinger
web
2003 Aug 28
6
Why are the binaries so huge?
Hi list
I just compiled 3.0rc1 on Solaris 9 (UltraSparc IIe) with gcc 3.3. Seems
to work fine but I am surprised by the enormous size of the binaries:
(ub-blade1):/usr/local/samba/sbin # ll
total 232272
-rwxr-xr-x 1 root other 44719632 Aug 28 15:07 nmbd
-rwxr-xr-x 1 root other 89726544 Aug 28 15:07 smbd
-rwxr-xr-x 1 root other 44807356 Aug 28 15:07 swat
-rwxr-xr-x
2005 Mar 23
4
Samba-LDAP TLS problems with inofficial Debian OpenLDAP 2.2 packages
Dear Torsten, dear samba list reader
Three days ago I switched our domain from a NT 4 domaincontroller to
Samba-OpenLDAP, controlled by a Debian Sarge system. I installed the
following inofficial Debian OpenLDAP 2.2 packages (I know these are not
supported, but TLS with OpenSSL is essential to us...):
Package: slapd
Version: 2.2.20-1.hrz.1
Package: libldap2.2
Version: 2.2.20-1.hrz.1
2008 Nov 18
1
Tukey HSD following lme
Hi everyone
I'm using Tukey HSD as post-hoc test following a lme analysis. I'm
measuring hemicelluloses in different species treated with three
different CO2 concentrations (l=low, m=medium, h=high). The whole
experiment is a split-plot design and the Tukey-function from the
package multcomp is suitable for lme-analysis with random factors.
The analysis works fine but I get a non
2005 Aug 15
1
error in predict glm (new levels cause problems)
Dear R-helpers,
I try to perform glm's with negative binomial distributed data.
So I use the MASS library and the commands:
model_1 = glm.nb(response ~ y1 + y2 + ...+ yi, data = data.frame)
and
predict(model_1, newdata = data.frame)
So far, I think everything should be ok.
But when I want to perform a glm with a subset of the data,
I run into an error message as soon as I want to predict