Displaying 20 results from an estimated 10000 matches similar to: "Newbie"
2008 Feb 25
1
Running randomForests on large datasets
Hi,
I am trying to run randomForests on a datasets of size 500000X650 and
R pops up memory allocation error. Are there any better ways to deal
with large datasets in R, for example, Splus had something like
bigData library.
Thank you,
Nagu
2003 Aug 15
6
plot.lm mislabels points with na.exclude (PR#3750)
R 1.7.1 on Windows XP
The "normal Q-Q plot" produced by plot.lm() mislabels points
when the model is fitted using na.action=na.exclude. Example:
x <- 1:50
y <- x + rnorm(50)
y[c(5,10,15)] <- NA # insert some NA's
y[40] <- 50 # add an outlier
plot(lm(y ~ x, na.action=na.omit)) # outlier correctly labeled in all
# four plots
2004 Jan 15
1
Winbind + Acl problem
Hello everybody
When I try to connect (net use) a samba share (Acl protected) from a windows
XP workstation I obtain the following message in the
"/var/log/samba/workstation_name.log" :
. . .
[2004/01/15 11:12:44, 0] smbd/sec_ctx.c:initialise_groups(244)
Unable to initgroups. Error was Operation not permitted
. . .
The network connection to this share run sucessfully but when I
2004 Jan 30
8
MeetMe Video option
Hello All:
Has anyone configured a meetme conference to use video?
I have successfully used video phones to talk through *, but I cannot seem
to get video when those phones dial into a meetme conference.
Is there something else that I need to be doing other than set the "v" flag
on my extension for the meetme app?
Thanks,
Tim
2006 Mar 08
8
how to use the randomForest and rpart function?
Hi all,
I am trying to play around with the randomForest function for
classification. I know its performance is great.
I am currently using the default options.
It has many options.
How do I further tweak the options so that I can make its performance even
better?
What are the options that are mostly used?
Thanks a lot!
M
[[alternative HTML version deleted]]
2006 Mar 16
4
problem for wtd.quantile()
Dear R-users,
I don't know if there is a problem in wtd.quantile (from library "Hmisc"):
--------------------------------
x <- c(1,2,3,4,5)
w <- c(0.5,0.4,0.3,0.2,0.1)
wtd.quantile(x,weights=w)
-------------------------------
The output is:
0% 25% 50% 75% 100%
3.00 3.25 3.50 3.75 4.00
The version of R I am using is: 2.1.0
Best,Jing
2004 Jul 06
3
Code density functions
Hello
I would like to see the algorithm that R uses to generate density functions
for several distributions (i.e. Normal,Weibull, etc). I tried:
>dnorm
function (x, mean = 0, sd = 1, log = FALSE)
.Internal(dnorm(x, mean, sd, log))
<environment: namespace:stats>
How can I see the code used for densities?
Thanks!
2008 Jun 18
2
randomForest outlier
I try to use ?randomForest to find variables that are the most important to
divide my dataset (continuous, categorical variables) in two given groups.
But when I plot the outliers:
plot(outlier(FemMalSex_NAavoid88.rf33, cls=FemMalSex_NAavoid88$Sex),
type="h",col=c("red","green")[as.numeric(FemMalSex_NAavoid88$Sex)])
it seems to me that all my values appear as
2004 Jul 08
2
Getting elements of a matrix by a vector of column indice s
See if the following helps:
> m <- outer(letters[1:5], 1:4, paste, sep="")
> m
[,1] [,2] [,3] [,4]
[1,] "a1" "a2" "a3" "a4"
[2,] "b1" "b2" "b3" "b4"
[3,] "c1" "c2" "c3" "c4"
[4,] "d1" "d2" "d3" "d4"
[5,]
2005 Jun 16
3
Potential minor GUI bug
Is this an interface bug? Using RGUI for windows I run into a "Not
Responding" process (I "smartly" coded an infinite loop, yaiks!), I hit esc
and the interpreter was stopped and I recovered the console functionality
but the caption on the R icon in my windows taskbar (the individual icon
shown for every software currently running in the session) was not updated
so the
2009 Apr 21
1
Polyspline Integration
Hello,
I wrote the function below to integrate polysplines and thought that it
may be useful to others. Please consider this code released under the
GPL2 or later.
Thanks,
Bill
<<integrate.polySpline.R>>
Notice: This e-mail message, together with any attachments, contains
information of Merck & Co., Inc. (One Merck Drive, Whitehouse Station,
New Jersey, USA 08889), and/or
2004 Jul 13
2
e1071 question: what's the definition of performance in t une.* functions?
Basically, the `Detail' section of ?tune says it all:
Details:
As performance measure, the classification error is used for
classification, and the mean squared error for regression. ...
Andy
> From: Tae-Hoon Chung
>
> Hi, all;
>
> Basically, the subject contains the all information I need to know.
> In e1071 library, there are functions to tune parameters.
2004 Jul 26
5
installing problems repeated.tgz linux
Hi,
i try several possibilities adn looking in the archive,
but didn't getting success to install j.lindsey's usefuel "library
repeated" on my linux (suse9.0 with kernel 2.6.7,R.1.9.1)
P.S. Windows, works fine
Many thanks for help
Christian
chris at linux:/space/downs> R CMD INSTALL - l /usr/lib/R/library repeated
WARNING: invalid package '-'
WARNING:
2004 Feb 22
6
help for MLE
Dear Sir/Madam,
I am using R version 1.8.1. I am doing following tast:
First generate 100 Gaussion(3,1) numbers, then write the likelihood function
to estimate the parameters of Gaussian distribution by direct maximizing the
likelihood function.
My likelihood function is:
>fn<-function(x)
>(-50*log((sd(x))^2))-50*log(sqrt(2*pi))-(1/2*((mean(x))^2))*(sum((x-(mean(x))^2))
After I
2003 Aug 20
2
RandomForest
Hello,
When I plot or look at the error rate vector for a random forest
(rf$err.rate) it looks like a descending function except for a few first
points of the vector with error rates values lower(sometimes much lower)
than the general level of error rates for a forest with such number of trees
when the error rates stop descending. Does it mean that there is a tree(s)
(that is built the first in
2009 May 07
4
proposed changes to RSiteSearch
Can someone in R Core please take a look at the attached patches to
RSiteSearch() and its help page? I guess Jon is planning some changes
on his site. Jon: could you elaborate on what the patch does?
Best,
Andy
Notice: This e-mail message, together with any attachments, contains
information of Merck & Co., Inc. (One Merck Drive, Whitehouse Station,
New Jersey, USA 08889), and/or its
2006 Feb 28
3
does svm have a CV to obtain the best "cost" parameter?
Hi all,
I am using the "svm" command in the e1071 package.
Does it have an automatic way of setting the "cost" parameter?
I changed a few values for the "cost" parameter but I hope there is a
systematic way of obtaining the best "cost" value.
I noticed that there is a "cross" (Cross validation) parameter in the "svm"
function.
But I
2004 Jul 10
6
where does R search when source()?
Exactly where does R search for foo.R if I type source("foo.R")?
Only from current working directory (same as getwd()), from all directories
specified by e.g. $PATH? Thanks.
Daehyok Shin
2004 Jul 10
6
where does R search when source()?
Exactly where does R search for foo.R if I type source("foo.R")?
Only from current working directory (same as getwd()), from all directories
specified by e.g. $PATH? Thanks.
Daehyok Shin
2005 Jun 14
3
Calling C from Fortran
I would like to call C routines from Fortran under linux as suggested in
section 5.6 of
the "Writing R extensions" documentation.
I'm familiar with Fortran but not with C.
I understand the example provided in Fortran:
subroutine testit()
double precision normrnd, x
call rndstart()
x = normrnd()
call dblepr("X was", 5, x, 1)
call rndend()
end
but I don't understand