thr3ads.net - similar to: "importing timestamp data into R"

Displaying 20 results from an estimated 5000 matches similar to: "importing timestamp data into R"

2007 Jan 04

randomForest and missing data

Does anyone know a reason why, in principle, a call to randomForest cannot accept a data frame with missing predictor values? If each individual tree is built using CART, then it seems like this should be possible. (I understand that one may impute missing values using rfImpute or some other method, but I would like to avoid doing that.) If this functionality were available, then when the trees

rfImpute (for randomForest) crashed

2003 Aug 26

rfImpute (for randomForest) crashed

In trying to execute this line in R (Version 1.7.1 (2003-06-16), under windows XP pro), with the randomForest library (about two weeks old) loaded, the program crashed: bost4rf <- rfImpute(TargetDensity~.,data=bost4rf0) Specifically, an XP dialog box popped up, saying ?R for windows GUI front-end has encountered a problem and needs to close.? That was the dialog saying asking whether I

rfImpute

2007 Aug 10

rfImpute

I am having trouble with the rfImpute function in the randomForest package. Here is a sample... clunk.roughfix<-na.roughfix(clunk) > > clunk.impute<-rfImpute(CONVERT~.,data=clunk) ntree OOB 1 2 300: 26.80% 3.83% 85.37% ntree OOB 1 2 300: 18.56% 5.74% 51.22% Error in randomForest.default(xf, y, ntree = ntree, ..., do.trace = ntree, : NA not

Error while using rfImpute

2009 May 08

Error while using rfImpute

Dear Administrator, I am using linux (suse 10.2). While attempting rfImpute, I am getting the following error message: > Members <- rfImpute(Status ~ ., data = Members) Error in .C("classRF", x = x, xdim = as.integer(c(p, n)), y = as.integer(y), : C symbol name "classRF" not in DLL for package "randomForest". I need the help to sort out above error.

anyone know why package "RandomForest" na.roughfix is so slow??

2010 Jun 30

anyone know why package "RandomForest" na.roughfix is so slow??

Hi all, I am using the package "random forest" for random forest predictions. I like the package. However, I have fairly large data sets, and it can often take *hours* just to go through the "na.roughfix" call, which simply goes through and cleans up any NA values to either the median (numerical data) or the most frequent occurrence (factors). I am going to start

Problems using rfImpute

2008 May 05

Problems using rfImpute

Hello R-user! I am running R 2.7.0 on a Power Book (Tiger). (I am still R and statistics beginner) I tried rfImpute (randomForest) and as far as I understood should it replace NA`s using a proximity matrix: > set.seed(100000) > Subset5Imputed<-rfImpute(Sex~., data=Subset5) ntree OOB 1 2 300: 11.78% 12.36% 11.21% ntree OOB 1 2 300: 12.07% 12.64%

missing value replacement for test data in random forest

2006 Mar 29

missing value replacement for test data in random forest

Hi, In R, how to do missing value replacement for test data in randome forest in the way Breiman decribed. thanks in advance iris

installing problems repeated.tgz linux

2004 Jul 26

installing problems repeated.tgz linux

Hi, i try several possibilities adn looking in the archive, but didn't getting success to install j.lindsey's usefuel "library repeated" on my linux (suse9.0 with kernel 2.6.7,R.1.9.1) P.S. Windows, works fine Many thanks for help Christian chris at linux:/space/downs> R CMD INSTALL - l /usr/lib/R/library repeated WARNING: invalid package '-' WARNING:

RandomForest and Missing Values

2013 Jan 28

RandomForest and Missing Values

Dear All, I would like to use a randomForest algorithm on a dataset. The set is not particularly large/difficult to handle, but it has some missing values (both factors and numerical values). According to what I found https://stat.ethz.ch/pipermail/r-help/2005-September/078880.html https://stat.ethz.ch/pipermail/r-help/2007-January/123117.html the randomForest package has a problem with missing

Adding two files into one and vlookup

2010 Jul 06

Adding two files into one and vlookup

I have two files with dates and prices in each. The number of rows in each of them will differ. How do I create a new file which contains data from both these files? Cbind and merge are not helpful. For cbind because the rows are not the same replication occurs. Also if I have similar data how do I write a vlookup kind of function? I am giving an example below: Say Price1 file contains the

NA in R package randomForest

2012 Mar 26

NA in R package randomForest

I have a question regarding NA in randomForest (in R). I have a dataset which include both numerical and non-numerical variables, and the data includes some NA. I tried to use na.roughfix but then i get an error message "na.roughfix only works for numeric or factor". I also tried rfImpute but this does not work either because I have some NA in my response variable. Does anyone have som

na.action in randomForest --- Summary

2003 Aug 05

na.action in randomForest --- Summary

A few days ago I asked whether there were options other than na.action=na.fail for the R port of Breiman?s randomForest; the function?s help page did not say anything about other options. I have since discovered that a pdf document called ?The randomForest Package? and made available by Andy Liaw (who made the tool available in R---thank you) does discuss an option. It is an implementation of

Imputing data

2011 Dec 02

Imputing data

So I have a very big matrix of about 900 by 400 and there are a couple of NA in the list. I have used the following functions to impute the missing data data(pc) pc.na<-pc pc.roughfix <- na.roughfix(pc.na) pc.narf <- randomForest(pc.na, na.action=na.roughfix) yet it does not replace the NA in the list. Presently I want to replace the NA with maybe the mean of the rows or columns or

synchronisation of time series data using interpolation

2009 Sep 25

synchronisation of time series data using interpolation

Readers, I have data with different time stamps that I wish to plot (for example): data set 1 time(hh:mm:ss),datum 01:00:00,500 01:00:15,600 01:00:30,750 01:00:45,720 01:01:00,700 01:01:15,725 01:01:30,640 01:01:45,710 data set 2 time,datum 01:00:12,20 01:01:01,55 01:01:55,22 The time interval in data set 1 does not change, but the time interval in data set 2 does change, such that for a

help with the usage of "randomForest"

2004 Mar 31

help with the usage of "randomForest"

Dear all, Can anybody give me some hint on the following error msg I got with using randomForest? I have two-class classification problem. The data file "sample" is: ---------------------------------------------------------- udomain.edu udomain.hcs hpclass 1 1.0000 1 not 2 NA 2 not 3 NA 0.8 not 4 NA 0.2 hp 5 NA 0.9 hp ------------------------------------------------------------ The

time series problem: time points don't match

2010 Apr 01

time series problem: time points don't match

Hi, I have a time series problem that I would like some help with if you have the time. I have many data from many sites that look like this: Site.1 date time level temp 2009/10/01 00:01:52.0 2.8797 18.401 2009/10/01 00:16:52.0 2.8769 18.382 2009/10/01 00:31:52.0 2.8708 18.309 2009/10/01 00:46:52.0 2.8728 18.285 2009/10/01 01:01:52.0

plotting multiple animal tracks against Date/Time

2010 Sep 23

plotting multiple animal tracks against Date/Time

Dear list, I would like to create a time series plot in which the paths of several individuals are stacked above each other, with the x-axis being the total observation period of three years ( 1.1.2004 to 31.12.2007) and the y-axis being some defined range[min,max]. My data consist of Date/Time information and the paths of 45 individual as the distance from the location of release. An example

data formatting

2013 Feb 15

data formatting

Dear Eliza, Try this: Lines1<-readLines(textConnection("1911.01.01?????? 7.87 1911.01.02?????? 9.26 1911.01.03?????? 8.06 1911.01.04?????? 8.13 1911.01.05????? 12.90 1911.02.06?????? 5.45 1911.02.07?????? 3.26 1911.03.08?????? 5.70 1911.03.09?????? 9.24 1911.04.10?????? 7.60 1911.05.11????? 14.82 1911.05.12????? 14.10 1911.06.13?????? 7.87 1911.06.14?????? 9.26

comparing random forests and classification trees

2007 Jan 29

comparing random forests and classification trees

Hi, I have done an analysis using 'rpart' to construct a Classification Tree. I am wanting to retain the output in tree form so that it is easily interpretable. However, I am wanting to compare the 'accuracy' of the tree to a Random Forest to estimate how much predictive ability is lost by using one simple tree. My understanding is that the error automatically displayed by the two

[handling] Missing [values in randomForest]

2005 Sep 12

[handling] Missing [values in randomForest]

Hi Jan-Paul, You definitely want to be careful with na.omit in randomForest -- that wipes out any row with even one NA. If NAs are sprawled throughout your dataset, na.omit might end up killing a lot of rows. Here's my usual MO for missing values: 1) "impute" in Hmisc fills in gaps with the mean, median, most common value, etc. 2) rfImpute: fits a forest on the rows available and

similar to: importing timestamp data into R