similar to: Discretize continous variables....

Displaying 20 results from an estimated 1000 matches similar to: "Discretize continous variables...."

2002 May 07
2
Discretization of numeric attributes
Dear R-helpers: I am interested in discretization methods for numerical attributes, as they are reported in the 'machine learning' community. For example, the work of Fayyad & Irani (IJCAI-93), Kononenko, entropy-based approaches, MDL principle, the C4.5 approach, etc. I am especially interested in those methods that take a factor as goal target into account for discretizing
2010 May 15
3
Discretize factors?
Hi, I'm looking for an easy way to discretize factors in R. I've noticed that the lm function does this automatically with a nice result. If I have group <- c("A", "B","B","C","C","C") and run: lm(result ~ x1 + group) The lm function has split the group into separate binary variables {0,1} before performing the
2005 Jan 17
1
discretization
Hi, there: I have a variable whose distribution is far from normal and its qqnorm is S-shaped, like a logistic plot. My purpose is to discretize it into 2 or 3 classes (basically, a transformation from quantitative to discrete). I am wondering if there is a good way to do that. Thanks, Ed
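One possible approach to the question above, as a minimal sketch: base R's cut() with quantile() breakpoints gives an equal-frequency split into three classes. The sample data, the class labels and the choice of terciles are all assumptions made for illustration; they are not taken from the original thread.

```r
# Hypothetical heavy-tailed sample standing in for the poster's variable
set.seed(1)
x <- rlogis(200)

# Tercile breakpoints: an equal-frequency split into 3 classes (an assumption;
# the poster did not say how the classes should be defined)
breaks <- quantile(x, probs = c(0, 1/3, 2/3, 1))

# include.lowest = TRUE keeps the minimum value inside the first class
classes <- cut(x, breaks = breaks,
               labels = c("low", "mid", "high"),
               include.lowest = TRUE)

table(classes)
```

Fixed cut points chosen from subject knowledge can be passed to breaks in the same way when equal-frequency classes are not what is wanted.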
2008 Jan 14
2
Loading only particular columns from csv file...
Dear all, I'm trying to process HUGE datasets with R. It's very fast, but I would like to optimize it a bit more, by focusing on one column at a time..... say the file is 1GB big and has 100 columns..... In order to prevent "out of memory" problems.... I need to load one column at a time.... the only problem is that read.table doesn't support this feature.... Is there some
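For what it is worth, read.table and its wrappers can skip columns: an entry of "NULL" in the colClasses argument drops that column while reading. The sketch below uses a small temporary file with made-up column names (a, b, c) as a stand-in for the poster's 100-column file.

```r
# Write a tiny hypothetical CSV to a temporary file
tmp <- tempfile(fileext = ".csv")
write.csv(data.frame(a = 1:3,
                     b = letters[1:3],
                     c = c(0.5, 1.5, 2.5)),
          tmp, row.names = FALSE)

# "NULL" (as a string) tells read.csv to skip a column entirely;
# here only the third column is kept and read as numeric
keep <- c("NULL", "NULL", "numeric")
col_c <- read.csv(tmp, colClasses = keep)

str(col_c)
```

The whole file is still scanned, but the skipped columns are never stored, which is what matters for memory use.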
2006 Jul 12
0
Discretize data.frame
Dear useRs, I use dics.ef function from dprep package to discretize continuous variables using intervals of equal frequencies. The dataset to be discretized includes 4 continuous and 2 discrete variables in the following order: Continuous Continuous Continuous Discrete Discrete Continuous The problem emerges when I try to discretize the last continuous variable: library(dprep)
2008 Jan 02
2
Subsetting data frame problem....
Dear R users, I'm a new but already fascinated R user, so please forgive my ignorance. I have a problem; I read most of the help pages but couldn't find the solution. The problem follows.... I have a large data set with 10,000 rows and more than 100 columns... Say something like var1,var2,var2,var4.......var120 ------------------------------------------- 12,12,345,657,67,8.....
2008 May 13
2
Max consecutive increase in sequence
Hi all R helpers, I'm trying to come up with a nice and elegant way of "detecting" consecutive increases/decreases in a sequence of numbers. I'm trying with a combination of the which() and diff() functions but unsuccessfully. For example: sq <- c(1, 2, 3, 4, 4, 4, 5, 6, 5, 4, 3, 2, 1, 1, 1, 1, 1); I'd like to find a way to calculate a) maximum consecutive increase = 3 (from 1
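A possible sketch for this kind of problem combines diff() with rle(), which encodes runs of identical values. It assumes that "consecutive increase" means the number of successive upward steps, an interpretation that is consistent with the poster's stated value of 3 for the example but is not confirmed by the truncated post.

```r
sq <- c(1, 2, 3, 4, 4, 4, 5, 6, 5, 4, 3, 2, 1, 1, 1, 1, 1)

# Step-to-step changes: positive = increase, negative = decrease, 0 = flat
d <- diff(sq)

# Run-length encode the "is this step an increase?" indicator,
# then take the longest run of TRUE
runs_up <- rle(d > 0)
max_up <- max(runs_up$lengths[runs_up$values])

# Same idea for decreases
runs_down <- rle(d < 0)
max_down <- max(runs_down$lengths[runs_down$values])

max_up
max_down
```

If a sequence contains no increase (or no decrease) at all, max() is called on an empty vector and returns -Inf with a warning, so a guard would be needed for that case.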
2009 Apr 27
1
MDL - Fayyad & Irani Discretization
Hey, I'm looking for a function which provides a supervised Fayyad & Irani discretization (MDL). I've already found RWeka which has such a function. But is there anything else? I'd prefer to use sth lighter in my own package. Regards, Peter
2007 Apr 23
0
New version of actuar
UseRs, actuar is a package for Actuarial Science. A rather preliminary version (0.1-3) of the package has been available on CRAN since February 2006. We now announce the immediate availability of version 0.9-2 sporting a large number of new features. Non actuaries behold! There can be some features of interest for you, especially those related to new probability distribution and to the
2006 Sep 08
8
Weighted association map
Could somebody program this kind of plot type to R, if none exists, based on mds or correlation tables or some more suitable method? What do you think about idea? Does it work? None similar or better exists? http://weightedassociationmap.blogspot.com/ Atte Tenkanen University of Turku, Finland
2009 Oct 13
3
cdf
Dear all, I have the cdf of the following power function distribution: F(y) = (y/350)^a, 0 < y < 350, where "a" is some parameter with range a > 0. I want to use it as the argument of the discretize function of the actuar package. So I think I need to define this function to R so that if I entered a=1, I get the following F(y)=(y/350) and if I entered a=4.5, I get the
2012 Nov 08
3
Regrouping dataframe
Hi @ all, I hope for some help of you. I have a dataframe and I want to regroup it. examp4.csv <http://r.789695.n4.nabble.com/file/n4648927/examp4.csv> I need the arguments of "VAL" as table heads and the "TYPE " only in individual expression. The result should look like in the example pic. exp4.png <http://r.789695.n4.nabble.com/file/n4648927/exp4.png> I
2010 Apr 13
2
how to work with big matrices and the ff-package?
Hello everyone, I need to create and work with some big matrices that actually have somewhat over 2 million columns and 117 rows. To do some calculations on such big matrices R just needs too much memory for my PC (4GB installed). So I need a solution to work with large datasets. I'm trying to use the ff-package but I don't think I really understand the whole functionality of the
2011 Dec 15
1
Reordering a numeric variable
I'm running a linear model in R using the car package. I have a variable education, which I have recoded and regrouped to my wishes. However, R seems to place each element of that variable in alphabetical order. When I am running the model, don't I need the model order from lowest to highest to make an inference that a one unit change in one variable produced a one unit change in
2005 Jan 19
2
recoding large number of categories (select in SAS)
Hi, I have data on stomach contents. Possible prey species are in the hundreds, so a list of prey codes has been in used in many labs doing this kind of work. When comes time to do analyses on these data one often wants to regroup prey in broader categories, especially for rare prey. In SAS you can nest a large number of "if-else", or do this more cleanly with "select"
2008 Apr 20
1
Stepwise logistic regression....take too long...
Dear R helpers, I'm trying to build a logistic regression model on a large dataset with 360 factors and 850 observations. All 360 factors are known to be good predictors of the outcome variable, but I have to find the best model with a maximum of 10 factors. I tried to fit the full model and use the stepAIC function to get the best model but unfortunately, the process takes too long to complete (more than 4 hours)... Is it
2008 Oct 27
1
How to see source code for na.omit?
Hi R helpers, I'd like to see the source code for some of the built-in R functions... for example, I would like to see how "na.omit" was implemented. Thanks.
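As a brief sketch of one way to do this: na.omit is an S3 generic, so printing it shows only the dispatch call, and the actual work is done by its methods. The calls below use standard tools from the utils package; choosing the data.frame method is just an example.

```r
# Printing the generic shows only the UseMethod() dispatch
na.omit

# List the methods registered for the generic
methods(na.omit)

# Retrieve one method by generic and class; this also works for
# methods that are registered but not exported from their namespace
src <- getS3method("na.omit", "data.frame")
src
```

getAnywhere("na.omit.data.frame") is an alternative that also reports where the function was found.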
2009 Jun 07
1
One rather theoretical question about fitting algorithm
Hi, What I'm trying to achieve is a very fast algorithm for fitting a logistic regression model. I have to estimate regression coefficients using about 10k observations. Once I have the coefficients estimated, 100 new rows of data become available.... Now I need to re-estimate the coefficients using the 100 newly arrived observations and removing the 100 oldest observations. So, my question is would it be
2007 Nov 25
1
Problem for creating continous streaming and viwing in a webpage
Hi guys, I'm using Icecast and Ezstream to create continuous streaming of some files for my web TV. In the filename field of the Ezstream XML configuration I put a simple m3u playlist. My problems begin when ezstream begins to encode and stream the second file: if I use a system player, like VLC or Totem, the stream is continuous; if I use a web player like Cortado or Itheora