similar to: Simple question on binning data

Displaying 20 results from an estimated 10000 matches similar to: "Simple question on binning data"

2012 Oct 03
3
Fastest non-overlapping binning mean function out there?
Hi, I'm looking for a super-duper fast mean/sum binning implementation available in R, and before implementing z = binnedMeans(x, y) in native code myself, does anyone know of an existing function/package for this? I'm sure it already exists. So, given data (x,y) and B bins bx[1] < bx[2] < ... < bx[B] < bx[B+1], I'd like to calculate the binned means (or sums)
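The binned-means computation described in this question can be sketched in plain R as follows. This is only an illustration of the definition, not the fast native implementation the poster is asking about; the function name `binned_means_sketch`, the left-closed bin convention, and the sample data are all assumptions.

```r
# Hypothetical helper: mean of y within each bin [bx[b], bx[b+1]) of x.
binned_means_sketch <- function(x, y, bx) {
  B <- length(bx) - 1
  # findInterval() gives 0 below bx[1] and B + 1 at or above bx[B + 1]
  idx <- findInterval(x, bx)
  keep <- idx >= 1 & idx <= B
  # Use a factor with all B levels so that empty bins are reported as NA
  bin <- factor(idx[keep], levels = seq_len(B))
  as.vector(tapply(y[keep], bin, mean))
}

# Made-up example data
x  <- c(0.5, 1.5, 1.7, 3.2)
y  <- c(10, 20, 40, 5)
bx <- c(0, 1, 2, 3, 4)
z  <- binned_means_sketch(x, y, bx)
print(z)
```

Replacing `mean` with `sum` in the `tapply()` call gives binned sums, although empty bins would still come back as NA rather than 0.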
2011 Dec 05
1
Problem in while loop
Hi all, I have the following code. When I run it, it never terminates because of the while loop I am using. In general, if you need a loop for which you don't know in advance how many iterations there will be, you can use the `while' statement, and here too I don't know how many iterations there will be. Can someone suggest what's going on? I am using the
2012 Jun 11
3
Simple Binning of Values
Hello I am very new to R.  I have an R task to complete that I have not been able to find a straightforward answer to as of yet.  I have a list of values. I would like to count the number of values that are in one bin, the number that fall in the next bin, etc. For example My input file is:  123 48 342 442 43 232 32 129 191 147 I would like the output to be similar to:  0-100 3 100-200 4
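One way to get counts like those in this question is `cut()` followed by `table()`. The sketch below uses the ten values quoted in the post; the upper limit of 500 and the left-closed bins are assumptions, since the post does not say which bin a value of exactly 100 belongs to.

```r
# Values quoted in the question
x <- c(123, 48, 342, 442, 43, 232, 32, 129, 191, 147)

# Bin edges every 100; the upper limit of 500 is an assumption
breaks <- seq(0, 500, by = 100)

# right = FALSE makes the bins left-closed: [0,100), [100,200), ...
bins <- cut(x, breaks = breaks, right = FALSE)
counts <- table(bins)
print(counts)
```

The first two counts are 3 and 4, matching the output shown in the question.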
2010 Jun 16
0
Binning function in R
Is there some function in R that does what the Interactive Grouping node in SAS Enterprise Miner does? It bins variables using WOE (weight of evidence) for scoring models. I’ve found 3 binning functions in R: rattle::binning http://finzi.psych.upenn.edu/R/library/rattle/html/binning.html ks::binning http://finzi.psych.upenn.edu/R/library/ks/html/binning.html sm::binning
2011 Jan 21
1
3D Binning
I am trying to do binning on three variables (3d binning). The bin boundaries are specified by the user separately for each variable. I used the bin2 function in the 'ash' package for 2d binning that involves only two variables but didn't find any package for similar binning with three variables. Are there any packages or code available for 3d binning? Thank you.
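Three-variable binning with separate user-specified boundaries can be sketched in base R by cutting each variable and cross-tabulating the results. This is an assumption about what the poster needs, not a drop-in equivalent of `ash::bin2`; the data, column names, and boundaries are made up.

```r
set.seed(42)  # made-up data for illustration only
d <- data.frame(u = runif(200), v = runif(200), w = runif(200))

# Hypothetical user-specified boundaries, one vector per variable
bu <- c(0, 0.5, 1)
bv <- c(0, 0.25, 0.5, 1)
bw <- c(0, 0.3, 1)

# Cut each variable on its own boundaries, then cross-tabulate
counts3d <- table(
  u = cut(d$u, bu, include.lowest = TRUE),
  v = cut(d$v, bv, include.lowest = TRUE),
  w = cut(d$w, bw, include.lowest = TRUE)
)
print(dim(counts3d))
```

The result is a three-dimensional array of counts with one cell per bin combination, so `counts3d[1, 2, 1]` is the number of points in the first `u` bin, second `v` bin, and first `w` bin.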
2011 Dec 01
1
strange row numbering after rbind-ing a list
"Not that it really matters, but" Can someone explain how the row numbers get assigned in the following sequence? It looks like something funky happens when rbind() coerces 'bar' into a dataframe. In either sequence of rbind below, once you get past the first two rows, the row numbers count normally. Rgames> (foo<-data.frame(x=5,y=4,r=3)) x y r 1 5 4 3 Rgames>
2009 Sep 10
1
Complex binning?
Hi, I need to do some binning which to date I've done just writing subset commands. I'm now wondering if there are any good packages that have some good pre-designed functions for multi-variable binning using say 4 or 5 variables, sometimes binning on 3 or more levels of each variable, and then supporting some sort of reporting mechanism to tell me how many data points fell into each
2011 Jul 12
1
High density scatter plot with logarithmic binning
How can I perform logarithmic binning in a scatterplot? I could only take the log of the variables and plot them, but I am sure that is not the way. I have a very large data set, and would like to plot high-density scatterplots and color-code them by bin/density.
2008 Jun 16
0
Constrained Optimized Binning Procedure....implementation help/idea needed.
Dear R Helpers, At the moment I'm working on a project to implement an "optimal binning" function. It will be primarily used as a tool for logistic regression, something very similar to http://www2.sas.com/proceedings/forum2008/153-2008.pdf but applied in a different problem space. The problem might be described as finding an optimal binning which will satisfy all of the rules
2012 Apr 04
0
crosstabs and histograms with flexible binning of dates
Hi, First, thank you to Duncan Mackay for getting me started processing dates with R. Unfortunately, I need to do a little more than I initially expected. I have 5K lines of data that look like this:
ID    AREA       DATE
0001  Center     2010-10-15
0002  Center     2010-01-02
0003  NorthWest  2010-02-05
0004  SouthWest  2010-05-11
I would like to create a script to create crosstabs like
2009 Nov 07
0
Binning of integers with hist() function odd results (PR#14047)
On 06-Nov-09 23:30:12, gug at fnal.gov wrote: > Full_Name: Gerald Guglielmo > Version: 2.8.1 (2008-12-22) > OS: OSX Leopard > Submission from: (NULL) (131.225.103.35) > > When I attempt to use the hist() function to bin integers the behavior seems very odd as the bin boundary seems inconsistent across the various bins. For some bins the upper boundary
2009 Nov 07
0
Binning of integers with hist() function odd results (PR#14048)
Hi, Thank you for responding quickly and explaining the behavior. Adding "include.lowest=TRUE,right=FALSE" and manually including breaks resolved the simple test case. Next I updated my more complex data set, which already had manually defined breaks, and that resolved my issues there too. I have now gone in and updated all my functions which use hist() so I
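The fix mentioned in this reply can be illustrated with a small made-up integer vector. The data and the unit-width breaks are assumptions; only the `include.lowest` and `right` arguments come from the post.

```r
# Made-up integer data for illustration
x <- c(1, 2, 2, 3, 3, 3, 4)

# Manual breaks at each integer, extended one past the maximum
breaks <- seq(min(x), max(x) + 1, by = 1)

# right = FALSE: bins are [1,2), [2,3), [3,4); include.lowest = TRUE
# closes the final bin on the right, giving [4,5]
h <- hist(x, breaks = breaks, right = FALSE, include.lowest = TRUE,
          plot = FALSE)
print(h$counts)
```

With these settings every integer falls into the bin that starts at its own value, so the counts line up one-to-one with the distinct integers.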
2009 Nov 06
2
Binning of integers with hist() function odd results (PR#14046)
Full_Name: Gerald Guglielmo Version: 2.8.1 (2008-12-22) OS: OSX Leopard Submission from: (NULL) (131.225.103.35) When I attempt to use the hist() function to bin integers the behavior seems very odd as the bin boundary seems inconsistent across the various bins. For some bins the upper boundary includes the next integer value, while in others it does not. If I add 0.1 to every value, then the
2004 Jun 16
4
non-linear binning? power-law in R
First, thanks to everyone who helped me get to grips with R in (x)emacs (I get confused easily). Special thanks to Stephen Eglen for continued support. My question is about non-linear binning, or density functions over distributions governed by a power law ... y ~ mu*x**lambda # In one of its forms # (can't find Pareto in the online help) Looking at the following
2007 Nov 09
0
Automated Binning for building predictive models
Hello, Currently I am using R for building a logistic model using numerical and nominal data as predictors. Before doing the regression, the predictors are grouped. The groups I determine manually by trying to maximize the information value (which is an indicator for the discriminatory power of the variable) under the condition that enough data are within each group (approx. 5%). Is there a
2011 Mar 13
1
Binning data
Hello I have a large series of data values -- effectively, say, the point across the x-axis where a pitch crosses home plate. What I want to do is find the % of ground balls at various distances across home plate. I therefore need to 'bin' the two data sets I have - plate location for ground balls and plate location for all other outcomes. Question is how can I set up a series of bins
2011 Dec 17
1
Binning a 2 column matrix by averages of rows.
Newbie here. Many apologies in advance for using the incorrect lingo. I'm new to statistics and VERY new to R. I have a "nx2" matrix , I want to sort the values based on the average of 2 columns and put k lowest (or highest) values in bin1, second k high/low values in bin2, and so on (bins would be of the same dimensions). I should also know what the first index (or position) of
2008 Sep 09
1
Binning
Dear List: I have a dataset with over 5000 records and I would like to put the Count in bins based on the ForkLength. e.g.
Forklength  Count
32-34       ?
35-37       ?
38-40       ?
and so on... and lastly I would like to plot (scatterplot) including the SampleDate along the X axis and ForkLength along the Y axis. I recently saw an example similar to
2007 Oct 12
2
Automating binning for chisq.test()
The standard chisq.test() and fisher.test() functions, when applied to two distributions (to determine whether the same underlying distribution applies to both), require one to pre-bin the distributions. Is there a library function (either built-in or in a package) that acts more like the ks.test() function, in that one can simply pass the two distributions and have it do the necessary binning as
2010 Apr 13
1
Binning Question
Hi, I'm trying to set up some complicated binning with statistics and could use a little help. I've found the bin2 function from the ash package, but it doesn't do everything I need. My intention is to copy some of their code and then modify as needed. I have a vector of two columns: head(data) r1 r2 [1,] 0.03516559 0.03102128 [2,] 0.02162539 0.14847034