Displaying 20 results from an estimated 10000 matches similar to: "Simple question on binning data"
2012 Oct 03
3
Fastest non-overlapping binning mean function out there?
Hi,
I'm looking for a super-duper fast mean/sum binning implementation
available in R, and before implementing z = binnedMeans(x, y) in native
code myself, does anyone know of an existing function/package for
this? I'm sure it already exists. So, given data (x,y) and B bins
bx[1] < bx[2] < ... < bx[B] < bx[B+1], I'd like to calculate the
binned means (or sums)
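A minimal sketch of the binned means described in this post, in plain R rather than native code. The function name binned_means and the half-open bin convention [bx[b], bx[b+1]) are assumptions for illustration; the post does not say how bin edges should be treated.

```r
# Hypothetical helper: mean of y within each bin of x.
# Assumes bin b covers [bx[b], bx[b + 1]); points outside the breaks are dropped.
binned_means <- function(x, y, bx) {
  B <- length(bx) - 1
  idx <- findInterval(x, bx)           # 0 below bx[1], B + 1 at or above bx[B + 1]
  keep <- idx >= 1 & idx <= B
  f <- factor(idx[keep], levels = seq_len(B))
  as.vector(tapply(y[keep], f, mean))  # NA for empty bins
}

x <- c(0.5, 1.5, 1.7, 2.5)
y <- c(1, 2, 4, 10)
bx <- c(0, 1, 2, 3)
z <- binned_means(x, y, bx)
```

Replacing mean with sum in the tapply() call gives binned sums instead. This is unlikely to be the fastest option, but it can serve as a reference to check a native implementation against.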
2011 Dec 05
1
Problem in while loop
Hi all,
I have the following code,
When I run the code, it never terminates, and the cause is the while loop I
am using. In general, if you need a loop for which you don't know in
advance how many iterations there will be, you can use the `while'
statement, and here too I don't know how many iterations there
will be. Can someone suggest what's going on?
I am using the
2012 Jun 11
3
Simple Binning of Values
Hello
I am very new to R. I have an R task to complete that I have not been able to find a straightforward answer to as of yet. I have a list of values. I would like to count the number of values that are in one bin, the number that fall in the next bin, etc.
For example
My input file is: 123 48 342 442 43 232 32 129 191 147
I would like the output to be similar to:
0-100 3
100-200 4
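A minimal sketch of this kind of counting with cut() and table(), using the values from the post. The bin width of 100, the upper limit of 500, and the left-closed intervals [0,100), [100,200), ... are assumptions; the post does not say which bin a boundary value belongs to.

```r
values <- c(123, 48, 342, 442, 43, 232, 32, 129, 191, 147)

# Breaks every 100 units; right = FALSE makes each bin left-closed, e.g. [0,100).
breaks <- seq(0, 500, by = 100)
bins <- cut(values, breaks = breaks, right = FALSE)

# Number of values falling in each bin.
counts <- table(bins)
print(counts)
```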
2010 Jun 16
0
Binning function in R
Is there some function in R that does what the Interactive Grouping node in SAS
Enterprise Miner does?
It bins variables using WOE (weight of evidence) for scoring
models.
I've found 3 binning functions in R:
rattle::binning
http://finzi.psych.upenn.edu/R/library/rattle/html/binning.html
ks::binning
http://finzi.psych.upenn.edu/R/library/ks/html/binning.html
sm::binning
2011 Jan 21
1
3D Binning
I am trying to do binning on three variables (3d binning). The bin boundaries
are specified by the user separately for each variable. I used the bin2
function in the 'ash' package for 2d binning that involves only two
variables but didn't find any package for similar binning with three variables.
Is there any package or code available for 3d binning? Thank you.
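One way to sketch 3d binning with user-specified boundaries in base R is to cut() each variable separately and cross-tabulate. The data and the break vectors below are made up for illustration, and this is not a replacement for what bin2 in 'ash' does internally.

```r
set.seed(1)
n <- 200
x <- runif(n)
y <- runif(n)
z <- runif(n)

# Hypothetical user-specified bin boundaries, one vector per variable.
x_breaks <- c(0, 0.5, 1)
y_breaks <- c(0, 0.25, 0.5, 1)
z_breaks <- c(0, 0.1, 1)

# Three-way table of counts: one cell per (x bin, y bin, z bin).
counts <- table(
  x = cut(x, x_breaks, include.lowest = TRUE),
  y = cut(y, y_breaks, include.lowest = TRUE),
  z = cut(z, z_breaks, include.lowest = TRUE)
)
dim(counts)
```

as.data.frame(counts) turns the array into one row per cell if a flat report is easier to work with.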
2011 Dec 01
1
strange row numbering after rbind-ing a list
"Not that it really matters, but"
Can someone explain how the row numbers get assigned in the following
sequence? It looks like something funky happens when rbind() coerces
'bar' into a dataframe.
In either sequence of rbind below, once you get past the first two rows,
the row numbers count normally.
Rgames> (foo<-data.frame(x=5,y=4,r=3))
x y r
1 5 4 3
Rgames>
2009 Sep 10
1
Complex binning?
Hi,
I need to do some binning which to date I've done just by writing
subset commands. I'm now wondering if there are any good packages that
have some good pre-designed functions for multi-variable binning using
say 4 or 5 variables, sometimes binning on 3 or more levels of each
variable, and then supporting some sort of reporting mechanism to tell
me how many data points fell into each
2011 Jul 12
1
High density scatter plot with logarithmic binning
How can I perform logarithmic binning in a scatterplot? I could only take the
log of the variables and plot them, but I am sure that is not the way. I
have a very large data set and want to plot high density
scatterplots and code them with different colors for the bins/density.
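A rough sketch of one possible approach: build bin edges that are equally spaced on a log scale, then count points per 2d cell. The simulated data, the number of bins, and the helper name log_breaks are all assumptions for illustration, and both variables must be strictly positive.

```r
set.seed(1)
x <- rlnorm(1000)
y <- rlnorm(1000)
n_bins <- 10

# Hypothetical helper: n + 1 break points equally spaced in log10 units.
log_breaks <- function(v, n) {
  10^seq(log10(min(v)), log10(max(v)), length.out = n + 1)
}
x_breaks <- log_breaks(x, n_bins)
y_breaks <- log_breaks(y, n_bins)

# all.inside = TRUE clamps indices to 1..n_bins, so the extreme points are kept
# even if rounding puts them marginally outside the outer breaks.
ix <- findInterval(x, x_breaks, all.inside = TRUE)
iy <- findInterval(y, y_breaks, all.inside = TRUE)

# Count of points in each (x bin, y bin) cell.
counts <- table(factor(ix, levels = seq_len(n_bins)),
                factor(iy, levels = seq_len(n_bins)))

# One option for drawing the density as colors:
# image(log10(x_breaks), log10(y_breaks), log1p(unclass(counts)))
```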
2008 Jun 16
0
Constrained Optimized Binning Procedure....implementation help/idea needed.
Dear R Helpers,
At the moment I'm working on a project to implement an "optimal binning"
function. It will be used primarily as a tool for logistic regression,
something very similar to
http://www2.sas.com/proceedings/forum2008/153-2008.pdf but applied in a
different problem space.
The problem might be described as finding an optimal binning which will satisfy
all of the rules
2012 Apr 04
0
crosstabs and histograms with flexible binning of dates
Hi,
First, thank you to Duncan Mackay for getting me started processing dates
with R. Unfortunately, I need to do a little more than I initially expected.
I have 5K lines of data that look like this:
ID AREA DATE
0001 Center 2010-10-15
0002 Center 2010-01-02
0003 NorthWest 2010-02-05
0004 SouthWest 2010-05-11
I would like to create a script to create crosstabs like
2009 Nov 07
0
Binning of integers with hist() function odd results (P (PR#14047)
On 06-Nov-09 23:30:12, gug at fnal.gov wrote:
> Full_Name: Gerald Guglielmo
> Version: 2.8.1 (2008-12-22)
> OS: OSX Leopard
> Submission from: (NULL) (131.225.103.35)
>
> When I attempt to use the hist() function to bin integers the behavior
> seems
> very odd as the bin boundary seems inconsistent across the various
> bins. For
> some bins the upper boundary
2009 Nov 07
0
Binning of integers with hist() function odd results (P (PR#14048)
Hi,
Thank you for responding quickly and explaining the behavior. Adding
"include.lowest=TRUE,right=FALSE" and manually including breaks
resolved the simple test case. Next I updated my more complex
data set, which already had manually defined breaks, and that resolved
my issues there too. I have now gone in and updated all my functions
which use hist() so I
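A small illustration of the behavior discussed in this thread, with made-up integer data. By default hist() uses right-closed bins (a, b] and closes the first bin on the left, so the first bin can absorb two integer values; the options quoted in the post make the bins left-closed instead.

```r
# Integer data; the values are made up for illustration.
x <- c(1, 2, 2, 3, 3, 3, 4, 5)

# Default (right = TRUE): bins are (a, b], and include.lowest = TRUE closes the
# first bin on the left, so the first bin [1, 2] holds both the 1 and the 2s.
default_counts <- hist(x, breaks = 1:5, plot = FALSE)$counts

# Left-closed bins [a, b) with one extra break so each integer gets its own bin;
# include.lowest = TRUE closes the last bin on the right, [5, 6].
fixed_counts <- hist(x, breaks = 1:6, right = FALSE,
                     include.lowest = TRUE, plot = FALSE)$counts

print(default_counts)
print(fixed_counts)
```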
2009 Nov 06
2
Binning of integers with hist() function odd results (PR#14046)
Full_Name: Gerald Guglielmo
Version: 2.8.1 (2008-12-22)
OS: OSX Leopard
Submission from: (NULL) (131.225.103.35)
When I attempt to use the hist() function to bin integers the behavior seems
very odd as the bin boundary seems inconsistent across the various bins. For
some bins the upper boundary includes the next integer value, while in others it
does not. If I add 0.1 to every value, then the
2004 Jun 16
4
non-linear binning? power-law in R
First, thanks to everyone who helped me get to grips with R in (x)emacs
(I get confused easily). Special thanks to Stephen Eglen for continued
support.
My question is about non-linear binning, or density functions over
distributions governed by a power law ...
y ~ mu*x**lambda # In one of its forms
# (can't find Pareto in the online help)
Looking at the following
2007 Nov 09
0
Automated Binning for building predictive models
Hello,
Currently I am using R for building a logistic model using numerical and nominal data as predictors. Before doing the regression, the predictors are grouped. The groups I determine manually by trying to maximize the information value (which is an indicator for the discriminatory power of the variable) under the condition that enough data are within each group (approx. 5%).
Is there a
2011 Mar 13
1
Binning data
Hello
I have a large series of data values -- effectively, say, the point across the
x-axis where a pitch crosses home plate. What I want to do is find the % of
ground balls at various distances across home plate.
I therefore need to 'bin' the two data sets I have - plate location for
ground balls and plate location for all other outcomes.
Question is how can I set up a series of bins
2011 Dec 17
1
Binning a 2 column matrix by averages of rows.
Newbie here. Many apologies in advance for using the incorrect lingo.
I'm new to statistics and VERY new to R.
I have an "nx2" matrix. I want to sort the values based on the average of the 2
columns and put the k lowest (or highest) values in bin1, the second k high/low
values in bin2, and so on (bins would be of the same dimensions). I should
also know what the first index (or position) of
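A minimal sketch of the grouping described here, assuming n is a multiple of k and that "position" means the original row index. The matrix m and the value of k are made up for illustration.

```r
set.seed(42)
m <- matrix(runif(20), ncol = 2)   # a 10 x 2 example matrix
k <- 5                             # rows per bin

# Row indices ordered from lowest to highest row average.
ord <- order(rowMeans(m))

# Assign consecutive runs of k ordered rows to bin 1, bin 2, ...
bin_id <- ceiling(seq_along(ord) / k)

# Original row indices in each bin; m[bins[[1]], ] holds the k lowest-average rows.
bins <- split(ord, bin_id)
```

Using order(rowMeans(m), decreasing = TRUE) instead puts the k highest averages in the first bin.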
2008 Sep 09
1
Binning
Dear List:
I have a dataset with over 5000 records and I would like to put the Count in bins
based on the ForkLength. e.g.
Forklength Count
32-34 ?
35-37 ?
38-40 ?
and so on...
and lastly I would like to plot (scatterplot) including the SampleDate
along the X axis and ForkLength along the Y axis. I recently saw an
example similar to
2007 Oct 12
2
Automating binning for chisq.test()
The standard chisq.test() and fisher.test() functions, when applied to
two distributions (to determine whether the same underlying
distribution applies to both) require one to pre-bin the
distributions.
Is there a library function (either built-in or in a package) that
acts more like the ks.test() function, in that one can simply pass the
two distributions and have it do the necessary binning as
2010 Apr 13
1
Binning Question
Hi,
I'm trying to set up some complicated binning with statistics and could
use a little help.
I've found the bin2 function from the ash package, but it doesn't do
everything I need. My intention is to copy some of their code and then
modify as needed.
I have a vector of two columns:
head(data)
r1 r2
[1,] 0.03516559 0.03102128
[2,] 0.02162539 0.14847034