Displaying 20 results from an estimated 3000 matches similar to: "Constrained Optimized Binning Procedure....implementation help/idea needed."
2010 Jun 16
0
Binning function in R
Is there a function in R that does what the Interactive Grouping node in SAS
Enterprise Miner does?
It bins variables using WOE (weight of evidence) for scoring
models.
I’ve found 3 binning functions in R:
rattle::binning
http://finzi.psych.upenn.edu/R/library/rattle/html/binning.html
ks::binning
http://finzi.psych.upenn.edu/R/library/ks/html/binning.html
sm::binning
2009 Nov 07
0
Binning of integers with hist() function odd results (PR#14047)
On 06-Nov-09 23:30:12, gug at fnal.gov wrote:
> Full_Name: Gerald Guglielmo
> Version: 2.8.1 (2008-12-22)
> OS: OSX Leopard
> Submission from: (NULL) (131.225.103.35)
>
> When I attempt to use the hist() function to bin integers the behavior
> seems
> very odd as the bin boundary seems inconsistent across the various
> bins. For
> some bins the upper boundary
2009 Nov 07
0
Binning of integers with hist() function odd results (PR#14048)
Hi,
Thank you for responding quickly and explaining the behavior. Adding
"include.lowest=TRUE,right=FALSE" and manually including breaks
resolved the simple test case. Next I updated my more complex
data set, which already had manually defined breaks, and that resolved
my issues there too. I have now gone in and updated all my functions
which use hist() so I
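
A minimal sketch of the behaviour discussed in this thread, assuming a small made-up integer vector and breaks placed on the integers themselves (neither is taken from the original report). With the defaults, hist() uses right-closed intervals and folds the lowest value into the first bin, so the first bin covers two integers while the others cover one:

```r
# Hypothetical integer data; not the reporter's data set
x <- c(1, 2, 2, 3, 3, 3, 4)

# Default: intervals are (a, b], except the first, which is [a, b]
default_counts <- hist(x, breaks = 1:5, plot = FALSE)$counts

# Left-closed intervals [a, b); include.lowest = TRUE now closes the last bin
left_closed_counts <- hist(x, breaks = 1:5, right = FALSE,
                           include.lowest = TRUE, plot = FALSE)$counts

print(default_counts)      # 3 3 1 0
print(left_closed_counts)  # 1 2 3 1
```

With right = FALSE each integer gets its own bin, which is the effect described in the reply above.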
2011 Jul 12
1
High density scatter plot with logarithmic binning
How can I perform logarithmic binning in a scatterplot? I could only take the
log of the variables and plot them, but I am sure that is not the way. I
have a very large data set, and would like to plot high-density
scatterplots and code them with different colors for the bins/density.
2012 Apr 04
0
crosstabs and histograms with flexible binning of dates
Hi,
First, thank you to Duncan Mackay for getting me started processing dates
with R. Unfortunately, I need to do a little more than I initially expected.
I have 5K lines of data that look like this:
ID AREA DATE
0001 Center 2010-10-15
0002 Center 2010-01-02
0003 NorthWest 2010-02-05
0004 SouthWest 2010-05-11
I would like to create a script to create crosstabs like
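
One possible approach, sketched on the four sample rows shown above: derive a period label from DATE and cross-tabulate it against AREA. The column name MONTH and the choice of monthly and quarterly periods are assumptions for illustration; the post is cut off before it says which layout is wanted.

```r
# The four sample rows from the post
d <- data.frame(
  ID   = c("0001", "0002", "0003", "0004"),
  AREA = c("Center", "Center", "NorthWest", "SouthWest"),
  DATE = as.Date(c("2010-10-15", "2010-01-02", "2010-02-05", "2010-05-11"))
)

# Bin by calendar month (MONTH is a hypothetical helper column)
d$MONTH <- format(d$DATE, "%Y-%m")
by_month <- table(d$AREA, d$MONTH)

# cut() on Date objects also accepts "week", "month", "quarter", "year"
by_quarter <- table(d$AREA, cut(d$DATE, breaks = "quarter"))

print(by_month)
print(by_quarter)
```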
2009 Nov 06
2
Binning of integers with hist() function odd results (PR#14046)
Full_Name: Gerald Guglielmo
Version: 2.8.1 (2008-12-22)
OS: OSX Leopard
Submission from: (NULL) (131.225.103.35)
When I attempt to use the hist() function to bin integers the behavior seems
very odd as the bin boundary seems inconsistent across the various bins. For
some bins the upper boundary includes the next integer value, while in others it
does not. If I add 0.1 to every value, then the
2010 Apr 13
1
Binning Question
Hi,
I'm trying to setup some complicated binning with statistics and could
use a little help.
I've found the bin2 function from the ash package, but it doesn't do
everything I need. My intention is to copy some of their code and then
modify as needed.
I have a vector of two columns:
head(data)
r1 r2
[1,] 0.03516559 0.03102128
[2,] 0.02162539 0.14847034
2011 Dec 17
1
Binning a 2 column matrix by averages of rows.
Newbie here. Many apologies in advance for using the incorrect lingo.
I'm new to statistics and VERY new to R.
I have a "nx2" matrix , I want to sort the values based on the average of 2
columns and put k lowest (or highest) values in bin1, second k high/low
values in bin2, and so on (bins would be of the same dimensions). I should
also know what the first index (or position) of
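
A rough sketch of one reading of this question, assuming a small made-up matrix and k = 2: rank the rows by their mean, then assign consecutive groups of k rows to bins. The names m, k, ord, and bin are illustrative only.

```r
# Hypothetical n x 2 matrix; row means are 6, 1, 4, 2
m <- cbind(c(5, 1, 3, 2), c(7, 1, 5, 2))
k <- 2                                   # rows per bin (assumed)

ord <- order(rowMeans(m))                # row indices, lowest mean first
bin <- numeric(nrow(m))
bin[ord] <- ceiling(seq_along(ord) / k)  # bin number for each original row

# Original row positions that fall into each bin
rows_per_bin <- split(seq_len(nrow(m)), bin)

print(bin)
print(rows_per_bin)
```

Using order(rowMeans(m), decreasing = TRUE) would put the highest averages in the first bin instead.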
2011 Mar 13
1
Binning data
Hello
I have a large series of data values -- effectively, say, the point across the
x-axis where a pitch crosses home plate. What I want to do is find the % of
ground balls at various distances across home plate.
I therefore need to 'bin' the two data sets I have - plate location for
ground balls and plate location for all other outcomes.
Question is how can I set up a series of bins
2007 Oct 12
2
Automating binning for chisq.test()
The standard chisq.test() and fisher.test() functions, when applied to
two distributions (to determine whether the same underlying
distribution applies to both), require one to pre-bin the
distributions.
Is there a library function (either built-in or in a package) that
acts more like the ks.test() function, in that one can simply pass the
two distributions and have it do the necessary binning as
2011 Jan 21
1
3D Binning
I am trying to do binning on three variables (3d binning). The bin boundaries
are specified by the user separately for each variable. I used the bin2
function in the 'ash' package for 2d binning that involves only two
variables but didn't find any package for similar binning with three variables.
Are there any packages or codes available for 3d binning?? Thank you.
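
As a sketch using only base R, assuming made-up data and user-supplied boundary vectors named bx, by, and bz: cut() each variable with its own breaks, then let table() count the combinations. This yields a 3-d array of counts rather than the bin2-style output the poster mentions.

```r
set.seed(42)
x <- runif(100); y <- runif(100); z <- runif(100)   # hypothetical data

# User-specified boundaries, one vector per variable (assumed values)
bx <- c(0, 0.5, 1)
by <- c(0, 0.25, 0.5, 1)
bz <- c(0, 0.1, 1)

counts <- table(cut(x, bx, include.lowest = TRUE),
                cut(y, by, include.lowest = TRUE),
                cut(z, bz, include.lowest = TRUE))

print(dim(counts))   # 2 3 2
```

Points outside the given boundaries become NA in cut() and are dropped by table() by default.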
2009 Sep 10
1
Complex binning?
Hi,
I need to do some binning which to date I've done just writing
subset commands. I'm now wondering if there are any good packages that
have some good pre-designed functions for multi-variable binning using
say 4 or 5 variables, sometimes binning on 3 or more levels of each
variable, and then supporting some sort of reporting mechanism to tell
me how many data points fell into each
2012 Oct 03
3
Fastest non-overlapping binning mean function out there?
Hi,
I'm looking for a super-duper fast mean/sum binning implementation
available in R, and before implementing z = binnedMeans(x, y) in native
code myself, does anyone know of an existing function/package for
this? I'm sure it already exists. So, given data (x,y) and B bins
bx[1] < bx[2] < ... < bx[B] < bx[B+1], I'd like to calculate the
binned means (or sums)
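
For reference, a plain-R baseline (not a tuned native implementation) might look like the sketch below. It assumes left-closed bins [bx[i], bx[i+1]), since the post is cut off before the interval convention is stated, and binned_means is a hypothetical name.

```r
# Baseline binned means; assumes bins are [bx[i], bx[i+1])
binned_means <- function(x, y, bx) {
  B <- length(bx) - 1
  idx <- findInterval(x, bx)             # 0 below bx[1], B + 1 at or above bx[B + 1]
  keep <- idx >= 1 & idx <= B            # drop points outside all bins
  groups <- split(y[keep], factor(idx[keep], levels = seq_len(B)))
  unname(vapply(groups, mean, numeric(1)))   # NaN for empty bins
}

z <- binned_means(x = c(0.5, 1.5, 1.7, 2.5), y = c(1, 2, 4, 10), bx = 0:3)
print(z)   # 1 3 10
```

Swapping mean for sum in the vapply() call gives binned sums.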
2004 Jun 16
4
non-linear binning? power-law in R
First, thanks to everyone who helped me get to grips with R in (x)emacs
(I get confused easily). Special thanks to Stephen Eglen for continued
support.
My question is about non-linear binning, or density functions over
distributions governed by a power law ...
y ~ mu*x**lambda # In one of its forms
# (can't find Pareto in the online help)
Looking at the following
2007 Nov 09
0
Automated Binning for building predictive models
Hello,
Currently I am using R for building a logistic model using numerical and nominal data as predictors. Before doing the regression, the predictors are grouped. The groups I determine manually by trying to maximize the information value (which is an indicator for the discriminatory power of the variable) under the condition that enough data are within each group (approx. 5%).
Is there a
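
To make the quantity being maximized concrete, here is a small sketch of the usual weight-of-evidence and information-value calculation for a given grouping. The function name information_value, the 0/1 coding of the outcome, and the toy data are assumptions; the automated search over groupings the poster asks about is not shown.

```r
# IV = sum over groups of (pct_good - pct_bad) * log(pct_good / pct_bad)
information_value <- function(group, bad) {
  tab <- table(group, bad)                     # columns "0" (good) and "1" (bad)
  pct_good <- tab[, "0"] / sum(tab[, "0"])
  pct_bad  <- tab[, "1"] / sum(tab[, "1"])
  woe <- log(pct_good / pct_bad)               # weight of evidence per group
  sum((pct_good - pct_bad) * woe)
}

# Toy example: two groups, four observations each
group <- c("a", "a", "a", "a", "b", "b", "b", "b")
bad   <- c(0, 0, 0, 1, 0, 1, 1, 1)
iv <- information_value(group, bad)
print(iv)   # log(3), about 1.0986
```

A group with no good or no bad cases makes the logarithm infinite, which is one reason a minimum share of data per group is commonly required.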
2010 May 13
2
Simple question on binning data
Hello everyone,
I have a data set, and I need to bin my data using a bin width of say g(n).
Would anyone be willing to tell me how to do this in R?
Thanks
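
One way this could be done in base R, assuming a fixed width w stands in for g(n) and using made-up data: build the break points with seq() and pass them to cut().

```r
x <- c(0.2, 1.1, 1.9, 3.4, 4.0)   # hypothetical data
w <- 1                            # bin width; stands in for g(n)

# Breaks aligned to multiples of w that cover the whole range of x
breaks <- seq(floor(min(x) / w) * w, ceiling(max(x) / w) * w, by = w)
bins <- cut(x, breaks, right = FALSE, include.lowest = TRUE)

print(table(bins))
```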
2011 Oct 24
1
binning runtimes
Hello,
Suppose I have the dataset shown below. The amount of observations is too massive to get a nice geom_point and smoother on top. What I would like to do is to bin the data first. The data is indexed by Time (minutes from 1 to 120 i.e. two hours of System benchmarking).
Option 1) group the data by Time i.e. minute 1, minute 2, etc and within each group create bins of N consecutive
2012 Mar 08
1
binning dates by decade for simulated data
I have a simulated matrix of dates that I generated from a probability
function. Each column represents a single iteration.
I would like to bin each run _separately_ by decades and dump them into a
new matrix where each column holds, for a single run, the number of dates
binned into each decade.
I have successfully done this for a single vector of dates, but not for a
matrix:
2008 Aug 19
1
Histogram binning
I am trying to produce frequencies in defined intervals however I can't seem
to figure out how to get R to bin my data the way I want it to.
I have several thousand lengths of fish that I want to be binned as follows:
Ex.
Length Bin
209 200
219 210
431 430
727 720
That is, bins with any length equal to or greater than the lower
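
The example table above is consistent with bins of width 10 labelled by their lower bound. Under that assumption (the post is cut off before the rule is fully stated), a sketch using the four lengths shown:

```r
len <- c(209, 219, 431, 727)      # lengths from the example table
bin <- floor(len / 10) * 10       # lower bound of each width-10 bin

print(bin)          # 200 210 430 720
print(table(bin))   # frequency per bin
```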
2017 Jul 14
2
Efficient Binning
Hi all,
I have a situation where I have 16 bins. I generate a random number and
then want to know which bin number the random number falls in. Right now, I
am using a series of 16 if() else {} statements which get very complicated
with the embedded curly braces. Is there a more efficient (i.e., easier)
way to go about this?
boundaries<-(0:16)/16
rand<-runif(1)
Which bin number (1:16)
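
A possible alternative to the chain of if() statements is base R's findInterval(), sketched here with the boundaries from the post. The seed is added only to make the run repeatable.

```r
boundaries <- (0:16) / 16
set.seed(1)                       # for a repeatable example only
rand <- runif(1)

# Index i such that boundaries[i] <= rand < boundaries[i + 1];
# rightmost.closed = TRUE puts a value of exactly 1 into bin 16
bin <- findInterval(rand, boundaries, rightmost.closed = TRUE)
print(bin)
```

findInterval() is vectorized, so the same call works for many random numbers at once.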