Displaying 20 results from an estimated 6000 matches similar to: "Add "bin" variable to dataframe"
2009 Nov 16
3
Sum over indexed value
I am sure this is easy but I am not finding a function to do this.
I have two columns in a matrix. The first column contains multiple entries
of numbers from 1 to 100 (i.e. 10 ones, 8 twos etc.). The second column
contains unique numbers. I want to sum the numbers in column two based on
the indexed values in column one (e.g. sum of all values in column two
associated with the value 1 in column
2011 Apr 06
3
Calculated mean value based on another column bin from dataframe.
Dear list,
I have a dataframe with two columns as follows.
> head(dat)
V1 V2
0.15624 0.94567
0.26039 0.66442
0.16629 0.97822
0.23474 0.72079
0.11037 0.83760
0.14969 0.91312
I want to get the mean value of column V2 for each bin of column
V1. I wrote the code as follows. It works, but I think this is not an
elegant way. Any suggestions?
2011 Mar 13
1
Binning data
Hello
I have a large series of data values: effectively, say, the point across the
x-axis where a pitch crosses home plate. What I want to do is find the % of
ground balls at various distances across home plate.
I therefore need to 'bin' the two data sets I have - plate location for
ground balls and plate location for all other outcomes.
Question is how can I set up a series of bins
2008 Oct 20
4
aggregating along bins and bin-quantiles
Dear all,
I would like to aggregate a data frame (consisting of 2 columns - one
for the bins, say factors, and one for the values) along bins and
quantiles within the bins.
I have tried
aggregate(data.frame$values, list(bin = data.frame$bin, Quantile = cut2(data.frame$bin, g = 10)), sum)
but then the quantiles apply to the population as a whole and not the
individual bins. Upon this
2012 Dec 06
2
Best way to coerce numerical data to a predetermined histogram bin?
Folks:
Say I have a set of histogram breaks:
breaks=c(1:10,15)
# With bin ids:
bin_ids=1:(length(breaks)-1)
# and some data (note that some of it falls outside the breaks):
data=runif(min=1,max=20,n=100)
***
What is the MOST EFFICIENT way to "classify" data into the histogram bins
(return the bin_ids) and, say, return NA if the value falls outside of the
bins.
By classify, I mean
2009 Jun 15
2
Bin Category Labels on Axis
Hi,
I'd really appreciate if someone could give me some help or advice about
this - I've tried everything I know and am clueless about how to proceed!
I've written a script to import ASCII data of raster maps, bin them into
categories, calculate the mean values within these bins and plot the two in
a simple graph. I'm running into problems with my x axis, as R cannot add
the bin
2011 Mar 19
2
How to find position in bin-data?
Hi there,
probably there is a very simple solution, but I cannot think of one...
I have a vector with values:
data <- c(1,6,3,4,8,4,2,9)
and I have a vector with bin breaks:
bins <- c(1,3,5,7,9,11)
Now, I'd like to get for each data point the index of the bin-vector
where the value falls in (or equals the lower bin break).
In the example case, I'd like to get:
2011 Aug 21
1
Histogram from frequency data in pre-made bins
Dear R user,
I am using UK census data on travel to work. The authorities have provided a
breakdown in each area by mode (car, bicycle etc.) and distance travelled
(0-2 km, 2-5 km etc.). Therefore, after processing, the data for Sheffield
look like this https://files.one.ubuntu.com/ej2VtVbJTEaelvMRlsocRg :
dshef <- read.table("distmodesheff.csv", sep=",", header=TRUE)
2009 Oct 23
1
cut2 once, bin twice...
Hello,
I'm using the Hmisc cut2 function to bin a set of data. It produces bins
that I like with results like this:
[96,270]:171
[69, 96): 54
[49, 69): 40
[35, 49): 28
[28, 35): 14
[24, 28): 8
(Other) : 48
I would like to take a second set of data, and assign it to bins based on
factors defined by my call to cut2.
Does anyone know how I can do this?
Thank you,
-S
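
One possible approach to the question above, as a minimal sketch rather than a definitive answer: compute the break points once from the first data set, then pass the same breaks when binning the second. The sketch swaps in base R's cut() for Hmisc's cut2, and the vectors first and second are hypothetical stand-ins for the poster's data. (cut2 also appears to accept explicit cut points through its cuts argument, which may do the same job directly.)

```r
# Hypothetical data standing in for the two data sets in the question.
set.seed(1)
first  <- runif(200, min = 20, max = 270)
second <- runif(50,  min = 20, max = 270)

# Compute the break points once, from the first data set only
# (six equal-frequency bins here; the count is an arbitrary choice).
breaks <- unname(quantile(first, probs = seq(0, 1, length.out = 7)))

# Apply the same breaks to both data sets. Intervals are closed on the
# left; include.lowest = TRUE also closes the last interval on the right.
first_bins  <- cut(first,  breaks = breaks, right = FALSE, include.lowest = TRUE)
second_bins <- cut(second, breaks = breaks, right = FALSE, include.lowest = TRUE)

# Values of `second` outside the range of `first` come back as NA.
table(second_bins, useNA = "ifany")
```

Because both calls share one breaks vector, the two factors have identical levels and can be tabulated or compared directly.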
2012 Feb 20
1
chisq.test vs manual calculation - why are different results produced?
Hello,
I am trying to fit gamma, negative exponential and inverse power functions
to a dataset, and then test whether the fit of each curve is good. To do
this I have been advised to calculate predicted values for bins of data (I
have grouped a continuous range of distances into 1km bins), and then apply
a chi-squared test. Example:
> data <- data.frame(distance=c(1,2,3,4,5,6,7),
2008 Feb 18
3
tabulation on dataframe question
I have a data frame with data similar to this:
NameA GrpA NameB GrpB Dist
A Alpha B Alpha 0.2
A Alpha C Beta 0.2
A Alpha D Beta 0.4
B Alpha C Beta 0.2
B Alpha D Beta 0.1
C Beta D Beta 0.3
Dist is a distance measure between two entities. The table displays
all to all distances, but the
2012 May 20
2
Histograms with bin proportions on the y-axis
I have what is probably a simple problem. I have a data file from an MCMC
Bayes estimation problem that is a vector of 500,000 numeric values (just
one variable) ranging from 100,000 to 700,000. I need to display the
histogram of this data in a high quality graphic for a figure in a journal
publication. I want 100 bins so as to display a reasonably complete and
smooth histogram, and I need the
2006 Feb 26
5
Voice Over WiFi
Hello all,
this is not really an * question but it is somehow related, i am trying to
develop a working proposal for cheap and quick telephony services using Voip
running over *. By running a wireless network (over 802.11 a/b/g devices),
i plan to be able to reach customers directly with either tabletop or
handheld 802.11 sip enabled phones.
But the disadvantage is that how do i power each radio
2006 Apr 05
1
Bin by bin histogram comparisons
Hello,
I have created two histograms with:
hist2d(gps2, nbins=200, col = c("white",heat.colors(16)))
Both of them have the same range and the same number of bins.
Now I would like to compare them bin by bin and plot the results.
Could someone please tell me how to do that. I searched the man pages and
the web, but couldn't find anything.
Thank you very much.
Phil
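
A bin-by-bin comparison of this kind could be sketched as below, assuming the two point sets are two-column matrices. The sketch does not use hist2d; it counts points with base R's cut() and table() over breaks shared by both sets, so the two count matrices line up cell for cell. The matrices a and b, the helper count2d, and the choice of 20 bins are all hypothetical.

```r
# Hypothetical stand-ins for the two point sets (columns: x, y).
set.seed(42)
a <- cbind(x = rnorm(500),             y = rnorm(500))
b <- cbind(x = rnorm(500, mean = 0.3), y = rnorm(500))

# Shared breaks spanning both data sets, so the bins are identical.
nbins <- 20
bx <- seq(min(a[, "x"], b[, "x"]), max(a[, "x"], b[, "x"]), length.out = nbins + 1)
by <- seq(min(a[, "y"], b[, "y"]), max(a[, "y"], b[, "y"]), length.out = nbins + 1)

# Count the points of a two-column matrix in each (x, y) bin.
count2d <- function(m) {
  unclass(table(cut(m[, "x"], bx, include.lowest = TRUE),
                cut(m[, "y"], by, include.lowest = TRUE)))
}

# Positive cells: more points from `a`; negative cells: more from `b`.
diff_counts <- count2d(a) - count2d(b)
```

The difference matrix can then be drawn with, for example, image(bx, by, diff_counts), ideally with a diverging colour palette.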
2010 Sep 22
3
Extracting bins and frequencies from frequency table
Dear R users,
I would like to create a frequency table from raw data and then access
the classes/bins and
their respective frequencies separately. Here is the code to create the
frequency tables:
x1 <- c(1,5,1,1,2,2,3,4,5,3,2,3,6,4,3,8)
t1 <- table(x1)
print(t1[1])
It's easy to plot this, but how do I actually access the frequencies
alone and the bins alone?
Basically I am looking to get:
2012 Mar 16
1
Identifying a change in events between bins
Hi there,
First off, despite this being my first post here, I have scanned the R help forums a lot in the past few months to help with some questions, so a big thank you to the community as a whole for being so helpful!
I'm somewhat of an R newbie, and have run up against a problem that I can't seem to solve. If anyone is able to help I would really appreciate it!
I'm looking at a
2017 Nov 08
0
Default for bin limits in hist()
Hello all.
I noticed that the default setting for breaks in the construction of histograms in hist() is "right = TRUE".
I think "right = FALSE" would be more consistent with usual definitions of lower and upper limits for bins in applied statistics, and I suggest that you consider making it the default for hist().
For example, I generated the following frequency distribution for duration of
2006 Nov 07
1
histogram bin width
hi all : i have the data below and then below that, i call the hist
function three times using the Scott method for the widths of the bins.
the bin width is different for the three histograms but I would like it
to always be 0.05 regardless of the data set being histogrammed.
I'm sure there must be a manual way to do this which is fine with me. i
tried breaks=0.05 but it wasn't happy
2006 Nov 14
2
dividing vectors into bins with equal widths
Hi R-users,
I am trying to divide a vector (say X) into equal frequency bins. If one uses the hist()
function, then a histogram is plotted, but with bins of equal widths, and not with bins
having the same number of data points.
I have then tried the histogram() function as follows:
histogram(X, nint=10, breaks=NULL, equal.widths=F)
This works as I want. However, I can't extract which
2010 Apr 13
1
Binning Question
Hi,
I'm trying to setup some complicated binning with statistics and could
use a little help.
I've found the bin2 function from the ash package, but it doesn't do
everything I need. My intention is to copy some of their code and then
modify as needed.
I have a vector of two columns:
head(data)
r1 r2
[1,] 0.03516559 0.03102128
[2,] 0.02162539 0.14847034