Displaying 20 results from an estimated 3000 matches similar to: "cut2 once, bin twice..."
2011 Oct 11
1
warning with cut2 function
Dear r user,
please find my attached sample of the dataset i? am using to create a crosstable and eventually plot a histogram from the output.
I am using? the cut2 function to create bins, about 7 of them using the code after reading the data:
cluster <- cut2(cross_val$value, g=7)
I get the warning:
Warning message:
In min(xx[xx > upper]) : no non-missing arguments to min; returning Inf
2008 Oct 20
4
aggregating along bins and bin-quantiles
Dear all,
I would like to aggregate a data frame (consisting of 2 columns - one
for the bins, say factors, and one for the values) along bins and
quantiles within the bins.
I have tried
aggregate(data.frame$values, list(bin = data.frame
$bin,Quantile=cut2(data.frame$bin,g=10)),sum)
but then the quantiles apply to the population as a whole and not the
individual bins. Upon this
2012 Oct 17
2
cut2 error
To R users,
I am trying to use cut2 function from the 'Hmisc' library. However, when I
try and run the function on the following variable, I get an error message
(displayed below). I suspect it is because of the NA but I have no idea
how to address the error. Many thanks to any insights.
structure(list(var1 = c(97, 97, 98, 98, 97, 99, 97,
98, 99, 98, 99, 98, 98, 97, 97, 98, 99, 98,
2010 Mar 08
1
Help with Hmisc, cut2, split and quantile
Hello,
I have a set of data with two columns: "Target" and "Actual". A
http://n4.nabble.com/file/n1584647/Sample_table.txt Sample_table.txt is
attached but the data looks like this:
Actual Target
-0.125 0.016124906
0.135 0.120799865
... ...
... ...
I want to be able to break the data into tables based on quantiles in the
"Target" column. I can see (using
2013 Feb 01
2
Nested loop and output help
Hello Everyone,
My name is Thomas and I have been using R for one week. I recently found
your site and have been able to search the archives of posts. This has
given me some great information that has allowed me to craft an initial
design to an inquiry I would like to make into the breakdown of McNemar's
test. I have read an intro to R manual and the posting guides and hope I am
not violating
2009 May 20
1
sem with categorical data
I am trying to run a confirmatory factor analysis using the SEM package. My
data are ordinal. I have read
http://socserv.mcmaster.ca/jfox/Misc/sem/SEM-paper.pdf.
When I apply the hetcor function, I receive the following error:
Error in checkmvArgs(lower = lower, upper = upper, mean = mean, corr = corr,
:
at least one element of 'lower' is larger than 'upper'
Example:
2004 Jul 06
3
Improving effeciency - better table()?
Hi,
I've been running some simulations for a while and the performance of R
has been great. However, I've recently changed the code to perform a sort
of chi-square goodness-of-fit test. To get the observed values for each
cell I've been using table() - specifically I've been using cut2 from
Hmisc to divide up the range into a specified number of cells and then
using
2012 Oct 17
2
loop of quartile groups
Greetings R users,
My goal is to generate quartile groups of each variable in my data set. I
would like each experiment to have its designated group added as a
subsequent column. I can accomplish this individually with the following
code:
brks <- with(data_variables,
cut2(var2, g=4))
#I don't want the actual numbers, I need a numbered group
data$test1=factor(brks,
2007 Dec 13
3
what does cut(data, breaks=n) actually do?
Hello,
I'm trying to bin a quantity into 2-3 bins for calculating entropy and
mutual information. One of the approaches I'm exploring is the cut()
function, which is what the mutualInfo function in binDist uses. When it's
called in the format cut(data, breaks=n), it somehow splits the data into n
distinct bins. Can anyone tell me how cut() decides where to cut?
Thanks,
Melissa
2009 Sep 22
1
cut and re-factor data
Hello R-users,
I have a data frame with a factor of ages in 5 year increments, and various
count data for each age group. I only have this summary information in R at
the moment.
I want to create a new factor that aggregates the age factors if the
existing factors have insufficient counts. Then I can use aggregate to
build a new data set.
I figured out I can get the cut values I want using cut2
2010 Aug 06
1
Grouping clusters from dendrograms
Hi,
I have produced a dendrogram of categorical data in R using the hclust
function, although the input was a dissimilarity matrix produced in SAS, as
I have defined my own distances.
The dendrogram is fine and I can view and use this. However, I was wondering
if there is a method by which I can find out the optimal place to place
groups, rather than relying on my visual analysis? I don't
2001 Oct 02
4
plot of Bernoulli data
I have some Bernoulli data something like this:
x<-sort(runif(100,1,20))
p<-pnorm(x,10,3)
y<-as.numeric(runif(x)<p)
plot(x,y)
lines(x,p)
This plot is not very satisfactory because the ogive does not visually
fit the (0,1) points very well, and also because the points tend to fall
on top of one another. The second problem can be eliminated by adding
vertical jitter. However I was
2009 Sep 30
1
rcs fits in design package
Hi all,
I have a vector of proportions (post_op_prw) such that
>summary(amb$post_op_prw)
Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
0.0000 0.0000 0.0000 0.3985 0.9134 0.9962 1.0000
> summary(cut2(amb$post_op_prw,0.0001))
[0.0000,0.0001) [0.0001,0.9962] NA's
1904 1672 1
2008 Apr 17
1
Error in Design package: dataset not found for options(datadist)
Hi,
Design isn't strictly an R base package, but maybe someone can explain
the following.
When lrm is called within a function, it can't find the dataset dd:
> library(Design)
> age <- rnorm(30, 50, 10)
> cholesterol <- rnorm(30, 200, 25)
> ch <- cut2(cholesterol, g=5, levels.mean=TRUE)
> fit <- function(ch, age)
+ {
+ d <- data.frame(ch, age)
+
2007 Nov 24
1
Hmisc: can not reproduce figure 4 of Statistical Tables and Plots using S and LATEX
Dear R-users:
I can not reproduce figure 4 of *Statistical Tables and Plots using S and
LATEX* by Prof. Frank Harrell with the following code:
rm(list=ls())
library(Hmisc)
getHdata(pbc)
attach(pbc)
age.groups <- cut2(age, c(45,60))
g <- function(y) apply(y, 2, quantile, c(.25,.5,.75))
y <- with(pbc, cbind(Chol=chol,Bili=bili))
# You can give new column names that are not legal S names
2008 Sep 19
0
Error message in lmer
Dear list
I try to run a bootstrap with lmer.
I often, but not always, get the error message:
Error in objective(.par, ...) :
Leading minor of order 6 in downdated X'X is not positive definite
(with the number (here 6) varying)
In R-archives I came across some threads that treated this problem,
nevertheless they refer to lmer when using it with family = "binomial", so
the
2012 Dec 06
2
Best way to coerce numerical data to a predetermined histogram bin?
Folks:
Say I have a set of histogram breaks:
breaks=c(1:10,15)
# With bin ids:
bin_ids=1:(length(breaks)-1)
# and some data (note that some of it falls outside the breaks:
data=runif(min=1,max=20,n=100)
***
What is the MOST EFFICIENT way to "classify" data into the histogram bins
(return the bin_ids) and, say, return NA if the value falls outside of the
bins.
By classify, I mean
2009 Jun 15
2
Bin Category Labels on Axis
Hi,
I'd really appreciate if someone could give me some help or advice about
this - I've tried everything I know and am clueless about how to proceed!
I've written a script to import ASCII data of raster maps, bin them into
categories, calculate the mean values within these bins and plot the two in
a simple graph. I'm running into problems with my x axis, as R cannot add
the bin
2012 Feb 22
2
rank with uniform count for each rank
Hello,
What is the best way to get ranks for a vector of values, limit the range
of rank values and create equal count in each group? I call this uniform
ranking...uniform count/number in each group.
Here is an example using three groups:
Say I have values:
x = c(3, 2, -3, 1, 0, 5, 10, 30, -1, 4)
names(x) = letters[1:10]
> x
a b c d e f g h i j
3 2 -3 1 0 5 10 30 -1 4
I
2011 Mar 19
2
How to find position in bin-data?
Hi there,
probably there is a very simple solution, but I cannot think of one...
I have a vector with values:
data <- c(1,6,3,4,8,4,2,9)
and I have a vector with bin breaks:
bins <- c(1,3,5,7,9,11)
Now, I'd like to get for each data point the index of the bin-vector
where the value falls in (or equals the lower bin break).
In the example case, I'd like to get: