Displaying 20 results from an estimated 1000 matches similar to: "Rank-based p-value on large dataset"
2008 Jul 01
2
"Invalid object" error in boxplot
Hi,
I'm trying to make a boxplot with the data at the end of the message, and when I
try to execute the command
>boxplot(Diatoms) (or for any other field instead of "Diatoms")
I get the following error message:
Error in oldClass(stats) <- cl : adding class "factor" to an invalid object
Any advice would be much appreciated.
Thanks a lot,
Miriam
Date
2003 Sep 23
1
AW: Rank and extract data from a series
Hi,
>I would like to rank a time-series of data, extract the top ten data items from this series, determine the
>corresponding row numbers for each value in the sample, and take a mean of these *row numbers* (not the data).
>I would like to do this in R, rather than pre-process the data on the UNIX command line if possible, as I need to >calculate other statistics for the series.
2012 Jan 11
1
get the percentage rank of a value based on an empirical data vector
Hi,
I have a vector with values:
x <- rnorm(1000, 5, 2)
and one single value:
y <- 6.2
now I would like to know the percent rank of y based on the 'population'-vector x.
Is there a convenient function that calculates the percent rank of a y for the given vector x?
thanks!
2003 Sep 23
0
Rank and extract data from a series
Here's one way. Suppose your "time series" is in a vector called "x".
top10 <- sort(x, decreasing=TRUE)[1:10]
mean.index <- mean(which(x %in% top10))
HTH,
Andy
> -----Original Message-----
> From: James Brown [mailto:jdb33 at hermes.cam.ac.uk]
> Sent: Tuesday, September 23, 2003 7:51 AM
> To: r-help at stat.math.ethz.ch
> Subject: [R] Rank and
2012 Dec 11
1
Rprof causing R to crash
I'm trying to use Rprof() to identify bottlenecks and speed up a particullary
slow section of code which reads in a portion of a tif file and compares
each of the values to values of predictors used for model fitting. I've
written up an example that anyone can run. Generally temp would be a
section of a tif read into a data.frame and used later for other processing.
The first portion
2011 May 05
1
memory and bootstrapping
hello,
the following questions will without doubt reveal some fundamental
ignorance, but hopefully you can still help me out.
I'd like to bootstrap a coefficient gained on the basis of the
coefficients in a logistic regression model (the mean differences in
the predicted probabilities between two groups, where each predict()
operation uses as the newdata-argument a dataframe of equal size as
2007 Mar 22
5
netapp/maildir/dovecot performance
We are seeing some poor performance recently that is focused around
users with large mailboxes (100,000 message /INBOX, 80,000 message
subfolders, etc).
The performance problem manifests as very high system% utilization -
basically iowait for NFS.
There are two imap servers with plenty of horsepower/memory/etc. They
are connected to a 3050c cluster via gig-e. Here are the mount
options:
2005 Sep 15
4
Error in vector("double", length) : vector size specified is too large....VLDs
I have what R seems to consider a very large dataset, a 12MB text file of
lat,long,and height values, 130,000 rows to be exact.
Here's what I get:
Thomas Colson
North Carolina State University
Department of Forestry and Environmental Resources
(919) 673 8023
tom_colson at ncsu.edu
Calendar:
www4.ncsu.edu/~tpcolson
2006 Jul 18
13
RANDOM
I am pretty much new at this ROR game and had what I think to be a
simple question.
I have a set of Sponsors that I would like to be able to select one at
random and display in the my html. I have already set up the DB,
scaffolded, set index controller and all is working smoothly.
I know that I can display them all by doing
<% for sponsor in @sponsors %>
<%= sponsor.name %>
2006 Nov 20
5
Parallal Building?
I''m trying to index ~130,000 documents [soon to grow to about 500,000
documents] and I''m wondering if its possible to combine ferret databases
or in some other way split up the building process.
Normally, indexing 130k documents wouldn''t be that painful except that
there are different types of links between these documents and they are
not absolute (so for example
2004 Mar 30
4
rank() vs SAS proc rank
SAS proc rank has ties options of high and low that would allow
producing ranks of the type found in the sports pages, e.g.,
rank (c(1,1,2,2,2,2,3)) == 1 1 3 3 3 3 7
Could R support these ties.methods?
2001 Jul 26
2
Installing Smart Suite 97
I get to 53 percent of files copied and I get the message that
c:\lotuscomponent/lttsn32.dll is in use by another application and
terminates install. Help me?
BTW I get a message about contacting Microsoft support about mytab11.c.
Well this is a Microsoft clean installation. Took me hours to the point
where the installer will load and run.
-----= Posted via Newsfeeds.Com, Uncensored Usenet
2012 Feb 22
2
rank with uniform count for each rank
Hello,
What is the best way to get ranks for a vector of values, limit the range
of rank values and create equal count in each group? I call this uniform
ranking...uniform count/number in each group.
Here is an example using three groups:
Say I have values:
x = c(3, 2, -3, 1, 0, 5, 10, 30, -1, 4)
names(x) = letters[1:10]
> x
a b c d e f g h i j
3 2 -3 1 0 5 10 30 -1 4
I
2010 Sep 27
2
Sample size estimation for non-inferiority log-rank and Wilcoxon rank-sum tests
Hello Everyone,
I'm trying to conduct a couple of power analyses and was hoping someone might be able to help. I want to estimate the sample size that would be necessary to adequately power a couple of non-inferiority tests. The first would be a log-rank test and the second would be a Wilcoxon rank-sum test. I want to be able to determine the sample size that would be necessary to test for a
2020 Aug 05
5
Re: More parallelism in VDDK driver (was: Re: CFME-5.11.7.3 Perf. Tests)
Nir, BTW what are you using for performance testing?
As far as I can tell it's not possible to make qemu-img convert use
multi-conn when connecting to the source (which is going to be a
problem if we want to use this stuff in virt-v2v).
Instead I've hacked up a copy of this program from libnbd:
https://github.com/libguestfs/libnbd/blob/master/examples/threaded-reads-and-writes.c
so
2020 Aug 05
1
Re: More parallelism in VDDK driver (was: Re: CFME-5.11.7.3 Perf. Tests)
On Wed, Aug 5, 2020 at 3:47 PM Richard W.M. Jones <rjones@redhat.com> wrote:
>
>
> Here are some results anyway. The command I'm using is:
>
> $ ./nbdkit -r -U - vddk \
> libdir=/path/to/vmware-vix-disklib-distrib \
> user=root password='***' \
> server='***' thumbprint=aa:bb:cc:... \
> vm=moref=3 \
>
2001 Dec 02
1
GLM with ranks as response variable
Dear R's,
I have a survey where customers rank a set of 5 packages for a product,
so the response variable looks like
a d b c
a c d b
d b a c
Predictors variables are 4 socio-economic parameters. I have modelled the FIRST
choice of each subject as a multinomial model, similar to the housing example in
MASS ch7.3, , but I would prefer to use the whole rank-set instead.
Can someone give me a
2007 Oct 22
4
Bar plot with error bars
Apologies if this has been asked before. I am having trouble
understanding the R mailing list never mind R!
I am relatively new to R having migrated from Minitab and SPSS. I
have managed to do some more complicated statistics such as
hierarchical partitioning of variance on an 80,000 record dataset but
have to admit that drawing a simple bar plot I could do by hand is
proving extremely
2013 Jun 08
1
splitting a string column into multiple columns faster
Hello!
I have a column in my data frame that I have to split: I have to distill
the numbers from the text. Below is my example and my solution.
x<-data.frame(x=c("aaa1_bbb1_ccc3","aaa2_bbb3_ccc2","aaa3_bbb2_ccc1"))
x
library(stringr)
out<-as.data.frame(str_split_fixed(x$x,"aaa",2))
out2<-as.data.frame(str_split_fixed(out$V2,"_bbb",2))
2007 Aug 11
1
R Weka and cobweb
Hi,
I never use cobweb before and I'm quite new to this. I have a couple of questions around the cobweb implementation in R Weka. If you could supply answer or insight, I would really appreciate.
1. From Fisher's paper in 1987, it seems that Cobweb only deals with nominal data. In R Weka cobweb, is it allowed to accommodate real/continuous value?
2. My understanding is that Cobweb