similar to: replicating SAS's "proc rank" procedure

Displaying 20 results from an estimated 2000 matches similar to: "replicating SAS's "proc rank" procedure"

2011 Oct 21
1
replicating SAS's "proc rank" procedure
Hi try this function ive written it should be self explantory but let me know if you have any problems. I've only been using R for a few eeeks so apologies if its not the most efficient! rankit2<-function(rankvar,cuts,data,factor) { ranker<-rankvar ranker<-0 range<-c(1:cuts) range2<-range/cuts range3<-quantile(factor,range2) over<-length(factor) for (i in 1:over){
2012 Apr 20
1
Package "demography" - calculating quintiles of survival probabilities
Hi, I am using the package "demography" from Rob Hyndman for the Lee-Carter-Model. It is an amazing powerful tool but I am struggling with one issue: *I want to compute different quintiles for the cumulative survival probability derived from the Lee-Carter-Forecast (e.g. the 50%-quintile, 75%-quintile and 99%-quintile) for the next 10 years. * I am sure the package possess this
2011 May 08
2
help with mysql and R: partitioning by quintile
Hi, I have a mysql table with fields userid,track,frequency e.g u1,1,10 u1,2,100 u1,3,110 u1,4,200 u1,5,120 u1,6,130 . u2,1,23 . . where "frequency" is the number of times a music track is played by a "userid" I need to turn my 'frequency' table into a rating table (it's for a recommender system). So, for each user, I need to categorise the frequency of tracks
2009 Jan 19
1
conditional weighted quintiles
Dear All, I am economist and working on poverty / income inequality. I need descriptive statitics like the ratio of education expentitures between different income quintiles where each household has a different weight. After a bit of google search I found 'Hmisc' and 'quantreg' libraries for weighted quantiles. The problem is that these packages give me only weighted quintiles;
2013 Apr 06
1
Creating quintiles on monthly basis
Hi, I am trying in R to indicate in which quintile a value of a variable is for every month of my data frame in this case based on volatility. For each month I want to know for each stock if it is in the most volatile quintile of if it is in one of the others. So far I have come up with the following function (see below). Unfortunately, the function only works in some cases and often gives the
2011 Nov 18
1
R: writing data from one matrix into another with keeping NA's
Hi, I am looking to build even quintiles for a set of data. I managed to get it done, but I would like to know if there is a more direct way to write the data from my loop output x in the bottom of the code into the "empty" matrix p1, which I filled with NA''s. The way I am doing it at the moment is, more or less adding the matrix x after p1 and then deleting in a second step
2004 Oct 20
1
Drawing multiple line plots
Hi All: Greetings, and best wishes from the festive times here at Kolkata, India -- the time of Durga Puja celebrations. I seek your advice as I try plotting lines for my data. The problem: I have created a dataframe that looks like this (name: myFrame): lowest second third fourth highest significance INAS 0.107 0.115 0.123 0.115 0.166 0.000 MMA 0.091 0.107 0.115
2010 Apr 07
0
Help with manipulation of svytotal output
I have some output from a survey shown below and would like to insert total rows for each category to then be able to use it as a data set for further analysis. Is it possible to do this in R. In this example I would like to insert a row between rows 5 and 6, 10 and 11, 15 and 16, 20 and 21 and put in the sum of the rows 1-5, 6-10, 11-15, 16-20 and to delete row 21 which has NA in it. I would
2013 Mar 26
2
GAM model with interactions between continuous variables and factors
Hi all, I am not sure how to handle interactions with categorical predictors in the GAM models. For example what is the different between these bellow two models. Tests are indicating that they are different but their predictions are essentially the same. Thanks a bunch, > gam.1 <- gam(mortality.under.2~ maternal_age_c+ I(maternal_age_c^2)+ + s(birth_year,by=wealth) + +
2010 Oct 06
1
Does R have function/package works similar to SAS's 'PROC REG'?
Hello, I am working on a variable selection problem and I wonder whether there is some function or package in R works similar to the 'PROC REG' in SAS? Thank you. Some facts about 'PROC REG': PROC REG in SAS first composes a crossproducts matrix. The matrix can be calculated from input data, reformed from an input correlation matrix, or read in from an SSCP data set. For each
2004 Mar 30
4
rank() vs SAS proc rank
SAS proc rank has ties options of high and low that would allow producing ranks of the type found in the sports pages, e.g., rank (c(1,1,2,2,2,2,3)) == 1 1 3 3 3 3 7 Could R support these ties.methods?
2007 Sep 27
1
SAS proc reg stepwise procedure in R
I try to reproduce the SAS proc reg stepwise model selection procedure in R, but the only function I found was "step" which select new variables based on AIC. The SAS procedure I use add a new variable to the model based on F statistics and a pre defined significant level. Then before any new variables are added variables in the model that not meet F statistics at the significant level
2007 Dec 13
2
use ggplot in a function to which a column name is given
Hi everyone, Hi ggplot users in particular, ggplot makes it very easy to plot things given their names when you use it interactively (and therefore can provide the names of the columns). qplot(x,foo,data=A) where A has columns (x,y,foo,bar) for example but I would like to use this from inside a function to which the name of the column is given. I cannot find an elegant way to make this
2007 May 31
3
Problem with Weighted Variance in Hmisc
The function wtd.var(x,w) in Hmisc calculates the weighted variance of x where w are the weights. It appears to me that wtd.var(x,w) = var(x) if all of the weights are equal, but this does not appear to be the case. Can someone point out to me where I am going wrong here? Thanks. Tom La Bone [[alternative HTML version deleted]]
2009 Sep 10
2
Merge data frames but prefer values in one
Hello everyone, My problem is better explained with an example: > x=data.frame(a=1:4,b=1:4,c=rnorm(4)) > x a b c 1 1 1 -0.8821089 2 2 2 -0.7082583 3 3 3 -0.5948835 4 4 4 -1.8571443 > y=data.frame(a=c(1,3),b=3,c=rnorm(2)) > y a b c 1 1 3 -0.273155973 2 3 3 0.009517862 Now I want to merge x and y by columns a and b, hence creating a data.frame with all
2009 Mar 22
3
'require' equivalent for local functions
Hello everyone, I often create some local "libraries" of functions (.R files with only functions in them) that I latter call. In scripts that call a function from such library, I would like to be able to test whether the function is already known in the namespace and, only if it is not, source the library file. I.e. what `require` does for packages, I want to do with my local
2007 Sep 20
1
ggplot and xlim/ylim
Hello everyone, I am (happily) using ggplot2 for all my plotting now and I wondered is there is an easy way to specify xlim and ylim somewhere when using the ggplot syntax, as opposed to the qplot syntax. Eg. qplot(data=mtcars,y=wt, x=qsec,xlim=c(0,30)) <-> ggplot(mtcars, aes(y=wt, x=qsec)) + geom_point() + ??? Indeed the ggplot syntax is in general more flexible and powerful and
2007 Jul 30
2
apply, lapply and data.frame in R 2.5
Hello everyone, A recent (in 2.5 I suspect) change in R is giving me trouble. I want to apply a function (tolower) to all the columns of a data.frame and get a data.frame in return. Currently, on a data.frame, both apply (for arrays) and lapply (for lists) work, but each returns its native class (resp. matrix and list): apply(mydat,2,tolower) # gives a matrix lapply(mydat,tolower) # gives
2005 Feb 09
4
subset
Dear all, I am trying to extract rows from a data.frame based on the rowSums != 0. I want to preserve rownames in the first column in the subset. Does anyone know how to extract all species that don't have rowSums equal to zero? Here it is: # dataset x <- data.frame( species=c("sp.1","sp.2","sp.3","sp.4"), site1=c(2,3,0,0), site2=c(0,0,0,0),
2008 Feb 05
2
Incomplete ouput with sink and split=TRUE
Dear List, I am trying to get R's terminal output to a file and to the terminal at the same time, so that I can walk through some tests and keep a log concurrently. The function 'sink' with the option split=TRUE seems to do just that. It works fine for most output but for objects of class htest, the terminal output is incomplete (the lines are there but empty). Here is an