Displaying 20 results from an estimated 1000 matches similar to: "Re: Re: Find Closest 5 Cases?"
2004 Apr 04
1
How to improve this code?
Hi all,
I've got some functioning code that I've literally taken hours to
write. My 'R' coding is getting better...it used to take days :)
I know I've done a poor job of optimizing the code. In addition, I'm
missing an important step and don't know where to put it.
So, three questions:
1) I'd like the resulting output to be sorted on distance (ascending)
and
2004 Feb 13
3
Calculate Closest 5 Cases?
I've only begun investigating R as a substitute for SPSS.
I have a need to identify for each CASE the closest (or most similar) 5
other CASES (not including itself as it is automatically the closest). I
have a fairly large matrix (50000 cases by 50 vars). In SPSS, I can use Correlate > Distances to generate a matrix of similarity, but only on a small sample. The entire matrix can not
2004 Mar 31
4
Removing leading and trailing spaces (string manipulation)
Hi all,
I'm running the following code to generate 40 different jpegs based on the resulting data. I'd like the file names to be 'Cluster1.jpeg', however the code write filenames like 'Cluster 1 .jpeg'.
How can I get rid of the unwanted spaces? I've looked at ?format and it doesn't seem to work - at least in this context.
###################
ClusCount <- 40
2011 Dec 02
2
Imputing data
So I have a very big matrix of about 900 by 400 and there are a couple of NA
in the list. I have used the following functions to impute the missing data
data(pc)
pc.na<-pc
pc.roughfix <- na.roughfix(pc.na)
pc.narf <- randomForest(pc.na, na.action=na.roughfix)
yet it does not replace the NA in the list. Presently I want to replace the
NA with maybe the mean of the rows or columns or
2012 Aug 11
1
Imputing data below detection limit
Hello,
I'm trying to impute data below detection limit (with multiple detection
limits)
so i need just a method or a code for imputation and then extract the
complete dataset to do the analyses.
Is there any package which could do that simply as i'm a beginner in R
Thank you
--
View this message in context:
2004 Aug 31
2
Sparse Matrices in R
I have data in i,j,r format,
where r is the value in location A[i,j] for some imaginary matrix A.
I need to build this matrix A, but given the sizes of i and j, I believe that using a sparse format would be most adequate.
Hopefully this will allow me to perform some basic matrix manipulation such as multiplication, addition, rowsums, transpositions, subsetting etc etc.
Is there any way
2004 Sep 01
3
Imputing missing values
Dear all,
Apologies for this beginner's question. I have a
variable Price, which is associated with factors
Season and Crop, each of which have several levels.
The Price variable contains missing values (NA), which
I want to substitute by the mean of the remaining
(non-NA) Price values of the same Season-Crop
combination of levels.
Price Crop Season
10 Rice Summer
12
2004 Feb 24
3
Calculate Distance and Aggregate Data?
Hi all,
I've been struggling learning R and need to turn to the list again.
I've got a dataset (comma-delimited file) with the following fields: recid, latitude, longitude, population, dwelling and age. For each observation, I'd like to calculate the total number of people and dwellings and average age within 2 k.m. Distance could be Euclidean, however, a proper distance
2010 Jul 14
1
Changing model parameters in the mi package
I am trying to use the mi package to impute data, but am running into
problems with the functions it calls.
For instance, I am trying to impute a categorical variable called
"min.func." The mi() function calls the mi.categorical() function to
deal with this variable, which in turn calls the nnet.default()
function, and passes it a fixed parameter MaxNWts=1500. However, as
2008 Dec 22
1
imputing the numerical columns of a dataframe, returning the rest unchanged
Hi R-experts,
how can I apply a function to each numeric column of a data frame and return
the whole data frame with changes in numeric columns only?
In my case I want to do a median imputation of the numeric columns and
retain the other columns. My dataframe (DF) contains factors, characters and
numerics.
I tried the following but that does not work:
foo <- function(x){
2010 Jul 06
1
Error message using mi() in mi package
Hello!
I get the following message when I run the mi() function from the mi
package.
Error while imputing variable: c3 , model: mi.polr
Error in eval(expr, envir, enclos) : could not find function
"c14ordered"
Here's the situation:
I am running R v. 2.9.2 on Mac OSX v. 10.5.8. I am trying to impute
missing data in a data set that I've trimmed down to 302 variables.
2004 Mar 15
2
imputation of sub-threshold values
Is there a good way in R to impute values which exist,
but are less than the detection level for an assay?
Thanks,
Jonathan Williams
OPTIMA
Radcliffe Infirmary
Woodstock Road
OXFORD OX2 6HE
Tel +1865 (2)24356
2012 Apr 03
1
Imputing missing values using "LSmeans" (i.e., population marginal means) - advice in R?
Hi folks,
I have a dataset that consists of counts over a ~30 year period at multiple (>200) sites. Only one count is conducted at each site in each year; however, not all sites are surveyed in all years. I need to impute the missing values because I need an estimate of the total population size (i.e., sum of counts across all sites) in each year as input to another model.
>
2005 Jan 11
1
transcan() from Hmisc package for imputing data
Hello:
I have been trying to impute missing values of a data
frame which has both numerical and categorical values
using the function transcan() with little luck.
Would you be able to give me a simple example where a
data frame is fed to transcan and it spits out a new
data frame with the NA values filled up?
Or is there any other function that i could use?
Thank you
avneet
=====
I believe in
2007 Jun 22
1
Imputing missing values in time series
Folks,
This must be a rather common problem with real life time series data
but I don't see anything in the archive about how to deal with it. I
have a time series of natural gas prices by flow date. Since gas is not
traded on weekends and holidays, I have a lot of missing values,
FDate Price
11/1/2006 6.28
11/2/2006 6.58
11/3/2006 6.586
11/4/2006 6.716
11/5/2006 NA
11/6/2006 NA
11/7/2006
2008 Mar 05
1
rrp.impute: for what sizes does it work?
Hi,
I have a survey dataset of about 20000 observations
where for 2 factor variables I have about 200 missing
values each. I want to impute these using 10 possibly
explanatory variables which are a mixture of integers
and factors.
Since I was quite intrigued by the concept of rrp I
wanted to use it but it takes ages and terminates with
an error. First time it stopped complaining about too
little
2010 Nov 01
1
Error message in fit.mult.impute (Hmisc package)
Hello,
I would like to use the aregImpute and fit.mult.impute to impute missing
values for my dataset and then conduct logistic regression analyses on the
data, taking into account that we imputed values. I have no problems
imputing the values using aregImpute, but I am getting an error at the
fit.mult.impute stage.
Here is some sample code (I actually have more observations and variables to
2011 Jun 23
2
Rms package - problems with fit.mult.impute
Hi!
Does anyone know how to do the test for goodness of fit of a logistic model (in rms package) after running fit.mult.impute?
I am using the rms and Hmisc packages to do a multiple imputation followed by a logistic regression model using lrm.
Everything works fine until I try to run the test for goodness of fit: residuals(type=c("gof"))
One needs to specify y=T and x=T in the fit. But
2008 Oct 29
1
Help with impute.knn
ear all,
This is my first time using this listserv and I am seeking help from the
expert. OK, here is my question, I am trying to use impute.knn function
in impute library and when I tested the sample code, I got the error as
followingt:
Here is the sample code:
library(impute)
data(khanmiss)
khan.expr <- khanmiss[-1, -(1:2)]
## ## First example
## if(exists(".Random.seed"))
2011 Jun 08
1
install the “impute” package in unix
Hi,
I am trying to install the “impute” package in unix. but I get the
following error message. I followed the following steps. Do you know what is
causing this and how I can solve this problem?
source("http://www.bioconductor.org/biocLite.R")
biocLite("impute")
Using R version 2.11.1, biocinstall version 2.6.10.
Installing Bioconductor version 2.6 packages:
[1]