similar to: deleting certain observations in a data frame

Displaying 20 results from an estimated 4000 matches similar to: "deleting certain observations in a data frame"

2008 Mar 12
3
type of object of a variable in a data frame
Hello! I have used read.csv to read in a data frame, and there are a few variables in it, however, when I tried is.list(data$V1) >FALSE In fact, I have tried, they are not vectors either. I'm wondering: 1. What objects are these "lists" of data? 2. How could I find out about the type/inheritence of an object in general? 3. The reason I want it to be a list or vector, is that I
2008 Mar 06
2
extracting a percentage of data by random
Hello Gurus: If I have a dataframe with one of the variables called "age" for example, and I want to extract a random 10% of the observations from each "age" group of the entire data frame. Do I have to double loop to split the data and then loop again to assign random numbers? Or is there a better way to do this? Thanks! Karen
2009 Jan 18
2
Deleting columns based on the number of non-blank observations
Hello, I have a dataset (named "x") with many (966) columns. What I would like to do is delete any columns that do not have at least 375 non-blank observations (i.e., the cells have some value in them besides NA). How can I do this? I have come up with the following code to _count_ the non-blank observations in each column, but how would I adapt this code to _delete_ columns from the
2019 Oct 10
1
Broken link on doc.dovecot.org
Hi again, I wanted to check in and see if you got my note about the broken link on your site. Thanks! Karen On Monday, October 7, 2019 at 5:05 PM, Karen Woodman <karen at getmailbird.co> wrote: > Hi there, > > I noticed that you have a broken link to a website called Qmail.org. That > site was first published 23 years ago (back in 1996!) but unfortunately, it > is no
2008 Apr 09
1
vectorized way to combine levels of a factor
Hi Gurus: If I have a large dataset of the form of: > x <- data.frame(V1 = runif(10), V2 = sample(c('A','B','C'),10,T)) > x V1 V21 0.2691580 A2 0.8711267 B3 0.2674728 C4 0.3278876 A5 0.1809152 A6 0.2499651 C7 0.9155174 A8 0.8004974 B9 0.7885516 A10 0.9301630 A And I want a V3 that =V2 if V2=A, and =D if V2=B or C. In other words I
2010 Aug 17
4
replacing values in a vector
Dear helpRs Does anyone have an elegant way of doing the following: For a given numeric vector, e.g. vec <- c(3,2,6,4,7) Create a series of vectors where all but 1 of the values are replaced by 0's, e.g. vec.a <- c(3,0,0,0,0) vec.b <- c(0,2,0,0,0) vec.c <- c(0,0,6,0,0) vec.d <- c(0,0,0,4,0) vec.e <- c(0,0,0,0,7) I have looked at `replace', but can't think of a
2008 Feb 26
3
R package to perform Horn's parallel analysis
I am seeking information on whether anyone has written code to perform Horn's parallel analysis (a procedure that informs the selection of the proper number of components in PCA) in R. Thank you in advance for any help you can provide. Please respond off-list at the email address below. Karen Douglas ******************************************************************* Karen Douglas,
2002 Aug 15
1
Winbind in Samba 2.2.5 not automatically mapping the NT users with corresponding UNIX accounts
Have you tried in smb.conf username map = /usr/local/samba/lib/domainuser.map winbind use default domain = no For every user you have to create an entry in the /usr/local/samba/lib/domainuser.map, that looks like <unixusername> = "<domainname>+<username>" I?m using a script that generates the domainuser.map automaticly, because our users are using NIS
2005 Jun 28
1
enhanced multidimensional scaling?
Dear R list Would anyone be able to tell me whether it is possible to do "enhanced multidimensional scaling" (enhanced MDS) in R? In other words, something that goes beyond "cmdscale" by iteratively improving the fit between observed dissimilarities and inter-object distances, using the KYST algorithm (Kruskal, 1964). I have found several implementations of non-metric MDS
2005 Jun 28
2
enhanced MDS
Hi again Sorry, in looking again at sammon and isoMDS I see that they seem to do exactly what I want, except that they are non-metric, which means, as I understand it, that they relate the rank orders of the variables rather than the actual distances. Could I use these non-metric MDS packages even if my distances are metric? Thanks Karen -- Karen Kotschy Centre for Water in the Environment
2002 Apr 11
1
rsync : old file dates generating error during nfs rsync session: Value Too large for defined data type
I was troubleshooting a problem we were having with some files not rsyncing properly over an nfs mount (the destination device is a snapserver (NAS) that did not have native ability to receive streaming rsync info, that's why we were doing this rsync over an NFS connection to the snapserver). Anyway, at first I thought this was just one of the quirks of the snapserver (it isn't
2005 Feb 19
16
Snom phone hint exten question
Hi, I am sorry to be asking this but the wiki is down and has been for a couple of days and I need to get this working before Monday to get my live system setup. Trying to get the Snom 190's and soon to arrive 3com 3102's to use the function keys and for the life of me I can't work it out from the conversations on the archive what I am going exactly wrong here? The snom 190 with
2007 Jan 21
1
identify selected substances across individuals
An embedded and charset-unspecified text was scrubbed... Name: inte tillg?nglig Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070121/436ed377/attachment.pl
2006 Jan 05
4
Q: R 2.2.1: Memory Management Issues?
Dear Developers: I have a question about memory management in R 2.2.1 and am wondering if you would be kind enough to help me understand what is going on. (It has been a few years since I have done software development on Windows, so I apologize in advance if these are easy questions.) ------------- MY SYSTEM ------------- I am currently using R (version 2.2.1) on a PC running Windows 2000
2005 Oct 21
1
finite mixture model (2-component gaussian): plotting component gaussian components?
Dear Knowledgeable R Community Members, Please excuse my ignorance, I apologize in advance if this is an easy question, but I am a bit stumped and could use a little guidance. I have a finite mixture modeling problem -- for example, a 2-component gaussian mixture -- where the components have a large overlap, and I am trying to use the "mclust" package to solve this problem. I need
2007 Sep 05
1
question: randomization t-test function already defined in R?
Dear R Users, I am hoping you can help me. I have received code from a colleague who uses Matlab. I need to translate it into R. I am wondering if there is a randomization t-test (from non-parametric statistics) function already defined in R. (In Matlab the function is randtest.m.) ************************************************************************ ********** QUESTION: Is anyone
2003 Apr 21
2
randomForest crash?
I am attempting to use randomForests to look for interesting genes in microarray data with 216genes, 2 classes and 52 samples. My data.frame is 52x217 with the last column, V217 being the class(1 or 2). When I try lung.rf <- randomForest(V217 ~ ., data=tlSA216cda, importance= TRUE, proximity = TRUE) the GUI crashes. I am running R-1.6.2 under windo$e98, and most
2013 Jun 25
1
[LLVMdev] [icFuzz] Help needed with analyzing randomly generated tests that fail on clang 3.4 trunk
Hi Karen, Thanks much for your comment and for sharing of your experience. icFuzz has a core that is "really" random, but does not cover the entire C space. The tool was designed from scratch to be extensible, and comes with a couple of extensions that target some of compiler optimizations optimizing compilers typically do: CSE, loop interchange, vectorization, etc. But even in the case
2013 Jun 25
0
[LLVMdev] [icFuzz] Help needed with analyzing randomly generated tests that fail on clang 3.4 trunk
On Mon, Jun 24, 2013 at 11:20:09PM +0000, Haghighat, Mohammad R wrote: > The tests by design are syntactically correct, semantically correct, and have deterministic output. > > -moh Hi, I wanted to believe you had a randomized code generator that could cover the entire valid input space. (smiles ;) But the test cited was not generated by a randomized code generator. It is way too
2007 May 22
2
error message
Hi, I am trying to install the package exonmap and RMySQL however I keep getting the following error: "Error in library(pkg, character.only = TRUE) : 'RMySQL' is not a valid package -- installed < 2.0.0?" I have R version 2.4.1 so I know its not a version issue. I deleted and reinstalled the folders again and the same thing happened. Has anyone any ideas? Thanks,