Displaying 20 results from an estimated 4000 matches similar to: "deleting certain observations in a data frame"
2008 Mar 12
3
type of object of a variable in a data frame
Hello!
I have used read.csv to read in a data frame, and there are a few variables in it, however, when I tried
is.list(data$V1)
>FALSE
In fact, I have tried, they are not vectors either.
I'm wondering:
1. What objects are these "lists" of data?
2. How could I find out about the type/inheritence of an object in general?
3. The reason I want it to be a list or vector, is that I
2008 Mar 06
2
extracting a percentage of data by random
Hello Gurus:
If I have a dataframe with one of the variables called "age" for example, and I want to extract a random 10% of the observations from each "age" group of the entire data frame. Do I have to double loop to split the data and then loop again to assign random numbers? Or is there a better way to do this?
Thanks!
Karen
2009 Jan 18
2
Deleting columns based on the number of non-blank observations
Hello,
I have a dataset (named "x") with many (966) columns. What I would like to do is delete any columns that do not have at least 375 non-blank observations (i.e., the cells have some value in them besides NA).
How can I do this? I have come up with the following code to _count_ the non-blank observations in each column, but how would I adapt this code to _delete_ columns from the
2019 Oct 10
1
Broken link on doc.dovecot.org
Hi again,
I wanted to check in and see if you got my note about the broken link on
your site.
Thanks!
Karen
On Monday, October 7, 2019 at 5:05 PM, Karen Woodman <karen at getmailbird.co>
wrote:
> Hi there,
>
> I noticed that you have a broken link to a website called Qmail.org. That
> site was first published 23 years ago (back in 1996!) but unfortunately, it
> is no
2008 Apr 09
1
vectorized way to combine levels of a factor
Hi Gurus:
If I have a large dataset of the form of:
> x <- data.frame(V1 = runif(10), V2 = sample(c('A','B','C'),10,T)) > x V1 V21 0.2691580 A2 0.8711267 B3 0.2674728 C4 0.3278876 A5 0.1809152 A6 0.2499651 C7 0.9155174 A8 0.8004974 B9 0.7885516 A10 0.9301630 A
And I want a V3 that =V2 if V2=A, and =D if V2=B or C. In other words I
2010 Aug 17
4
replacing values in a vector
Dear helpRs
Does anyone have an elegant way of doing the following:
For a given numeric vector, e.g. vec <- c(3,2,6,4,7)
Create a series of vectors where all but 1 of the values are replaced by
0's, e.g.
vec.a <- c(3,0,0,0,0)
vec.b <- c(0,2,0,0,0)
vec.c <- c(0,0,6,0,0)
vec.d <- c(0,0,0,4,0)
vec.e <- c(0,0,0,0,7)
I have looked at `replace', but can't think of a
2008 Feb 26
3
R package to perform Horn's parallel analysis
I am seeking information on whether anyone has written code to perform
Horn's parallel analysis (a procedure that informs the selection of the
proper number of components in PCA) in R.
Thank you in advance for any help you can provide. Please respond
off-list at the email address below.
Karen Douglas
*******************************************************************
Karen Douglas,
2002 Aug 15
1
Winbind in Samba 2.2.5 not automatically mapping the NT users with corresponding UNIX accounts
Have you tried in smb.conf
username map = /usr/local/samba/lib/domainuser.map
winbind use default domain = no
For every user you have to create an entry in the
/usr/local/samba/lib/domainuser.map, that looks like
<unixusername> = "<domainname>+<username>"
I?m using a script that generates the domainuser.map automaticly, because
our users are using NIS
2005 Jun 28
1
enhanced multidimensional scaling?
Dear R list
Would anyone be able to tell me whether it is possible to do "enhanced
multidimensional scaling" (enhanced MDS) in R? In other words, something that
goes beyond "cmdscale" by iteratively improving the fit between observed
dissimilarities and inter-object distances, using the KYST algorithm
(Kruskal, 1964).
I have found several implementations of non-metric MDS
2005 Jun 28
2
enhanced MDS
Hi again
Sorry, in looking again at sammon and isoMDS I see that they seem to do
exactly what I want, except that they are non-metric, which means, as I
understand it, that they relate the rank orders of the variables rather than
the actual distances.
Could I use these non-metric MDS packages even if my distances are metric?
Thanks
Karen
--
Karen Kotschy
Centre for Water in the Environment
2002 Apr 11
1
rsync : old file dates generating error during nfs rsync session: Value Too large for defined data type
I was troubleshooting a problem we were having with some files not rsyncing
properly over an nfs mount (the destination device is a snapserver (NAS)
that did not have native ability to receive streaming rsync info, that's why
we were doing this rsync over an NFS connection to the snapserver).
Anyway, at first I thought this was just one of the quirks of the snapserver
(it isn't
2005 Feb 19
16
Snom phone hint exten question
Hi,
I am sorry to be asking this but the wiki is down and has been for a
couple of days and I need to get this working before Monday to get my
live system setup.
Trying to get the Snom 190's and soon to arrive 3com 3102's to use the
function keys and for the life of me I can't work it out from the
conversations on the archive what I am going exactly wrong here?
The snom 190 with
2007 Jan 21
1
identify selected substances across individuals
An embedded and charset-unspecified text was scrubbed...
Name: inte tillg?nglig
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070121/436ed377/attachment.pl
2006 Jan 05
4
Q: R 2.2.1: Memory Management Issues?
Dear Developers:
I have a question about memory management in R 2.2.1 and am wondering if you would be kind enough to help me understand what is going on.
(It has been a few years since I have done software development on Windows, so I apologize in advance if these are easy questions.)
-------------
MY SYSTEM
-------------
I am currently using R (version 2.2.1) on a PC running Windows 2000
2005 Oct 21
1
finite mixture model (2-component gaussian): plotting component gaussian components?
Dear Knowledgeable R Community Members,
Please excuse my ignorance, I apologize in advance if this is an easy question, but I am a bit stumped and could use a little guidance.
I have a finite mixture modeling problem -- for example, a 2-component gaussian mixture -- where the components have a large overlap, and
I am trying to use the "mclust" package to solve this problem.
I need
2007 Sep 05
1
question: randomization t-test function already defined in R?
Dear R Users,
I am hoping you can help me.
I have received code from a colleague who uses Matlab. I need to
translate it into R.
I am wondering if there is a randomization t-test (from non-parametric
statistics) function already defined in R.
(In Matlab the function is randtest.m.)
************************************************************************
**********
QUESTION: Is anyone
2003 Apr 21
2
randomForest crash?
I am attempting to use randomForests to look for interesting genes in
microarray data with 216genes, 2 classes and 52 samples. My data.frame
is 52x217 with the last column, V217 being the class(1 or 2).
When I try
lung.rf <- randomForest(V217 ~ ., data=tlSA216cda, importance=
TRUE, proximity = TRUE)
the GUI crashes.
I am running R-1.6.2 under windo$e98, and most
2013 Jun 25
1
[LLVMdev] [icFuzz] Help needed with analyzing randomly generated tests that fail on clang 3.4 trunk
Hi Karen,
Thanks much for your comment and for sharing of your experience. icFuzz has a core that is "really" random, but does not cover the entire C space. The tool was designed from scratch to be extensible, and comes with a couple of extensions that target some of compiler optimizations optimizing compilers typically do: CSE, loop interchange, vectorization, etc. But even in the case
2013 Jun 25
0
[LLVMdev] [icFuzz] Help needed with analyzing randomly generated tests that fail on clang 3.4 trunk
On Mon, Jun 24, 2013 at 11:20:09PM +0000, Haghighat, Mohammad R wrote:
> The tests by design are syntactically correct, semantically correct, and have deterministic output.
>
> -moh
Hi,
I wanted to believe you had a randomized code generator that could cover the entire
valid input space. (smiles ;) But the test cited was not generated by a randomized
code generator. It is way too
2007 May 22
2
error message
Hi,
I am trying to install the package exonmap and RMySQL however I keep
getting the following error:
"Error in library(pkg, character.only = TRUE) :
'RMySQL' is not a valid package -- installed < 2.0.0?"
I have R version 2.4.1 so I know its not a version issue. I deleted and
reinstalled the folders again and the same thing happened. Has anyone
any ideas?
Thanks,