thr3ads.net - similar to: "Wishlist: merge and subset to keep attributes (PR#8658)"

Displaying 20 results from an estimated 2000 matches similar to: "Wishlist: merge and subset to keep attributes (PR#8658)"

Wishlist: xtabs and table to optionally use attribute value labels (PR#8659)

2006 Mar 05

Wishlist: xtabs and table to optionally use attribute value labels (PR#8659)

Full_Name: Ulrike Gr?mping Version: 2.2.1 OS: Windows Submission from: (NULL) (84.190.139.94) A wish somehow related to my wish 8658: Package foreign allows to import categorical data from SPSS (and possibly other software) using the original codes, which are often useful for data manipulation, since one can use already available lists of codes from others who don't use R etc. The original

Matrix problem to extract animal associations

2012 Feb 26

Matrix problem to extract animal associations

Dear List, I have been trying to extract associations from a matrix whereby individual locations are within a certain distance threshold from one another. I have been able to extract those individuals where there is 'no interaction' (i.e. where these individuals are not within a specified distance threshold from another individual) and give these individuals a unique Group ID containing

How to check if a value of a variable is in a list

2011 Apr 20

How to check if a value of a variable is in a list

Hi all, I am working with some social network analysis in R and ran into a problem I just cannot solve. Each observation in my data consists of a respondent, some characteristics and up to five friends. The problem is that all of these five friends might no show up later as a respondent (observation). Therefore I might not have characteristics on all the friends listed in the data and I want to

colnames for data.frame could be greatly improved

2016 Dec 27

colnames for data.frame could be greatly improved

Hi there, Any update on this? Should I create bugzilla ticket and submit patch? Regards Jan Gorecki On 20 December 2016 at 01:27, Jan Gorecki <J.Gorecki at wit.edu.pl> wrote: > Hello, > > colnames seems to be not optimized well for data.frame. It escapes > processing for data.frame in > > if (is.data.frame(x) && do.NULL) > return(names(x)) > > but

colnames for data.frame could be greatly improved

2016 Dec 20

colnames for data.frame could be greatly improved

Hello, colnames seems to be not optimized well for data.frame. It escapes processing for data.frame in if (is.data.frame(x) && do.NULL) return(names(x)) but only when do.NULL true. This makes huge difference when do.NULL false. Minimal edit to `colnames`: if (is.data.frame(x)) { nm <- names(x) if (do.NULL || !is.null(nm)) return(nm) else

Recoding variables based on reference values in data frame

2013 Jul 02

Recoding variables based on reference values in data frame

I'm new to R (previously used SAS primarily) and I have a genetics data frame consisting of genotypes for each of 300+ subjects (ID1, ID2, ID3, ...) at 3000+ genetic locations (SNP1, SNP2, SNP3...). A small subset of the data is shown below: SNP_ID SNP1 SNP2 SNP3 SNP4 Maj_Allele C G C A Min_Allele T A T G ID1 CC GG CT AA ID2 CC GG CC AA ID3 CC GG nc AA

Doubt in simple merge

2014 Jan 16

Doubt in simple merge

Dear R community I have a two data set called "Elder" and "Younger". This is my code for simple merge. Elder <- data.frame( ID=c("ID1","ID2","ID3"), age=c(38,35,31)) Younger <- data.frame( ID=c("ID4","ID5","ID3"), age=c(29,21,31)) mer <- merge(Elder,Younger,by="ID", all=T) Output I am

List Conversion

2006 Feb 09

List Conversion

Hello, I have a list (mode and class are list) in R that is many elements long and of the form: >length(list) [1] 5778 >list[1:4] $ID1 [1] "num1" $ID2 [1] "num2" "num3" $ID3 [1] "num4" $ID4 [1] NA I'd like to convert the $ID2 value to be in one element rather than in two.?? It shows up as c(\"num2\", \"num3\") if I try to use

average columns of data frame corresponding to replicates

2010 Sep 07

average columns of data frame corresponding to replicates

Hi Group, I have a data frame below. Within this data frame there are samples (columns) that are measured more than once. Samples are indicated by "idx". So "id1" is present in columns 1, 3, and 5. Not every id is repeated. I would like to create a new data frame so that the repeated ids are averaged. For example, in the new data frame, columns 1, 3, and 5 of the original

multiple bar plot annotation text labelling

2016 Apr 04

multiple bar plot annotation text labelling

Use only plain text emails. Don't attach file types that will be stripped. See the footer at the bottom of your email for more information. Do give us the data using dput(): > dput(testbarplot) structure(list(V1 = c(1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L), V2 = c(3L, 8L, 1L, 9L, 5L, 4L, 2L, 0L), V3 = structure(c(1L, 2L, 3L, 4L, 3L, 5L, 6L, 7L), .Label = c("A", "B",

lazy evaluation (was RE: Number of replications of a term)

2006 Jan 25

lazy evaluation (was RE: Number of replications of a term)

From: Thomas Lumley > > On Wed, 25 Jan 2006, Ray Brownrigg wrote: > > > There's an even faster one, which nobody seems to have > mentioned yet: > > > > rep(l <- rle(ids)$lengths, l) > > I considered this but it wasn't clear to me from the initial > post that > each ID occupied a contiguous section of the vector. > > Also, lazy

multiple bar plot annotation text labelling

2016 Apr 04

multiple bar plot annotation text labelling

Readers, The attempt is to create a bar plot with text labels adjacent to each datum value. Data file: 1,3,A 1,8,B 1,1,C 1,9,D 2,5,C 2,4,E 2,2,F 2,0,G testbarplot<-read.csv('data1.csv', header=FALSE) barplot(axes=FALSE, ann=FALSE, horiz=TRUE, testbarplot[,2], ylab= 'group', xlab= '(x values)', space=c(1,0,0,0, 1,0,0,0)) text(testbarplot[,2], testbarplot[,1],

how to replace <NA> values

2014 Jan 19

how to replace <NA> values

Dear R community I have a large data set contain some empty cells. Because of that, may be I am wrong, <NA> values are produced. Now I want replace both empty and <NA> values with zero. Elder1 <- data.frame( ID=c("ID1","ID2","ID3","ID6","ID8"), age=c(38,35,"",NA,NA)) Output I am expecting ID age ID1 38 ID2 35

Query about data manipulation

2007 Mar 01

Query about data manipulation

Hi Thanks much for the prompt response to my earlier enquiry on packages for regression analyses. Along the same topic(?), I have another question about which I could use some input. I am retreiving data from a MySQL database using RODBC. The table has many BLOB columns and each BLOB column has data in the format "id1 \t id2 \t measure \n id3 \t id4 \t measure...." (i.e. multiple rows

Number of replications of a term

2006 Jan 24

Number of replications of a term

Hello, Is there a simple and fast function that returns a vector of the number of replications for each object of a vector ? For example : I have a vector of IDs : ids <- c( "ID1", "ID2", "ID2", "ID3", "ID3","ID3", "ID5") I want the function returns the following vector where each term is the number of replicates for the

Calculate Closest 5 Cases?

2004 Feb 13

Calculate Closest 5 Cases?

I've only begun investigating R as a substitute for SPSS. I have a need to identify for each CASE the closest (or most similar) 5 other CASES (not including itself as it is automatically the closest). I have a fairly large matrix (50000 cases by 50 vars). In SPSS, I can use Correlate > Distances to generate a matrix of similarity, but only on a small sample. The entire matrix can not

Fastest way to repeatedly subset a data frame?

2007 Apr 20

Fastest way to repeatedly subset a data frame?

Hi - I have a data frame with a large number of observations (62,000 rows, but only 2 columns - a character ID and a result list). Sample: > my.df <- data.frame(id=c("ID1", "ID2", "ID3"), result=1:3) > my.df id result 1 ID1 1 2 ID2 2 3 ID3 3 I have a list of ID vectors. This list will have anywhere from 100 to 1000 members, and

data.frame manipulation: Unbinding strings in a row

2008 Jan 10

data.frame manipulation: Unbinding strings in a row

Hi all, I have a data.frame I received with data that look like this (comma separated strings in last row): ID Shop Items ID1 A1 item1, item2, item3 ID2 A2 item4, item5 ID3 A1 item1, item3, item4 But I would like to unbind the strings in col(2) items so that it will look like this: ID Shop Items ID1 A1 item1 ID1 A1 item2 ID1 A1 item3 ID2 A2 item4 ID2 A2 item5 ID3 A1 item1 ID3 A1 item3 ID3 A1

Problem with ddply in the plyr-package: surprising output of a date-column

2011 Apr 25

Problem with ddply in the plyr-package: surprising output of a date-column

Hi Together, I have a problem with the plyr package - more precisely with the ddply function - and would be very grateful for any help. I hope the example here is precise enough for someone to identify the problem. Basically, in this step I want to identify observations that are identical in terms of certain identifiers (ID1, ID2, ID3) and just want to save those observations (in this step,

dataframe without repetition

2005 Nov 09

dataframe without repetition

Hello, with a data.frame like this : > toto <- data.frame(id=c("id1","id1","id2","id3","id3","id3"),dpt=c("13","13","34","30","30","30")) > toto id dpt 1 id1 13 2 id1 13 3 id2 34 4 id3 30 5 id3 30 6 id3 30 what is the most efficient ways to obtain : id

similar to: Wishlist: merge and subset to keep attributes (PR#8658)