similar to: find duplicates... need help!

Displaying 20 results from an estimated 100000 matches similar to: "find duplicates... need help!"

2009 Jun 18
2
Any method to speed up this problem?
Hi all, Suppose I have a vector like this: [1] "STAT1" "STAT1" "STAT1" "STAT1" "GAPDH" "GAPDH" "GAPDH" "ACTB" "ACTB" [10] "ACTB" "DDR1" "RFC2" "HSPA6" "PAX8" "GUCA1A" "UBE1L" "THRA" "PTPN21" [19]
2009 Jun 11
2
need help with strsplit function
Hi, if I have this string: "a.b.c.d" and I use this function: unlist(strsplit("a.b.c.d", "\\.")), I get this as my output: "a", "b", "c", and "d". Is there a way to just split on the first period so I obtain only two pieces like: "a" and "b.c.d"? Anyways, thanks in advance! -- View this message in context:
2009 Aug 30
1
Re moving the numbers of the X/Y axis
Hi all, Suppose I have some data that I plot using the histogram command - ie. hist(x) Is there an option that will allow me to remove the numbers that appear along the X and Y axis as I'm just interested in the overall distribution of the data and not the actual values? Anyways, any help is greatly appreciated! -- View this message in context:
2003 Jun 21
1
Need help with inbound/outbound PRI calls
I'm running a pretty successful Asterisk system and recently moved our PRI to a T100P board. The PRI was previously connected to a Cisco 2600 that was serving as a voice gateway. We are having a frequent problem with inbound and outbound calls being disconnected shortly after they are answered since moving the PRI directly to the Asterisk box. Most calls work fine, but approx 3 out 10 are
2009 Jun 12
2
Creating this vector, any suggetions?
Suppose I have the first vector: c(1, 6, 8, 9) I will like to create a second vector of size 10 composed of 0 and 1's. The second vector will be composed of four 1's and six 0's. The position of the 1's will be specificed by the first vector. So essentially, I want a second vector in the form: c(1, 0, 0, 0, 0, 1, 0, 1, 1, 0) Any help is greatly appreciated! -- View this message
2009 Jun 23
2
Vector of string
Hi all. Suppose I have a vector of strings ie: ["12384", "TYU123123", "AVC3939", "hhr1919", "TYU0029"] Is there an efficient method that would allow me to return a vector of the position of strings that start with "TYU"? So essentially, I would like to get back in return [2, 5]. Anyways, any help with be greatly appreciated! -- View
2009 Aug 29
3
Sequence generation
Hey guys, I was wondering how to create this sequence: 1, 2, 3, 1, 2, 3, 1, 2, 3, 1, 2, 3... with the '1, 2, 3' repeated over 10 times. Also, is there a simple method to generate 1, 1, 1, 2, 2, 2, 3, 3, 3? Anyways, any help with be greatly appreciated! -- View this message in context: http://www.nabble.com/Sequence-generation-tp25205593p25205593.html Sent from the R help mailing list
2011 Mar 05
2
Grouping data in ranges in table
Working with the built in R data set Orange, e.g. with(Orange, table(age, circumference)). How should I go about about grouping the ages and circumferences in the following ranges and having them display as such in a table? age range: 118 - 664 1004 - 1372 1582 circumference range: 30-58 62- 115 120-142 145-177 179-214 Thanks for any feedback and insights, as I hoping for an output that
2019 Feb 27
3
Making LLD PDB generation faster
On Wed, Feb 27, 2019 at 3:17 PM Leonardo Santagada <santagada at gmail.com> wrote: > > Also we are investigating why we have 420 million types being linked while > it appears that 95-99 % of them are not being used. De anyone know if pch > can help here? My feeling is not much as template instantiation still > generates a ton of weak symbols on the pch users, but I might be
2010 Mar 30
2
Choosing and preserving a random duplicate
Dear R-Helpers, I have a dataframe (g10df) formatted like this: GENE PVAL 1 KCTD12 4.06904e-22 2 UNC93A 9.91852e-22 3 CDKN3 1.24695e-21 4 CLEC2B 4.71759e-21 5 DAB2 1.12062e-20 The rows are ranked in ascending order by PVAL, and I need to end up with the same relative order. There are duplicate entries for genes in the first column with corresponding
2009 Mar 30
2
which rows are duplicates?
I would like to know which rows are duplicates of each other, not simply that a row is duplicate of another row. In the following example rows 1 and 3 are duplicates. > x <- c(1,3,1) > y <- c(2,4,2) > z <- c(3,4,3) > data <- data.frame(x,y,z) x y z 1 1 2 3 2 3 4 4 3 1 2 3 I can't figure out how to get R to tell me that observation 1 and 3 are the same.
2009 Mar 30
2
which rows are duplicates?
I would like to know which rows are duplicates of each other, not simply that a row is duplicate of another row. In the following example rows 1 and 3 are duplicates. > x <- c(1,3,1) > y <- c(2,4,2) > z <- c(3,4,3) > data <- data.frame(x,y,z) x y z 1 1 2 3 2 3 4 4 3 1 2 3 I can't figure out how to get R to tell me that observation 1 and 3 are the same.
2004 Jun 25
4
Unique.data.frame...still getting duplicates
Hi there I have a data frame with about 65,000 rows and 8 variables. I am trying to get rid of the double entries of a factor variable "ID" so I can get a unique observation for each ID I tried: >dupl_unique.data.frame(data[ID,]) #I obtain a data frame with 21,547 >observations..so far so good, but then when I check for duplicates >d_duplicated(dupl2$ID)
2009 May 15
3
need help
Dear all please ,I need to write a function in R to estimate the parameters of negative binomial distribution and then calculate the loglikelihood amount for given data.Is there any one to help me. thank you very much for any help Best regards [[alternative HTML version deleted]]
2011 Aug 31
3
how to create data.frames from vectors with duplicates
Hi R users, suppose I have two vectors, > x=c(1,2,3,4,5) > y=c('a','b','c','a','c') How can I get a data.frame like this? > xy count a 5 b 2 c 8 I know a few ways to fulfill the task. However, I have a huge number of this kind calculations, so I'd like an efficient solution. Thanks -- Best, Zhenjiang
2010 Dec 29
1
Counting number of datasets and appending them
Hi there, I have a question on how to read a bunch of dataset, assign each of the dataset to a matrix in the memory, and append them. Suppose I have 20 dataset saved to different .rda files named gradeFileData1, gradeFileData2,...., gradeFileData20. And I would like to read them each into a dataset in the memory, then combine them. I wrote something like: e1<-new.env(parent=.GlobalEnv)
2012 Jul 23
1
duplicated() variation that goes both ways to capture all duplicates
Dear all The trouble with the current duplicated() function in is that it can report duplicates while searching fromFirst _or_ fromLast, but not both ways. Often users will want to identify and extract all the copies of the item that has duplicates, not only the duplicates themselves. To take the example from the man page: > data(iris) > iris[duplicated(iris), ] ##duplicates while
2006 Jun 19
2
finding duplicates in an array
an application i am working on sends emails to an array of users. it is possible for the user''s email to actually be in the array more than once. i don''t have a real strong grasp of the ruby syntax yet, but is there an easy way to go through and either remove all of the duplicates or at least only send an email to that user once? -- Posted via http://www.ruby-forum.com/.
2010 Mar 14
3
Removing Duplicates
Hi all, I am starting fresh with a local repository of mails, which almost certainly have duplicates in them. I am going to use maildirs, and ensure all mails are input with CRLFs. The question is: does anybody know how I can find and remove duplicates, either while injecting mail with IMAP, or afterward? I can use tools to find duplicate Message-IDs, but don't know of a way to remove
2008 Dec 15
2
Duplicates among columns of a data frame
Dear list, I have a data frame of survey respondents, a little like this: set.seed(20081215) n <- 100 dat <- data.frame(id=1:100, addr1=sample(LETTERS, n, replace=TRUE), addr2=sample(LETTERS, n, replace=TRUE), addr3=sample(LETTERS, n, replace=TRUE)) head(dat) id addr1 addr2 addr3 1 1 R H Q 2 2 H C K 3 3