thr3ads.net - similar to: "find duplicates... need help!"

Displaying 20 results from an estimated 100000 matches similar to: "find duplicates... need help!"

2009 Jun 18

Any method to speed up this problem?

Hi all, Suppose I have a vector like this: [1] "STAT1" "STAT1" "STAT1" "STAT1" "GAPDH" "GAPDH" "GAPDH" "ACTB" "ACTB" [10] "ACTB" "DDR1" "RFC2" "HSPA6" "PAX8" "GUCA1A" "UBE1L" "THRA" "PTPN21" [19]

need help with strsplit function

2009 Jun 11

need help with strsplit function

Hi, if I have this string: "a.b.c.d" and I use this function: unlist(strsplit("a.b.c.d", "\\.")), I get this as my output: "a", "b", "c", and "d". Is there a way to just split on the first period so I obtain only two pieces like: "a" and "b.c.d"? Anyways, thanks in advance! -- View this message in context:

Re moving the numbers of the X/Y axis

2009 Aug 30

Re moving the numbers of the X/Y axis

Hi all, Suppose I have some data that I plot using the histogram command - ie. hist(x) Is there an option that will allow me to remove the numbers that appear along the X and Y axis as I'm just interested in the overall distribution of the data and not the actual values? Anyways, any help is greatly appreciated! -- View this message in context:

Need help with inbound/outbound PRI calls

2003 Jun 21

Need help with inbound/outbound PRI calls

I'm running a pretty successful Asterisk system and recently moved our PRI to a T100P board. The PRI was previously connected to a Cisco 2600 that was serving as a voice gateway. We are having a frequent problem with inbound and outbound calls being disconnected shortly after they are answered since moving the PRI directly to the Asterisk box. Most calls work fine, but approx 3 out 10 are

Creating this vector, any suggetions?

2009 Jun 12

Creating this vector, any suggetions?

Suppose I have the first vector: c(1, 6, 8, 9) I will like to create a second vector of size 10 composed of 0 and 1's. The second vector will be composed of four 1's and six 0's. The position of the 1's will be specificed by the first vector. So essentially, I want a second vector in the form: c(1, 0, 0, 0, 0, 1, 0, 1, 1, 0) Any help is greatly appreciated! -- View this message

Vector of string

2009 Jun 23

Vector of string

Hi all. Suppose I have a vector of strings ie: ["12384", "TYU123123", "AVC3939", "hhr1919", "TYU0029"] Is there an efficient method that would allow me to return a vector of the position of strings that start with "TYU"? So essentially, I would like to get back in return [2, 5]. Anyways, any help with be greatly appreciated! -- View

Sequence generation

2009 Aug 29

Sequence generation

Hey guys, I was wondering how to create this sequence: 1, 2, 3, 1, 2, 3, 1, 2, 3, 1, 2, 3... with the '1, 2, 3' repeated over 10 times. Also, is there a simple method to generate 1, 1, 1, 2, 2, 2, 3, 3, 3? Anyways, any help with be greatly appreciated! -- View this message in context: http://www.nabble.com/Sequence-generation-tp25205593p25205593.html Sent from the R help mailing list

Grouping data in ranges in table

2011 Mar 05

Grouping data in ranges in table

Working with the built in R data set Orange, e.g. with(Orange, table(age, circumference)). How should I go about about grouping the ages and circumferences in the following ranges and having them display as such in a table? age range: 118 - 664 1004 - 1372 1582 circumference range: 30-58 62- 115 120-142 145-177 179-214 Thanks for any feedback and insights, as I hoping for an output that

Making LLD PDB generation faster

2019 Feb 27

Making LLD PDB generation faster

On Wed, Feb 27, 2019 at 3:17 PM Leonardo Santagada <santagada at gmail.com> wrote: > > Also we are investigating why we have 420 million types being linked while > it appears that 95-99 % of them are not being used. De anyone know if pch > can help here? My feeling is not much as template instantiation still > generates a ton of weak symbols on the pch users, but I might be

Choosing and preserving a random duplicate

2010 Mar 30

Choosing and preserving a random duplicate

Dear R-Helpers, I have a dataframe (g10df) formatted like this: GENE PVAL 1 KCTD12 4.06904e-22 2 UNC93A 9.91852e-22 3 CDKN3 1.24695e-21 4 CLEC2B 4.71759e-21 5 DAB2 1.12062e-20 The rows are ranked in ascending order by PVAL, and I need to end up with the same relative order. There are duplicate entries for genes in the first column with corresponding

which rows are duplicates?

2009 Mar 30

which rows are duplicates?

I would like to know which rows are duplicates of each other, not simply that a row is duplicate of another row. In the following example rows 1 and 3 are duplicates. > x <- c(1,3,1) > y <- c(2,4,2) > z <- c(3,4,3) > data <- data.frame(x,y,z) x y z 1 1 2 3 2 3 4 4 3 1 2 3 I can't figure out how to get R to tell me that observation 1 and 3 are the same.

which rows are duplicates?

2009 Mar 30

which rows are duplicates?

Unique.data.frame...still getting duplicates

2004 Jun 25

Unique.data.frame...still getting duplicates

Hi there I have a data frame with about 65,000 rows and 8 variables. I am trying to get rid of the double entries of a factor variable "ID" so I can get a unique observation for each ID I tried: >dupl_unique.data.frame(data[ID,]) #I obtain a data frame with 21,547 >observations..so far so good, but then when I check for duplicates >d_duplicated(dupl2$ID)

need help

2009 May 15

need help

Dear all please ,I need to write a function in R to estimate the parameters of negative binomial distribution and then calculate the loglikelihood amount for given data.Is there any one to help me. thank you very much for any help Best regards [[alternative HTML version deleted]]

how to create data.frames from vectors with duplicates

2011 Aug 31

how to create data.frames from vectors with duplicates

Hi R users, suppose I have two vectors, > x=c(1,2,3,4,5) > y=c('a','b','c','a','c') How can I get a data.frame like this? > xy count a 5 b 2 c 8 I know a few ways to fulfill the task. However, I have a huge number of this kind calculations, so I'd like an efficient solution. Thanks -- Best, Zhenjiang

Counting number of datasets and appending them

2010 Dec 29

Counting number of datasets and appending them

Hi there, I have a question on how to read a bunch of dataset, assign each of the dataset to a matrix in the memory, and append them. Suppose I have 20 dataset saved to different .rda files named gradeFileData1, gradeFileData2,...., gradeFileData20. And I would like to read them each into a dataset in the memory, then combine them. I wrote something like: e1<-new.env(parent=.GlobalEnv)

duplicated() variation that goes both ways to capture all duplicates

2012 Jul 23

duplicated() variation that goes both ways to capture all duplicates

Dear all The trouble with the current duplicated() function in is that it can report duplicates while searching fromFirst _or_ fromLast, but not both ways. Often users will want to identify and extract all the copies of the item that has duplicates, not only the duplicates themselves. To take the example from the man page: > data(iris) > iris[duplicated(iris), ] ##duplicates while

finding duplicates in an array

2006 Jun 19

finding duplicates in an array

an application i am working on sends emails to an array of users. it is possible for the user''s email to actually be in the array more than once. i don''t have a real strong grasp of the ruby syntax yet, but is there an easy way to go through and either remove all of the duplicates or at least only send an email to that user once? -- Posted via http://www.ruby-forum.com/.

Removing Duplicates

2010 Mar 14

Removing Duplicates

Hi all, I am starting fresh with a local repository of mails, which almost certainly have duplicates in them. I am going to use maildirs, and ensure all mails are input with CRLFs. The question is: does anybody know how I can find and remove duplicates, either while injecting mail with IMAP, or afterward? I can use tools to find duplicate Message-IDs, but don't know of a way to remove

Duplicates among columns of a data frame

2008 Dec 15

Duplicates among columns of a data frame

Dear list, I have a data frame of survey respondents, a little like this: set.seed(20081215) n <- 100 dat <- data.frame(id=1:100, addr1=sample(LETTERS, n, replace=TRUE), addr2=sample(LETTERS, n, replace=TRUE), addr3=sample(LETTERS, n, replace=TRUE)) head(dat) id addr1 addr2 addr3 1 1 R H Q 2 2 H C K 3 3

similar to: find duplicates... need help!