thr3ads.net - similar to: "Adding same items together in data.frame"

Displaying 20 results from an estimated 9000 matches similar to: "Adding same items together in data.frame"

2012 Sep 02

a newbie seeking for a simple problem

Dear Experienced R users, I have a looks-like simple but complicated problem urgently needed to be solved. Below is the detail: I have two dataframes, df1, df2. df1 contains two column and many thousands rows: column 1 is a "gene_name", column 2 is "value". df2 contains only one column which is "gene_name" with couple hundred rows. I want to change "value"

index to select rows of a large matrix

2009 May 21

index to select rows of a large matrix

Dear R Users, I have created a 1500 x 20000 data frame - DataSeq. Each of the 1500 rows represents a data sequence. I have another data frame iData that stores the information of these 1500 data sequences in the same order, for example, condition, gender, etc. If I use "subset" to select certain groups within iData according to some criteria that I have set, e.g. condition, gender Then

how to make row.names based on column1 with duplicated values

2018 Mar 01

how to make row.names based on column1 with duplicated values

Dear All, Suppose I have a dataframe like this with many thousands rows all with different names: data.frame(gene=c("a","b","c","d","c","d","c","f"),value=c(20,300,48,55,9,2,100,200)), I want to set column "gene" as row.names, but there are duplicates (c, d), which I want to transform into this as row names:

how to make row.names based on column1 with duplicated values

2018 Mar 01

how to make row.names based on column1 with duplicated values

On Wed, 28 Feb 2018, Stephen HonKit Wong wrote: > Dear All, > Suppose I have a dataframe like this with many thousands rows all with > different names: > data.frame(gene=c("a","b","c","d","c","d","c","f"),value=c(20,300,48,55,9,2,100,200)), > > I want to set column "gene" as row.names, but

dumpadm and using dumpfile on zfs?

2007 Jan 29

dumpadm and using dumpfile on zfs?

Hi All, I''d like to set up dumping to a file. This file is on a mirrored pool using zfs. It seems that the dump setup doesn''t work with zfs. This worked for both a standard UFS slice and a SVM mirror using zfs. Is there something that I''m doing wrong, or is this not yet supported on ZFS? Note this is Solaris 10 Update 3, but I don''t think that should

a difficult situation, how to do this using base function.

2017 Jul 22

a difficult situation, how to do this using base function.

1. Please always reply to the list, especially here so that others can see your clarification. 2. What happens if your match.start value exceeds all the cumulative sums?? -- you seem to imply that this cannot happen. Your minimal example, while a little confusing (to me) and in html -- this can get mangled in this plain text list, though seemingly not here -- was very helpful. Essential even .

duplicate password prompt

2003 Apr 24

duplicate password prompt

I'm running samba 2.2.8a on a redhat 9 box, everything works great, but ever since I playd with the system-auth I now have to enter a password in twice when I logon to the console. Any one got any ideas ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Iwan Davies Server Support and Development Technician Cyngor Sir Ceredigion County Council IT Section Finance iwandav@ceredigion.gov.uk

Session and open files

2003 May 15

Session and open files

Hi, Hi I'm using redhat 9 I've got samba working and I have users onj the system accessing the shares, Can anyone recomend a programme that can tell me whch users have got files open on my samba box Thanks ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Iwan Davies Server Support and Development Technician Cyngor Sir Ceredigion County Council IT Section Finance iwandav@ceredigion.gov.uk

plot 4th variable contour lines on filled.contour

2009 Jun 02

plot 4th variable contour lines on filled.contour

Hello, I have a dataset with 4 variables, each consisting of a vector, all with the same length. I start by interpolating the first three variables using the function "interp", and plot the interpolation successfully using "filled.contour". I then interpolate the first two variables and a fourth using "interp" again, but when I try to overlay the contour lines

how to make the code more efficient using lapply

2018 May 25

how to make the code more efficient using lapply

Dear All, I have a following for-loop code which is basically intended to read in many excel files (each file has many columns and rows) in a directory and extract the some rows and columns out of each file and then combine them together into a dataframe. I use for loop which can do the work but quite slow. How to make it faster using lapply function ? Thanks in advance! temp.df<-c() #

a difficult situation, how to do this using base function.

2017 Jul 21

a difficult situation, how to do this using base function.

Hello, I have a following dataframe with many rows. data.frame(match.start=c(5,10,100,200),range.coordinates=c("1000-1050","1500-1555","5000-5050,6000-6180","100-150,200-260,600-900")) match.start range.coordinates 5 1000-1050 10 1500-1555 100 5000-5050,6000-6180 200

For-loop dummy variables?

2010 Oct 19

For-loop dummy variables?

Hi everyone, I've got a dataset with 12,000 observations. One of the variables (cleary$D1) is for an individual's country, coded 1 - 15. I'd like to create a dummy variable for the Baltic states which are coded 4,6, and 7. In other words, as a dummy variable Baltic states would be coded 1, else 0. I've attempted the following for loop: dummy <- matrix(NA, nrow=nrow(cleary),

formatting a 6 million row data set; creating a censoring variable

2011 Aug 31

formatting a 6 million row data set; creating a censoring variable

List, Consider the following data. gender mygroup id 1 F A 1 2 F B 2 3 F B 2 4 F B 2 5 F C 2 6 F C 2 7 F C 2 8 F D 2 9 F D 2 10 F D 2 11 F D 2 12 F D 2 13 F D 2 14 M A 3 15 M A 3 16 M A 3 17

Patch for rgl with gcc 4.0 in R 2.3.0 on OS X

2006 Jun 26

Patch for rgl with gcc 4.0 in R 2.3.0 on OS X

Hi, I recently had a problem installing the rgl package on OS X and put together a simple patch. The patched package is available here: http://jinome.stanford.edu/files/rgl_0.66-patched_for_gcc4.tar.gz It can be installed with "R CMD INSTALL rgl_0.66-patched_for_gcc4.tar.gz" as normal at the command line. Also -- as of right now rgl is not in the repository of version 2.3 packages

how to make the code more efficient using lapply

2018 May 25

how to make the code more efficient using lapply

Hi Stephen, I am not sure that the "for loop" is the source of slowness. You seem to be doing a lot of unnecessary work each time through the loop. e.g. no need to check if it's the last file, just move that section outside of the loop. It will be executed when the loop finishes. As it is you are calling list.files() each time through the loop which could be slow. In any case

Power of a single sample binomial test

2006 Jul 30

Power of a single sample binomial test

The only references to this I can find searching the archives are to a student who asked in relation to his course work on a stats course. Promise I'm not doing that! I have a situation in which we want to test proportions against an expected proportion, binom.test() is great. I'd like to do some post hoc power tests (the x and n were beyond our control in the survey as all we could set

Collapsing Categorical Variables

2004 Feb 26

Collapsing Categorical Variables

Hi, Suppose I have a categorical variable called STREET, and I have 30 levels for it (i.e. 30 different streets). I want to find all those streets with only 15 observations or below then collapse them into a level called OTHER. Is there a quick way, other than using a for() loop, to do it? Currently what I'm doing is something like: ### Collapse STREET (those < 15) st <- c()

[LLVMdev] [LLD] support for dlltool generated libs in COFF/PECOFF

2015 Jul 23

[LLVMdev] [LLD] support for dlltool generated libs in COFF/PECOFF

Hi again rui, :) I've got all the patches into llvm and clang for supporting mingw-w64 via compiler-rt and now we are able to build a full mingw-w64 toolchain without gcc :) With great help from yaron and rnk. I've CC'd them as they might have interest in seeing this target through with me to the end :) So I have again turned my attention to LLD so that we can also remove ld as a

How do I compute interactions with anova.mlm ?

2008 Sep 09

How do I compute interactions with anova.mlm ?

Hi, I wish to compute multivariate test statistics for a within-subjects repeated measures design with anova.mlm. This works great if I only have two factors, but I don't know how to compute interactions with more than two factors. I suspect, I have to create a new "grouping" factor and then test with this factor to get these interactions (as it is hinted in R News 2007/2), but

Oddities upgrading from 2.4.1 to 2.5.0

2007 May 13

Oddities upgrading from 2.4.1 to 2.5.0

I'm working on Windoze XP and have for some time installed R in D:\R and upgraded my packages after installing the new version of R with a bit of code I think I got from the list: ip <- installed.packages(lib.loc = "d:\\R\\R-2.4.1\\library") ip <- ip[ip[,1]!="base" ,1] install.packages(ip) This time around that's produced some warning grumbles: Warning messages:

similar to: Adding same items together in data.frame