similar to: Adding same items together in data.frame

Displaying 20 results from an estimated 9000 matches similar to: "Adding same items together in data.frame"

2012 Sep 02
2
a newbie seeking for a simple problem
Dear Experienced R users, I have a looks-like simple but complicated problem urgently needed to be solved. Below is the detail: I have two dataframes, df1, df2. df1 contains two column and many thousands rows: column 1 is a "gene_name", column 2 is "value". df2 contains only one column which is "gene_name" with couple hundred rows. I want to change "value"
2009 May 21
3
index to select rows of a large matrix
Dear R Users, I have created a 1500 x 20000 data frame - DataSeq. Each of the 1500 rows represents a data sequence. I have another data frame iData that stores the information of these 1500 data sequences in the same order, for example, condition, gender, etc. If I use "subset" to select certain groups within iData according to some criteria that I have set, e.g. condition, gender Then
2018 Mar 01
3
how to make row.names based on column1 with duplicated values
Dear All, Suppose I have a dataframe like this with many thousands rows all with different names: data.frame(gene=c("a","b","c","d","c","d","c","f"),value=c(20,300,48,55,9,2,100,200)), I want to set column "gene" as row.names, but there are duplicates (c, d), which I want to transform into this as row names:
2018 Mar 01
0
how to make row.names based on column1 with duplicated values
On Wed, 28 Feb 2018, Stephen HonKit Wong wrote: > Dear All, > Suppose I have a dataframe like this with many thousands rows all with > different names: > data.frame(gene=c("a","b","c","d","c","d","c","f"),value=c(20,300,48,55,9,2,100,200)), > > I want to set column "gene" as row.names, but
2007 Jan 29
3
dumpadm and using dumpfile on zfs?
Hi All, I''d like to set up dumping to a file. This file is on a mirrored pool using zfs. It seems that the dump setup doesn''t work with zfs. This worked for both a standard UFS slice and a SVM mirror using zfs. Is there something that I''m doing wrong, or is this not yet supported on ZFS? Note this is Solaris 10 Update 3, but I don''t think that should
2017 Jul 22
0
a difficult situation, how to do this using base function.
1. Please always reply to the list, especially here so that others can see your clarification. 2. What happens if your match.start value exceeds all the cumulative sums?? -- you seem to imply that this cannot happen. Your minimal example, while a little confusing (to me) and in html -- this can get mangled in this plain text list, though seemingly not here -- was very helpful. Essential even .
2003 Apr 24
1
duplicate password prompt
I'm running samba 2.2.8a on a redhat 9 box, everything works great, but ever since I playd with the system-auth I now have to enter a password in twice when I logon to the console. Any one got any ideas ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Iwan Davies Server Support and Development Technician Cyngor Sir Ceredigion County Council IT Section Finance iwandav@ceredigion.gov.uk
2003 May 15
5
Session and open files
Hi, Hi I'm using redhat 9 I've got samba working and I have users onj the system accessing the shares, Can anyone recomend a programme that can tell me whch users have got files open on my samba box Thanks ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Iwan Davies Server Support and Development Technician Cyngor Sir Ceredigion County Council IT Section Finance iwandav@ceredigion.gov.uk
2009 Jun 02
1
plot 4th variable contour lines on filled.contour
Hello, I have a dataset with 4 variables, each consisting of a vector, all with the same length. I start by interpolating the first three variables using the function "interp", and plot the interpolation successfully using "filled.contour". I then interpolate the first two variables and a fourth using "interp" again, but when I try to overlay the contour lines
2018 May 25
2
how to make the code more efficient using lapply
Dear All, I have a following for-loop code which is basically intended to read in many excel files (each file has many columns and rows) in a directory and extract the some rows and columns out of each file and then combine them together into a dataframe. I use for loop which can do the work but quite slow. How to make it faster using lapply function ? Thanks in advance! temp.df<-c() #
2017 Jul 21
1
a difficult situation, how to do this using base function.
Hello, I have a following dataframe with many rows. data.frame(match.start=c(5,10,100,200),range.coordinates=c("1000-1050","1500-1555","5000-5050,6000-6180","100-150,200-260,600-900")) match.start range.coordinates 5 1000-1050 10 1500-1555 100 5000-5050,6000-6180 200
2010 Oct 19
5
For-loop dummy variables?
Hi everyone, I've got a dataset with 12,000 observations. One of the variables (cleary$D1) is for an individual's country, coded 1 - 15. I'd like to create a dummy variable for the Baltic states which are coded 4,6, and 7. In other words, as a dummy variable Baltic states would be coded 1, else 0. I've attempted the following for loop: dummy <- matrix(NA, nrow=nrow(cleary),
2011 Aug 31
1
formatting a 6 million row data set; creating a censoring variable
List, Consider the following data. gender mygroup id 1 F A 1 2 F B 2 3 F B 2 4 F B 2 5 F C 2 6 F C 2 7 F C 2 8 F D 2 9 F D 2 10 F D 2 11 F D 2 12 F D 2 13 F D 2 14 M A 3 15 M A 3 16 M A 3 17
2006 Jun 26
1
Patch for rgl with gcc 4.0 in R 2.3.0 on OS X
Hi, I recently had a problem installing the rgl package on OS X and put together a simple patch. The patched package is available here: http://jinome.stanford.edu/files/rgl_0.66-patched_for_gcc4.tar.gz It can be installed with "R CMD INSTALL rgl_0.66-patched_for_gcc4.tar.gz" as normal at the command line. Also -- as of right now rgl is not in the repository of version 2.3 packages
2018 May 25
0
how to make the code more efficient using lapply
Hi Stephen, I am not sure that the "for loop" is the source of slowness. You seem to be doing a lot of unnecessary work each time through the loop. e.g. no need to check if it's the last file, just move that section outside of the loop. It will be executed when the loop finishes. As it is you are calling list.files() each time through the loop which could be slow. In any case
2006 Jul 30
1
Power of a single sample binomial test
The only references to this I can find searching the archives are to a student who asked in relation to his course work on a stats course. Promise I'm not doing that! I have a situation in which we want to test proportions against an expected proportion, binom.test() is great. I'd like to do some post hoc power tests (the x and n were beyond our control in the survey as all we could set
2004 Feb 26
3
Collapsing Categorical Variables
Hi, Suppose I have a categorical variable called STREET, and I have 30 levels for it (i.e. 30 different streets). I want to find all those streets with only 15 observations or below then collapse them into a level called OTHER. Is there a quick way, other than using a for() loop, to do it? Currently what I'm doing is something like: ### Collapse STREET (those < 15) st <- c()
2015 Jul 23
2
[LLVMdev] [LLD] support for dlltool generated libs in COFF/PECOFF
Hi again rui, :) I've got all the patches into llvm and clang for supporting mingw-w64 via compiler-rt and now we are able to build a full mingw-w64 toolchain without gcc :) With great help from yaron and rnk. I've CC'd them as they might have interest in seeing this target through with me to the end :) So I have again turned my attention to LLD so that we can also remove ld as a
2008 Sep 09
1
How do I compute interactions with anova.mlm ?
Hi, I wish to compute multivariate test statistics for a within-subjects repeated measures design with anova.mlm. This works great if I only have two factors, but I don't know how to compute interactions with more than two factors. I suspect, I have to create a new "grouping" factor and then test with this factor to get these interactions (as it is hinted in R News 2007/2), but
2007 May 13
2
Oddities upgrading from 2.4.1 to 2.5.0
I'm working on Windoze XP and have for some time installed R in D:\R and upgraded my packages after installing the new version of R with a bit of code I think I got from the list: ip <- installed.packages(lib.loc = "d:\\R\\R-2.4.1\\library") ip <- ip[ip[,1]!="base" ,1] install.packages(ip) This time around that's produced some warning grumbles: Warning messages: