similar to: Query about extracting subsets from a table

2007 Jan 25
1
unique/subset problem
Hi, I am new to R programming and am using subset to extract part of a dataset as follows: names(dataset) = c("genome1","genome2","dist","score"); prunedrelatives <- subset(dataset, score < -5); However, when I use unique to find the number of unique genomes now present in prunedrelatives, I get results identical to calling unique(dataset$genome1) although
2009 Nov 24
1
migrating NT4 PDC net rpc vampire errors with capital letters
Hi, I have searched for days on Google and can't find a clear answer to my question. I have an NT4 PDC which I am migrating to Samba 3 (Version 3.4.2-47.fc12) on FC12 with kernel (2.6.31.5-127.fc12.i686). I am using tdbsam as my passdb backend. I set up Samba as a BDC and then joined it to the NT4 domain successfully. When I go to vampire the accounts I get lots of errors and some user accounts get
2009 Nov 12
0
Problems migrating NT4 domain to Samba
Hi, I am finally biting the bullet and migrating our NT4 domain to Samba. I am using the following guide: http://vermeulen.ca/linux-windows-nt.html I installed a fresh copy of FC11 and installed Samba 3 through yum. The hostname of the Linux machine is LEONIDAS. The domain name is GENOME1. I created an account on the NT4 domain for a backup DC under the server name LEONIDAS. testparm returns that
2007 May 02
3
Query about finding correlations
Hi, I have a dataframe which has 3 columns of numeric data A, B, C, each of which has been obtained independently of the others. We are trying to find out which of A or B causes C, i.e. we are hypothesising that C is the effect and either A or B, not both, is the cause, i.e. A causes C and this cause-effect relationship explains B. The data for A contains more noise than that for B. We are working with
2007 Jan 22
2
Query about using optimizers in R without causing program to crash
Hi, I am a newbie to R and am using the lm function to fit my data. This optimization is to be performed for around 45000 files, not all of which lend themselves to optimization. Some of these will and do crash. However, how do I ensure that the program simply goes to the next file in line without exiting the code with the error "Error in lm.fit(x, y, offset = offset, singular.ok =
2007 Feb 28
3
Packages in R for least median squares regression and computing outliers (thompson tau technique etc.)
Hi I am looking for suitable packages in R that do regression analyses using least median squares method (or better). Additionally, I am also looking for packages that implement algorithms/methods for detecting outliers that can be discarded before doing the regression analyses. Although some websites refer to "lms" method under package "lps" in R, I am unable to find such a
2006 May 01
4
table of means/medians across bins used for a histogram
Hi I am trying to get a table of means of parameter 1 across BINS of parameter 2. I am working in proteomics and a sample of my data is as follows cluster-age clock-rate(evolutionary rate) scopclass 0.002 10 A 0.045 0.1 B 0.13 15 A 0.15 34 D .... .... ....
2006 Oct 26
2
Query about using table
Hi I have data of the following form ID age member_FLAG 1 25 Y 2 36.75 N 3 75.5 N ......... ......... I want to get a histogram of this data showing distribution of member_flag in each age-bin i.e. how many values in each age bin have a member_flag of 'Y' and how many have 'N'. I was able to do the same using barplot2. However I also need similar
2007 Mar 01
2
Query about data manipulation
Hi, thanks much for the prompt response to my earlier enquiry on packages for regression analyses. Along the same topic(?), I have another question about which I could use some input. I am retrieving data from a MySQL database using RODBC. The table has many BLOB columns and each BLOB column has data in the format "id1 \t id2 \t measure \n id3 \t id4 \t measure...." (i.e. multiple rows
2007 Jan 26
1
Package for phylogenetic tree analyses
Hi I am looking for a package that 1. reads in a phylogenetic tree in NEXUS format 2. given two members/nodes on the tree, can return the distance between the two using the tree. I came across the following packages on CRAN ouch, ape, apTreeShape, phylgr all of which seem to provide extensive range of functions for reading in a Nexus-format tree and performing phylogenetic analyses, tree
2006 Jul 11
1
Query about getting averages across a certain parameter in a table
Hi I have a table that goes data cluster_ac clockrate age class 7337 0.9 0.001 alpha_proteins 7888 0.1 0.78 beta proteins etc The class column can have 7-8 different unique values While the clockrate and age columns are floats varying from 0 to 1. I wish to get the average clockrate across each of the classes for this data. I would appreciate your help
2007 Mar 12
2
Query about substituting characters in a df
Hi, I have a data frame with 40,000 rows and 4 columns, one of which is "class". For each row, the "class" column can be one of 10 possible NUMERIC values. I wish to substitute these numeric values with words/characters. For example, I wish to substitute all occurrences of "5467" in the column "class" with "alpha", "7867" with
2013 Jan 01
1
Order variables automatically
Hi, I have a dataset with 6 categorical variables. I have used the following code to make the variables u1-u6 ordered factors and this works well. cat1 cat2 cat3 cat4 cat5 cat6 0 1 1 0 0 1 1 1 0 0 0 0 ....... .... ############ data <- read.table("example.txt") data <- as.data.frame(lapply(data, ordered)) ############ Now,
2007 Jan 21
1
identify selected substances across individuals
An embedded and charset-unspecified text was scrubbed... Name: inte tillgänglig Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070121/436ed377/attachment.pl
2011 Mar 21
3
Computing row differences in new columns
Hi, I have the following columns with dates and results, sorted by subject and date. I'd like to compute the differences in dates and results for each patient, based on the previous row. Obviously the last entry for each subject should be an NA. Which would be the best way to accomplish that? I guess questions like that have been already answered a thousand times, so I apologize for
2013 Jan 17
1
Help with interpolation
hi guys I need to interpolate values for the zero coupon yield curve. Following data is given date days rate 1996 01
2012 Nov 05
1
Plot 3 lines in one graph
I'm new with R. I want to plot 3 lines in one graph. This is my data: print(x) V1 V2 V3 V4 1 -4800 25195.73 7415.219 7264.28 2 -2800 15195.73 5415.219 7264.28 I tried using matplot, but I cannot get exactly what I want. This is what I get, and this is my code: matplot(x[,1],x[,-1],type='b', xlab = "epsilon_h", ylab = "Value2", xlim=
2011 Jun 22
1
Subsetting data systematically
I would like to subset data from a larger dataset and generate a smaller dataset. However, I don't want to use sample() because it does it randomly. I would like to take non-random subsamples, for example, every 2nd number, or every 3rd number. Is there a procedure that does this? Thanks, Nate
2010 Feb 23
3
how to rearrange a dataframe
Hi all, I'd appreciate if anyone can help me with this... I have a data frame that looks like this: 1 + name1 1 2 3 2 + name2 5 9 10 2 - name3 56 74 93 1 - name4 65 75 98 I need to rearrange this in a way so that the rows with "1" in the first column, and "-" in the second column; then columns 4 and 6 should switch places. That is, column 6 would be now column 4 and
2010 Jul 24
4
Trouble retrieving the second largest value from each row of a data.frame
I have a data frame with a couple million lines and want to retrieve the largest and second largest values in each row, along with the label of the column these values are in. For example row 1 strongest=-11072 secondstrongest=-11707 strongestantenna=value120 secondstrongantenna=value60 Below is the code I am using and a truncated data.frame. Retrieving the largest value was easy, but I have