thr3ads.net - similar to: "Query about extracting subsets from a table"

Displaying 20 results from an estimated 700 matches similar to: "Query about extracting subsets from a table"

2007 Jan 25

unique/subset problem

Hi I am new to R programming and am using subset to extract part of a data as follows names(dataset) = c("genome1","genome2","dist","score"); prunedrelatives <- subset(dataset, score < -5); However when I use unique to find the number of unique genomes now present in prunedrelatives I get results identical to calling unique(dataset$genome1) although

migrating NT4 PDC net rpc vampire errors with capital letters

2009 Nov 24

migrating NT4 PDC net rpc vampire errors with capital letters

Hi, I have searched for days on Google and can't find a clear answer to my question. I have a NT4 PDC which I am migrating to Samba 3 (Version 3.4.2-47.fc12) on FC12 with kernel(2.6.31.5-127.fc12.i686). I am using tdbsam as my passdb backend. I setup Samba as a BDC and then joined to NT4 Domain succesfully. When I go to vampire the accounts I get lots of errors and some user accounts get

Problems migrating NT4 domain to Samba

2009 Nov 12

Problems migrating NT4 domain to Samba

Hi, I am finally biting the bullet and migrating our NT4 domain to Samba. I am using the following guide: http://vermeulen.ca/linux-windows-nt.html I installed a fresh copy of FC11 and installed samba 3 through yum. Hostname of the linux machine is LEONIDAS. the DOMAIN name is GENOME1. I created an account on the NT4 domain for a backup DC under the server name LEONIDAS. testparm returns that

Query about finding correlations

2007 May 02

Query about finding correlations

Hi I have a dataframe which has 3 columns of numeric data A,B,C each of which has been obtained independent of the other. We are trying to find out, which of A or B cause C i.e. We are hypothesising that C is the effect and either A or B, not both is the cause. i.e. A causes C and this cause-effect relationship explains B. The data for A contains more noise than that for B. We are working with

Query about using optimizers in R without causing program to crash

2007 Jan 22

Query about using optimizers in R without causing program to crash

Hi I am a newbie to R and am using the lm function to fit my data. This optimization is to be performed for around 45000 files not all of which lend themselves to optimization. Some of these will and do crash. However, How do I ensure that the program simply goes to the next file in line without exiting the code with the error "Error in lm.fit(x, y, offset = offset, singular.ok =

Packages in R for least median squares regression and computing outliers (thompson tau technique etc.)

2007 Feb 28

Packages in R for least median squares regression and computing outliers (thompson tau technique etc.)

Hi I am looking for suitable packages in R that do regression analyses using least median squares method (or better). Additionally, I am also looking for packages that implement algorithms/methods for detecting outliers that can be discarded before doing the regression analyses. Although some websites refer to "lms" method under package "lps" in R, I am unable to find such a

table of means/medians across bins used for a histogram

2006 May 01

table of means/medians across bins used for a histogram

Hi I am trying to get a table of means of parameter 1 across BINS of parameter 2. I am working in proteomics and a sample of my data is as follows cluster-age clock-rate(evolutionary rate) scopclass 0.002 10 A 0.045 0.1 B 0.13 15 A 0.15 34 D .... .... ....

Query about using table

2006 Oct 26

Query about using table

Hi I have data of the following form ID age member_FLAG 1 25 Y 2 36.75 N 3 75.5 N ......... ......... I want to get a histogram of this data showing distribution of member_flag in each age-bin i.e. how many values in each age bin have a member_flag of 'Y' and how many have 'N'. I was able to do the same using barplot2. However I also need similar

Query about data manipulation

2007 Mar 01

Query about data manipulation

Hi Thanks much for the prompt response to my earlier enquiry on packages for regression analyses. Along the same topic(?), I have another question about which I could use some input. I am retreiving data from a MySQL database using RODBC. The table has many BLOB columns and each BLOB column has data in the format "id1 \t id2 \t measure \n id3 \t id4 \t measure...." (i.e. multiple rows

Package for phylogenetic tree analyses

2007 Jan 26

Package for phylogenetic tree analyses

Hi I am looking for a package that 1. reads in a phylogenetic tree in NEXUS format 2. given two members/nodes on the tree, can return the distance between the two using the tree. I came across the following packages on CRAN ouch, ape, apTreeShape, phylgr all of which seem to provide extensive range of functions for reading in a Nexus-format tree and performing phylogenetic analyses, tree

Query about getting averages across a certain parameter in a table

2006 Jul 11

Query about getting averages across a certain parameter in a table

Hi I have a table that goes data cluster_ac clockrate age class 7337 0.9 0.001 alpha_proteins 7888 0.1 0.78 beta proteins etc The class column can have 7-8 different unique values While the clockrate and age columns are floats varying from 0 to 1. I wish to get the average clockrate across each of the classes for this data. I would appreciate your help

Query about substituting characters in a df

2007 Mar 12

Query about substituting characters in a df

Hi I have a data frame with 40,000 rows and 4 columns, one of which is "class". For each row, the "class" column can be one of 10 possible NUMERIC values. I wish to substitute these numeric values with words/characters. For example, I wish to substitute all occurences of "5467" in the column "class" with "alpha", "7867" with

Order variables automatically

2013 Jan 01

Order variables automatically

Hi, I have a dataset with 6 categorical variables. I have used this following code to make the variables u1-u6 ordered factors and this works well. cat1cat2 cat3 cat4 cat5 cat6 ? 0 ? ?? 1 ? ? 1????? 0 ??? 0? ?? 1 ? 1 ? ?? 1 ? ? 0 ? ?? 0 ? ? 0 ? ? 0 ....... .... ############ data<-read,table("example.txt") data <- as.data.frame(lapply(data, ordered)) ############ Now,

identify selected substances across individuals

2007 Jan 21

identify selected substances across individuals

An embedded and charset-unspecified text was scrubbed... Name: inte tillg?nglig Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070121/436ed377/attachment.pl

Computing row differences in new columns

2011 Mar 21

Computing row differences in new columns

Hi I have the following columns with dates and results, sorted by subject and date. I'd like to compute the differences in dates and results for each patient, based on the previous row. Obviously the last entry for each subject should be a NA. Which would be the best way to accomplished that ? I guess questions like that have been already answered a thousand times, so I apologize for

Help with interpolation

2013 Jan 17

Help with interpolation

hi guys I need to interpolate values for the zero coupon yield curve. Following data is given date days rate 1996 01

Plot 3 lines in one graph

2012 Nov 05

Plot 3 lines in one graph

I'm new with R. I want to plot 3 lines in one graph. This is my data: print(x) V1 V2 V3 V41 -4800 25195.73 7415.219 7264.282 -2800 15195.73 5415.219 7264.28 I tried using matplot, but I cannot get exactly what I want. This is what I get, and this is my code: matplot(x[,1],x[,-1],type='b', xlab = "epsilon_h", ylab = "Value2", xlim=

Subsetting data systematically

2011 Jun 22

Subsetting data systematically

I would like to subset data from a larger dataset and generate a smaller dataset. However, I don't want to use sample() because it does it randomly. I would like to take non-random subsamples, for example, every 2nd number, or every 3rd number. Is there a procedure that does this? Thanks, Nate -- View this message in context:

how to rearrange a dataframe

2010 Feb 23

how to rearrange a dataframe

Hi all, I'd appreciate if anyone can help me with this... I have a data frame that looks like this: 1 + name1 1 2 3 2 + name2 5 9 10 2 - name3 56 74 93 1 - name4 65 75 98 I need to rearrange this in a way so that the rows with "1" in the first column, and "-" in the second column; then columns 4 and 6 should switch places. That is, column 6 would be now column 4 and

Trouble retrieving the second largest value from each row of a data.frame

2010 Jul 24

Trouble retrieving the second largest value from each row of a data.frame

I have a data frame with a couple million lines and want to retrieve the largest and second largest values in each row, along with the label of the column these values are in. For example row 1 strongest=-11072 secondstrongest=-11707 strongestantenna=value120 secondstrongantenna=value60 Below is the code I am using and a truncated data.frame. Retrieving the largest value was easy, but I have

similar to: Query about extracting subsets from a table