Displaying 20 results from an estimated 700 matches similar to: "Query about extracting subsets from a table"
2007 Jan 25
1
unique/subset problem
Hi
I am new to R programming and am using subset to
extract part of a data as follows
names(dataset) =
c("genome1","genome2","dist","score");
prunedrelatives <- subset(dataset, score < -5);
However when I use unique to find the number of unique
genomes now present in prunedrelatives I get results
identical to calling unique(dataset$genome1) although
2009 Nov 24
1
migrating NT4 PDC net rpc vampire errors with capital letters
Hi,
I have searched for days on Google and can't find a clear answer to my
question. I have a NT4 PDC which I am migrating to Samba 3 (Version
3.4.2-47.fc12) on FC12 with kernel(2.6.31.5-127.fc12.i686). I am using
tdbsam as my passdb backend.
I setup Samba as a BDC and then joined to NT4 Domain succesfully. When I go
to vampire the accounts I get lots of errors and some user accounts get
2009 Nov 12
0
Problems migrating NT4 domain to Samba
Hi,
I am finally biting the bullet and migrating our NT4 domain to Samba.
I am using the following guide: http://vermeulen.ca/linux-windows-nt.html
I installed a fresh copy of FC11 and installed samba 3 through yum.
Hostname of the linux machine is LEONIDAS. the DOMAIN name is
GENOME1. I created an account on the NT4 domain for a backup DC under
the server name LEONIDAS.
testparm returns that
2007 May 02
3
Query about finding correlations
Hi
I have a dataframe which has 3 columns of numeric data
A,B,C each of which has been obtained independent of
the other.
We are trying to find out, which of A or B cause C
i.e. We are hypothesising that C is the effect and
either A or B, not both is the cause.
i.e. A causes C and this cause-effect relationship
explains B.
The data for A contains more noise than that for B.
We are working with
2007 Jan 22
2
Query about using optimizers in R without causing program to crash
Hi
I am a newbie to R and am using the lm function to
fit my data.
This optimization is to be performed for around 45000
files not all of which lend themselves to
optimization. Some of these will and do crash.
However, How do I ensure that the program simply goes
to the next file in line without exiting the code with
the error
"Error in lm.fit(x, y, offset = offset, singular.ok =
2007 Feb 28
3
Packages in R for least median squares regression and computing outliers (thompson tau technique etc.)
Hi
I am looking for suitable packages in R that do
regression analyses using least median squares method
(or better). Additionally, I am also looking for
packages that implement algorithms/methods for
detecting outliers that can be discarded before doing
the regression analyses.
Although some websites refer to "lms" method under
package "lps" in R, I am unable to find such a
2006 May 01
4
table of means/medians across bins used for a histogram
Hi
I am trying to get a table of means of parameter 1
across BINS of parameter 2.
I am working in proteomics and a sample of my data is
as follows
cluster-age clock-rate(evolutionary rate) scopclass
0.002 10 A
0.045 0.1 B
0.13 15 A
0.15 34 D
....
....
....
2006 Oct 26
2
Query about using table
Hi
I have data of the following form
ID age member_FLAG
1 25 Y
2 36.75 N
3 75.5 N
.........
.........
I want to get a histogram of this data showing
distribution of member_flag in each age-bin i.e. how
many values in each age bin have a member_flag of 'Y'
and how many have 'N'.
I was able to do the same using barplot2.
However I also need similar
2007 Mar 01
2
Query about data manipulation
Hi
Thanks much for the prompt response to my earlier
enquiry on packages for regression analyses.
Along the same topic(?), I have another question about
which I could use some input.
I am retreiving data from a MySQL database using
RODBC.
The table has many BLOB columns and each BLOB column
has data in the format
"id1 \t id2 \t measure \n id3 \t id4 \t measure...."
(i.e. multiple rows
2007 Jan 26
1
Package for phylogenetic tree analyses
Hi
I am looking for a package that
1. reads in a phylogenetic tree in NEXUS format
2. given two members/nodes on the tree, can return the
distance between the two using the tree.
I came across the following packages on CRAN
ouch, ape, apTreeShape, phylgr all of which seem to
provide extensive range of functions for reading in a
Nexus-format tree and performing phylogenetic
analyses, tree
2006 Jul 11
1
Query about getting averages across a certain parameter in a table
Hi
I have a table that goes
data
cluster_ac clockrate age class
7337 0.9 0.001 alpha_proteins
7888 0.1 0.78 beta proteins
etc
The class column can have 7-8 different unique values
While the clockrate and age columns are floats varying
from 0 to 1.
I wish to get the average clockrate across each of the
classes for this data.
I would appreciate your help
2007 Mar 12
2
Query about substituting characters in a df
Hi
I have a data frame with 40,000 rows and 4 columns,
one of which is "class".
For each row, the "class" column can be one of 10
possible NUMERIC values.
I wish to substitute these numeric values with
words/characters.
For example, I wish to substitute all occurences of
"5467" in the column "class" with "alpha", "7867" with
2013 Jan 01
1
Order variables automatically
Hi,
I have a dataset with 6 categorical variables. I have used this following code to make the variables u1-u6 ordered factors and this works well.
cat1cat2 cat3 cat4 cat5 cat6
? 0 ? ?? 1 ? ? 1????? 0 ??? 0? ?? 1
? 1 ? ?? 1 ? ? 0 ? ?? 0 ? ? 0 ? ? 0
.......
....
############
data<-read,table("example.txt")
data <- as.data.frame(lapply(data, ordered))
############
Now,
2007 Jan 21
1
identify selected substances across individuals
An embedded and charset-unspecified text was scrubbed...
Name: inte tillg?nglig
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20070121/436ed377/attachment.pl
2011 Mar 21
3
Computing row differences in new columns
Hi
I have the following columns with dates and results, sorted by subject and date. I'd like to compute the differences in dates and results for each patient, based on the previous row. Obviously the last entry for each subject should be a NA.
Which would be the best way to accomplished that ?
I guess questions like that have been already answered a thousand times, so I apologize for
2013 Jan 17
1
Help with interpolation
hi guys
I need to interpolate values for the zero coupon yield curve. Following data
is given
date days rate
1996 01
2012 Nov 05
1
Plot 3 lines in one graph
I'm new with R. I want to plot 3 lines in one graph. This is my data:
print(x)
V1 V2 V3 V41 -4800 25195.73 7415.219 7264.282
-2800 15195.73 5415.219 7264.28
I tried using matplot, but I cannot get exactly what I want. This is what I
get, and this is my code:
matplot(x[,1],x[,-1],type='b', xlab = "epsilon_h",
ylab = "Value2", xlim=
2011 Jun 22
1
Subsetting data systematically
I would like to subset data from a larger dataset and generate a smaller
dataset. However, I don't want to use sample() because it does it randomly.
I would like to take non-random subsamples, for example, every 2nd number,
or every 3rd number. Is there a procedure that does this?
Thanks, Nate
--
View this message in context:
2010 Feb 23
3
how to rearrange a dataframe
Hi all,
I'd appreciate if anyone can help me with this...
I have a data frame that looks like this:
1 + name1 1 2 3
2 + name2 5 9 10
2 - name3 56 74 93
1 - name4 65 75 98
I need to rearrange this in a way so that the rows with "1" in the
first column, and "-" in the second column; then columns 4 and 6
should switch places. That is, column 6 would be now column 4 and
2010 Jul 24
4
Trouble retrieving the second largest value from each row of a data.frame
I have a data frame with a couple million lines and want to retrieve the largest and second largest values in each row, along with the label of the column these values are in. For example
row 1
strongest=-11072
secondstrongest=-11707
strongestantenna=value120
secondstrongantenna=value60
Below is the code I am using and a truncated data.frame. Retrieving the largest value was easy, but I have