Displaying 20 results from an estimated 700 matches similar to: "How to read this file into R."
2010 May 25
4
R eat my data
Hi, dear R community,
My original file has 1932 lines, but after I read it into R it has only 1068
rows. How come?
cdu@nuuk:~/operon$ wc -l id_name_gh5.txt
1932 id_name_gh5.txt
> gene_name<-read.table("/home/cdu/operon/id_name_gh5.txt", sep="\t",
skip=0, header=F, fill=T)
> dim(gene_name)
[1] 1068 3
--
Sincerely,
Changbin
--
Changbin Du
DOE Joint Genome
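A possible explanation, offered only as an assumption because the file itself is not shown: with read.table's defaults, apostrophes or double quotes inside fields are treated as quote characters and '#' starts a comment, so several lines can be merged or cut short. The sketch below uses a small made-up file (not the poster's data) to show how count.fields and the quote and comment.char arguments can be used to check for this.

```r
# Hypothetical stand-in for id_name_gh5.txt: tab-separated, with an
# apostrophe and a '#' inside some fields.
tmp <- tempfile(fileext = ".txt")
writeLines(c("id1\tgene's name\tGH5",
             "id2\tplain name\tGH5",
             "id3\tprotein #3\tGH5",
             "id4\tanother's name\tGH5"), tmp)

# Count fields per line with quoting and comments switched off;
# every line should report the same number of fields.
fields <- count.fields(tmp, sep = "\t", quote = "", comment.char = "")
table(fields)

# Read the file with the same settings so that quotes and '#' are
# kept as ordinary characters.
gene_name <- read.table(tmp, sep = "\t", header = FALSE, fill = TRUE,
                        quote = "", comment.char = "",
                        stringsAsFactors = FALSE)
dim(gene_name)
```

If the row count matches wc -l after this change, stray quote or comment characters were the likely cause; if not, the file needs a closer look.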
2011 Jun 22
1
question about read.columns
Hi, dear R community,
I have a large data set named dd.txt with 2402 variables. The columns
are:
a1, b1, ..z1, a11, b11, ...z11, a111, b111, ..z111..
If I don't know the relative positions of the columns, but I know I need the
following variables:
var <- c("a1", "c1", "a11", "b11", "f111")
Can I use read.columns to read the data into R?
I have tried the following code, but it does not work
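One way to pick columns by name without knowing their positions, sketched here with base R only rather than read.columns: read just the header, then pass colClasses so that unwanted columns are skipped. The small file and its column names are invented for illustration, and a tab-separated file with a header line is assumed.

```r
# Hypothetical stand-in for dd.txt with a header line.
tmp <- tempfile(fileext = ".txt")
write.table(data.frame(a1 = 1:3, b1 = 4:6, c1 = 7:9, a11 = 10:12),
            tmp, sep = "\t", row.names = FALSE, quote = FALSE)

wanted <- c("a1", "c1", "a11")

# Read one row only to learn the column names.
header <- names(read.table(tmp, header = TRUE, sep = "\t", nrows = 1))

# "NULL" tells read.table to skip a column; NA lets it guess the type.
classes <- ifelse(header %in% wanted, NA, "NULL")

dd <- read.table(tmp, header = TRUE, sep = "\t", colClasses = classes)
names(dd)
```

The columns come back in file order, so reorder with dd[, wanted] if the order in the vector matters.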
2010 Jun 02
1
how to label the som notes by the majority vote
Hi, dear R community,
I am using the following code to fit a SOM. I tried to label the nodes by
majority vote, either through mapping or prediction.
I attached my output: the left one doesn't have any labels in the nodes, the
right one has more than one label in each node. I need to have only one
label for each node, either by majority vote or prediction.
Can anyone give some suggestions or
2010 Apr 23
1
help in conditional histogram
Dear Dr. Sarkar,
When I tried to run the code, I ran into the following problem:
> h<- sample(1:14, 319, rep=T)
> c<- sample(1:14, 608, rep=T)
> n<- sample(1:14, 1140, rep=T)
> vt<-c(h, c, n)
> ta<-rep(c("h", "c", "n"), c(319, 608, 1140))
>
> to<-data.frame(vt,ta)
> library(lattice)
Attaching package: 'lattice'
2010 Apr 06
2
help output figures in R
somfunc<- function (file) {
aa_som<-scale(file)
final.som<-som(data=aa_som, rlen=10000, grid=somgrid(5,4, "hexagonal"))
pdf(file="/home/cdu/changbin/file.pdf") #output graphic file.
plot(final.som, main="Unsupervised SOM")
dev.off()
}
I have many different files. If I want to output a PDF file with the same
name as each dataset I feed to the function
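A minimal sketch of one approach, assuming the dataset name can be passed in as a string: build the PDF path from that name with paste0 and file.path. The function plot_to_pdf, its arguments, and the use of a plain plot() in place of the SOM plot are all illustrative stand-ins, not the poster's code.

```r
# Hypothetical helper: writes one PDF per dataset, named after it.
plot_to_pdf <- function(data, name, out_dir = tempdir()) {
  out_file <- file.path(out_dir, paste0(name, ".pdf"))
  pdf(file = out_file)
  on.exit(dev.off())            # close the device even if plot() fails
  plot(data, main = name)       # stand-in for plot(final.som, ...)
  invisible(out_file)
}

# Built-in data frames used as example datasets.
datasets <- list(first = cars, second = pressure)

# One call per dataset; the list names become the file names.
paths <- mapply(plot_to_pdf, datasets, names(datasets))
basename(paths)
```

Keeping the datasets in a named list means the names travel with the data, so nothing has to be recovered from inside the function.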
2010 Sep 07
1
change the for loops with lapply
cv.fold<-function(i, size=3, rang=0.3){
cat('Fold ', i, '\n')
out.fold.c <-((i-1)*c.each.part +1):(i*c.each.part)
out.fold.n <-((i-1)*n.each.part +1):(i*n.each.part)
train.cv <- n.cc[-out.fold.c, c(2:2401, 2417)]
train.nv <- n.nn[-out.fold.n, c(2:2401, 2417)]
train.v<-rbind(train.cv, train.nv) #training data for feature
2010 May 26
1
how to Store loop output from a function
Hi, dear R community,
I am writing the following function to create one data set (tree.pred) and
one vector (valid.out) from loops. Later, I want to use the data set from
this loop to plot curves. I have tried return and list, but I cannot use the
tree.pred data and valid.out vector.
auc.tree<- function(msplit,mbucket) {
  tree.pred<-data.frame()
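Since the rest of the function is cut off, the following is only a generic sketch of the usual pattern: collect per-iteration results inside the loop, return both objects in a named list, and pull them out of the result with $. The function collect_results and the values it stores are made up for illustration.

```r
# Hypothetical function that builds a data frame and a vector in a loop.
collect_results <- function(n_iter = 3) {
  pred_list <- vector("list", n_iter)   # one data frame per iteration
  valid_out <- numeric(n_iter)          # one number per iteration
  for (i in seq_len(n_iter)) {
    pred_list[[i]] <- data.frame(iter = i, value = i^2)
    valid_out[i] <- i * 10
  }
  # Return both objects together; a function can return only one value,
  # so they are wrapped in a named list.
  list(tree.pred = do.call(rbind, pred_list), valid.out = valid_out)
}

res <- collect_results(3)
res$tree.pred    # the combined data frame
res$valid.out    # the vector
```

The result has to be assigned (res <- ...) for the objects to be usable later; objects created inside a function are not visible outside it.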
2010 Nov 04
4
how to work with long vectors
Hi, dear R community,
I have a data set like the one below, and I want to calculate the
cumulative coverage. The following code works for a small data set (#rows =
100), but when I feed it the whole data set, it is still running after 24 hours.
Can someone give some suggestions for long vectors?
id reads
Contig79:1 4
Contig79:2 8
Contig79:3 13
Contig79:4 14
Contig79:5 17
2011 May 31
1
Please HELP
Hi R people.
I have a problem. How can I use functions to create the
combinations of distances used in multiple comparison tests such as
Duncan's multiple range test?
cur=sample(rnorm(15),15,replace=TRUE)
varr=factor(c(rep("var1",5),rep("var2",5),rep("var3",5)))
dat<-data.frame(cur, varr)
lmm<-lm(cur~ varr,data=dat)
2013 Feb 26
2
Efficient way to perform linear regressions
Hi All,
I have millions of regression lines to fit. So I am looking for the
most efficient approach in R.
Details:
I have a large design matrix X. The dimension is n by p.
Each time a model is fitted, I select rows from this matrix X to
form a new design matrix, called X_current.
There is another binary matrix M, with dim m by n, and each row is a
1*n vector. It helps to determine X_current.
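The setup described above can be sketched as follows, assuming a response vector y of length n (the snippet does not show one) and assuming each row of M marks with 1 the rows of X to keep. The sketch calls lm.fit directly, which skips the formula and model-frame overhead of lm; the data are simulated purely for illustration.

```r
set.seed(1)
n <- 50; p <- 3; m <- 4

# Simulated design matrix (first column is the intercept).
X <- cbind(1, matrix(rnorm(n * (p - 1)), n))
y <- rnorm(n)                               # assumed response vector
M <- matrix(rbinom(m * n, 1, 0.6), m, n)    # binary row selectors

# One least-squares fit per row of M, using only the selected rows.
coefs <- t(apply(M, 1, function(keep) {
  rows <- keep == 1
  lm.fit(X[rows, , drop = FALSE], y[rows])$coefficients
}))

dim(coefs)   # one row of coefficients per row of M
```

Whether this is fast enough for millions of fits depends on n and p; it only illustrates the row-selection step with a lighter-weight fitting function.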
2012 Nov 16
1
polycor package
I am currently working with R's polycor package and I have encountered a
problem. I tried to follow the steps as outlined in the sem.pdf file where
a CFA model is run using polychoric correlations. Every time I run the
command sem(model, data, N=.), I get the following warning message:
Warning message:
In if (orthogonal) { :
the condition has length > 1 and only the first element will be
2008 Aug 22
3
simple generation of artificial data with defined features
Dear R-colleagues,
I am quite a newbie to R fighting my stupidity to solve a probably quite
simple problem of generating artificial data with defined features.
I am conducting a study of inter-observer-agreement in
child-bronchoscopy. One of the most important measures is Kappa
according to Fleiss, which is very conveniently available in R through
the irr package.
Unfortunately medical doctors
2004 Dec 15
3
adding perspectives to existing persp plots
I've created a perspective plot using 'persp' in the graphics package.
I'd like to add a second plane of z values to the existing plot, but I
cannot seem to do this using 'persp'. Is there an analogue to 'lines' or
'points' for perspectives?
Corey.
corey.bradshaw at cdu.edu.au
2010 Apr 26
3
R.GBM package
Hi, dear Greg,
I am new to the gbm package. Can boosted decision trees be implemented in
the 'gbm' package? Or can 'gbm' only be used for regression?
If they can, do I need to combine the rpart and gbm commands?
Thanks so much!
--
Sincerely,
Changbin
--
2010 Jun 19
1
question about boosting(Adaboosting. M1)
Hi, guys,
I am trying to use the AdaBoost.M1 algorithm to integrate three models.
I found that the sum of the weights for each model is not equal to one.
How should I deal with this?
Thanks, any response or suggestions are appreciated!
--
Sincerely,
Changbin
--
2010 Apr 19
0
help in output file
Hi, dear R community,
I am using the following code to grow and plot a tree:
# Classification Tree with rpart
library(rpart)
pdf(file="/home/cdu/changbin/dimer_tree.pdf")
# grow tree
fit.dimer <- rpart(outcome ~ ., method="class", data=p.dimer[,2:402])
plotcp(fit.dimer) # visualize cross-validation results
# plot tree
plot(fit.dimer, uniform=TRUE,
2010 Jun 15
1
output from the gbm package
Hi, dear Greg and R community,
I have one question about the output of the gbm package. The output of boosting
should be f(x); from it, how do I calculate the probability for each
observation in the data set?
Since it is stochastic, how can I guarantee that each observation in the training
data is selected at least once? If some observations are not selected, how do I
calculate the training error?
Thanks!
--
2010 Apr 26
1
boosting with decision tree
Hi, dear R community,
Does anyone know how to construct a decision tree with boosting? Is there any
tutorial I can read?
--
Sincerely,
Changbin
--
2010 Apr 29
1
variable importance in Random Forest
Hi, dear Andy,
I ran randomForest in R and got the following results for variable
importance:
What is the meaning of MeanDecreaseAccuracy and MeanDecreaseGini?
I found they are raw values; they are not scaled to 1, right?
Which column is most similar to the variable rel.influence in boosting?
Thanks so much!
> fit$importance
0 1
2011 Feb 11
1
Extract a slot value from a 'SpatialPolygons' class object
Dear R-users,
I’m currently trying to extract the value of a slot (area) but can’t find out how to do that.
>str(overlperc)
List of 1
$ :Formal class 'SpatialPolygons' [package "sp"] with 4 slots
.. ..@ polygons :List of 1
.. .. ..$ :Formal class 'Polygons' [package "sp"] with 5 slots
.. .. .. .. ..@ Polygons :List of 1
.. .. .. .. .. ..$