thr3ads.net - similar to: "get the row sums"

Displaying 20 results from an estimated 800 matches similar to: "get the row sums"

2010 May 05

sort the data set by one variable

> #sort the data by predicted probability > b.order<-bo.id.pred[(order(-predict)),] > b.order[1:20,] gene_id predict 43 637882902 0.07823997 53 638101634 0.66256490 61 639084581 0.08587504 41 637832824 0.02461066 25 637261662 0.11613879 22 637240022 0.06350477 62 639084582 0.02238538 63 639097718 0.06792841 44 637943079 0.04532625 80 640158389 0.06582658 3 637006517 0.57648451

help in attach function

2010 Apr 07

help in attach function

Hi, r-community, This morning, I MET the following problem several times when I try to attach the data set. When I closed the current console and reopen the R console, the problem disappear. BUt with the time passed on, the problem occurs again. Can anyone help me with this? > attach(total) The following object(s) are masked from total ( position 3 ) : acid base cell_evalue

help output figures in R

2010 Apr 06

help output figures in R

somfunc<- function (file) { aa_som<-scale(file) final.som<-som(data=aa_som, rlen=10000, grid=somgrid(5,4, "hexagonal")) pdf(file="/home/cdu/changbin/file.pdf") #output graphic file. plot(final.som, main="Unsupervised SOM") dev.off() } I have many different files, if I want output pdf file with the same name as for each dataset I feed to the function

probabilities in svm output in e1071 package

2010 May 05

probabilities in svm output in e1071 package

svm.fit<-svm(as.factor(out) ~ ., data=all_h, method="C-classification", kernel="radial", cost=bestc, gamma=bestg, cross=10) # model fitting svm.pred<-predict(svm.fit, hh, decision.values = TRUE, probability = TRUE) # find the probability, but can not find. attr(svm.pred, "probabilities") > attr(svm.pred, "probabilities") 1 0 1 0 0 2 0

output from the gbm package

2010 Jun 15

output from the gbm package

HI, Dear Greg and R community, I have one question about the output of gbm package. the output of Boosting should be f(x), from it , how to calculate the probability for each observations in data set? SInce it is stochastic, how can guarantee that each observation in training data are selected at least once? IF SOME obs are not selected, how to calculate the training error? Thanks? --

my function does not work for large data set

2010 Dec 16

my function does not work for large data set

Dear R community, I have one function, it works for small data set, but does not work on large data set, can anyone help me with this? > #creat new variable by dividing each aa dimer by total_length. > imper<-function(x, file) { + round(x/file$length, 5) + } > dim(test) [1] 999 2402 > test[varname[2:2401]]<-

R eat my data

2010 May 25

R eat my data

HI, Dear R community, My original file has 1932 lines, but when I read into R, it changed to 1068 lines, how comes? cdu@nuuk:~/operon$ wc -l id_name_gh5.txt 1932 id_name_gh5.txt > gene_name<-read.table("/home/cdu/operon/id_name_gh5.txt", sep="\t", skip=0, header=F, fill=T) > dim(gene_name) [1] 1068 3 -- Sincerely, Changbin -- Changbin Du DOE Joint Genome

return counts of elements on a table column depending on elements on another column

2011 Jun 08

return counts of elements on a table column depending on elements on another column

Hi, I am given the following table: > head(hsa_refseq) chr genome region start stop nu strand nu.1 nu.2 gene_id 1 chr1 hg19_refGene CDS 67000042 67000051 0 + 0 gene_id NM_032291 2 chr1 hg19_refGene exon 66999825 67000051 0 + . gene_id NM_032291 3 chr1 hg19_refGene CDS 67091530 67091593 0 + 2 gene_id NM_032291 4 chr1 hg19_refGene exon

r-loop

2010 Apr 15

r-loop

HI, Dear community, I am building the following loop, ww<-function(file) { lossw<-vector() for (x in seq(0.1, 0.9, by=0.1)) { cat('xweight ', x, '\n') lossw[i] <- cross.validation(file, x)$avg } return(lossw) } MY question is how to index the lossw[i]? for (i in 1:9) for (x in seq(0.1, 0.9, by=0.1)) Thanks so much!

need help with nnet

2010 Oct 12

need help with nnet

HI, Dear R community, My data set has 2409 variables, the last one is response variable. I have used the nnet after feature selection and works. But this time, I am using nnet to fit a model without feature selection. I got the following error information: > dim(train) [1] 1827 2409 nnet.fit<-nnet(as.factor(out) ~ ., data=train, size=3, rang=0.3, decay=5e-4, maxit=500) # model

how to Store loop output from a function

2010 May 26

how to Store loop output from a function

HI, Dear R community, I am writing the following function to create one data set(*tree.pred*) and one vector(*valid.out*) from loops. Later, I want to use the data set from this loop to plot curves. I have tried return, list, but I can not use the *tree.pred* data and *valid.out* vector. auc.tree<- function(msplit,mbucket) { * tree.pred<-data.frame()

R.GBM package

2010 Apr 26

R.GBM package

HI, Dear Greg, I AM A NEW to GBM package. Can boosting decision tree be implemented in 'gbm' package? Or 'gbm' can only be used for regression? IF can, DO I need to combine the rpart and gbm command? Thanks so much! -- Sincerely, Changbin -- [[alternative HTML version deleted]]

average duplicated rows?

2012 Oct 12

average duplicated rows?

Dear useRs, I have a slightly complicated data structure and am stuck trying to extract what I need. I'm pasting an example of this data below. In some cases, there are duplicates in the "gene_id" column because there are two different "sample 1" values for a given "sample 2" value. Where these duplicates exist, I need to average the corresponding

how to work with long vectors

2010 Nov 04

how to work with long vectors

HI, Dear R community, I have one data set like this, What I want to do is to calculate the cumulative coverage. The following codes works for small data set (#rows = 100), but when feed the whole data set, it still running after 24 hours. Can someone give some suggestions for long vector? id reads Contig79:1 4 Contig79:2 8 Contig79:3 13 Contig79:4 14 Contig79:5 17

auc function

2011 Jan 20

auc function

Hi, there. Suppose I already have sensitivities and specificities. What is the quick R-function to calculate AUC for the ROC plot? There seem to be many R functions to calculate AUC. Thanks. Yulei [[alternative HTML version deleted]]

can I use the output of a neural network as the fitness function of genetic algorithm?

2011 Feb 07

can I use the output of a neural network as the fitness function of genetic algorithm?

Hi Everyone, I need to use genetic algorithm to find the minimum. The problem is, I cannot define the fitness function, but I can build a neural network from the input data and use the output as a fitness function. Can this be done? The other problem is, I know there are a few package in R related to GA. So far I know all of them take a specific function as fitness function, is

help with adding lines to current plot

2010 Oct 25

help with adding lines to current plot

HI, Dear R community, I am using the following codes to plot, however, the lines code works. But the line was not drawn on the previous plot and did not shown up. How comes? # specify the data for missense simulation x <- seq(0,10, by=1) y <- c(0.952, 0.947, 0.943, 0.941, 0.933, 0.932, 0.939, 0.932, 0.924, 0.918, 0.920) # missense z <- c(0.068, 0.082, 0.080, 0.099, 0.108, 0.107,

ROC curve in randomForest

2010 Apr 30

ROC curve in randomForest

require(randomForest) rf.pred<-predict(fit, valid, type="prob") > rf.pred[1:20, ] 0 1 16 0.0000 1.0000 23 0.3158 0.6842 43 0.3030 0.6970 52 0.0886 0.9114 55 0.1216 0.8784 75 0.0920 0.9080 82 0.4332 0.5668 120 0.2302 0.7698 128 0.1336 0.8664 147 0.4272 0.5728 148 0.0490 0.9510 153 0.0556 0.9444 161 0.0760 0.9240 162 0.4564 0.5436 172 0.5148 0.4852 176 0.1730

ROC curve in R

2010 Jul 01

ROC curve in R

Hi, i have a fairly large amount of genomic data. I have created a dataframe which has "Reference" as one column and "Variation" as another. I want to plot a ROC curve based on these 2 columns. I have serached the R manual but I could not understand. Can anybody help me with the R code for plotting ROC curve. Thnx ashu6886 -- View this message in context:

can not print probabilities in svm of e1071

2010 Apr 29

can not print probabilities in svm of e1071

> x <- train[,c( 2:18, 20:21, 24, 27:31)] > y <- train$out > > svm.pr <- svm(x, y, probability = TRUE, method="C-classification", kernel="radial", cost=bestc, gamma=bestg, cross=10) > > pred <- predict(svm.pr, valid[,c( 2:18, 20:21, 24, 27:31)], decision.values = TRUE, probability = TRUE) > attr(pred, "decision.values")[1:4,]

similar to: get the row sums