Displaying 20 results from an estimated 400 matches similar to: "boxplot with text and symbols on x"
2008 May 05
1
Count data in random Forest
Hello R-user!
I am running R 2.7.0 on a Power Book (Tiger). (I am still R and
statistics beginner)
I try to find the most important variables to divide my dataset as
given in a categorical variable using randomForest.
Is randomForest() able to deal with count data?
Or is there no difference because only the ranks are used in the trees?
Thanks in advance
Birgit
Birgit Lemcke
Institut f?r
2008 Apr 21
1
ANCOVA
R version 2.6.2 PowerBook G4
Hello R User,
I try to perform an ANCOVA using the glm function.
I have a dataset with continuous and categorical data (explanatory
variables) and my response variable is also binary categorical.
Fehler: NA/NaN/Inf in externem Funktionsaufruf (arg 4)
Zus?tzlich: Warning messages:
1: In Ops.factor(y, mu) : - nicht sinnvoll f?r Faktoren (makes no
sense for
2008 May 05
1
Problems using rfImpute
Hello R-user!
I am running R 2.7.0 on a Power Book (Tiger). (I am still R and
statistics beginner)
I tried rfImpute (randomForest) and as far as I understood should it
replace NA`s using a proximity matrix:
> set.seed(100000)
> Subset5Imputed<-rfImpute(Sex~., data=Subset5)
ntree OOB 1 2
300: 11.78% 12.36% 11.21%
ntree OOB 1 2
300: 12.07% 12.64%
2008 Apr 29
1
randomForest and ordered factors
Hello R-user!
I am running R 2.7.0 on a Power Book (Tiger). (I am still R and
statistics beginner)
I try to find the most important variables to divide my dataset as
given in a categorical variable.
code:
Test.rf4<-randomForest(Sex~.,na.action=na.roughfix, data=Subset4,
importance=TRUE, proximity=TRUE, ntree=10000, do.trace=1000,
keep.forest=FALSE)
My dataset contains also ordered
2008 May 15
1
metaMDS using Dissimilarity matrix
Hello R-user community!
I am running R 2.7.0 on a Power Book (Tiger). (I am still R and
statistics beginner)
Presently I try to run the function metaMDS (vegan) using an existing
dissimilarity-matrix.
As I would like to start with this matrix I thought I could just give
the matrix using the x= -argument
Test<-metaMDS(x=Dist.Gower)
Fehler in inherits(comm, "dist") :
2008 Feb 06
1
Discriminant function analysis
Hello R-Cracks,
I am using R 2.6.1 on a PowerBook G4.
I would like to perform a discriminant function analysis. I found lda
in MASS but as far as I understood, is it only working with
explanatory variables of the class factor. My dataset contains
variables of the classes factor and numeric.
Is there another function that is able to handle this?
Thank you all in advance.
Greetings
Birgit
2009 Nov 20
1
how to specify the order of panels with xyplot
> chromosomes
id refseq name length
1 0 NC_000001.9 Homo sapiens chromosome 1 247249719
2 1 NC_000002.10 Homo sapiens chromosome 2 242951149
3 2 NC_000003.10 Homo sapiens chromosome 3 199501827
4 3 NC_000004.10 Homo sapiens chromosome 4 191273063
5 4 NC_000005.8 Homo sapiens chromosome 5 180857866
6 5 NC_000006.10 Homo sapiens chromosome 6
2009 Sep 29
4
How can I avoid a for-loop through sapply or lapply ?
Through converting a miRNAs file from FASTA to character format I get a vector which looks like the following:
> nml
[1] "hsa-let-7a MIMAT0000062 Homo sapiens let-7a"
[2] "hsa-let-7b MIMAT0000063 Homo sapiens let-7b"
[3] "hsa-let-7c MIMAT0000064 Homo sapiens let-7c"
[4] "hsa-let-7d MIMAT0000065 Homo sapiens
2010 May 27
3
how to extract the 1st field from a vector of strings
I have the following vector of strings (shown only the first 3 elements)
> desc[1:3]
[1] "hsa-let-7a MIMAT0000062 Homo sapiens let-7a"
[2] "hsa-let-7a* MIMAT0004481 Homo sapiens let-7a*"
[3] "hsa-let-7a-2* MIMAT0010195 Homo sapiens let-7a-2*"
> is.vector(desc)
[1] TRUE
> A <- unlist(strsplit(desc[1:3], " "))
> A
[1]
2003 Oct 09
2
No NT domain controller is available problem
Hi all,
I have samba server 2.2.8a to act as a PDC on a mixed
WinXP/Win2k/WinNT
network.
I joined all my workstations to the domain with no
problem... And users
can login into the domain
and access their folders/printers, BUT!!! Every now and
then they
cannot log in getting error message
that domain controller cannot be found....
I'm trying to track the problem down :-) When I went thru
2011 Sep 13
2
GO & Protein Complex Analysis for Homo sapiens
Dear All,
I need to fetch GO ontologies for Homo sapiens with their mappings to
corresponding Uniprot identifiers. I would be using this information to
compare result from a clustering algorithm with existing protein complexes.
This would be a test to check how the clustering algorithm accurately
captures GO terms with respect to the known protein complexes. Can anyone
suggest a simple workflow
2011 Apr 20
1
Pattern match
Hi ALL,
I have very simple question regarding pattern matching. Could anyone tell me
how to I can use R to retrieve string pattern from text file. for example
my file contain following information
SpeciesCommon=(Human);SpeciesScientific=(Homo
sapiens);ReactiveCentres=(N,C,C,C,+
H,O,C,C,C,C,O,H);BondInvolved=(C-H);EzCatDBID=(S00343);BondFormed=(O-H,O-H);Bond+
2007 Mar 11
1
recoding question
Hi R-users,
I have a data frame like this:
photographer category picture points
Hannu kalat limamikko 1
Teemu kalat verkkovaja 3
Hate kalat munat puoliks padassa 6
Hannu kalat isokala 8
Teemu kasvit, sienet ja muut eliöt harppi 2
Hate kasvit, sienet ja muut eliöt pyynikki 2
Petteri kasvit, sienet ja muut eliöt harmaaleppä 5
Lauri kasvit, sienet ja muut eliöt lumipuu 9
Teemu linnut kainostelua 1
2009 Jul 01
3
is there a way to extract fata from web pages through some R function ?
I deal with a huge amount of Biology data stored in different databases.
The databases belongig to Bioconductor organization can be accessed through Bioconductor packages.
Unluckily some useful data is stored in databases like, for instance, miRDB, miRecords, etc ... which offer just an
interactive HTML interface. See for instance
http://mirdb.org/cgi-bin/search.cgi,
2009 Oct 21
0
drawing a line indicating extent of each factored data series in multipanel lattice xyplot
Hi,
Am am plotting aggregated frequency data (extracted from an RDBMS)
relating to DNA sequence features for each of the human chromosomes as
described in the table chromosomes below (the frequency data is in a
table 'hits' that has a value (or not) for each of a set of bins across
each chromosome).
I would like to mark the extent of the chromosome (according to the
length value in
2007 Mar 10
2
barplot, for loop?
Hi R-users,
I have a dataset like this:
kuvaaja
kuva
yhteispisteet
Hannu
isokala
8
Hannu
kaapin alta löytynyt
2
Hannu
kaapin alta löytynyt 2
8
Hannu
limamikko
1
Hannu
maukasta marmeladia
8
Hannu
skrinnareita
4
Hate
madekoukkujen suojelupyhimys
3
Hate
matka aikaan joka ei enää palaa
3
Hate
munat puoliks padassa
6
Hate
pyynikki
2
Hate
vailla armeerausta
2
2008 Apr 22
1
Comparing kendall's tau values?
I have 3 variables relating to the successful introductions of species
to 95 different areas: introduction frequency; number of successes pre
1906; number of successes post 1906
The data are not normal, nor homo-skedatic, so I am using non-parametric
statistics.
I have calculated Kendall's tau between both introduction & successes
pre 1906 (tau=0.3903) and introduction & successes
2007 Jun 28
5
Repeat if
Hello,
(Power Book G4, Mac OS X, R 2.5.0)
I would like to repeat the function range for 85 Vectors (V1-V85).
I tried with this code:
i<-0
> repeat {
+ i<-i+1
+ if (i<85) next
+ range (Vi, na.rm = TRUE)
+ if (i==85) break
+ }
I presume that the Vi is wrong, because in this syntax i is not known
as a variable. But I don´t know how to say that it is a variable here.
Would be nice if
2007 Oct 10
3
as.dist with diagonal unequal zero
Hello and sorry that I still haven?t found a solution for my problem.
I need to extract the lower and upper triangle from a square matrix
including the diagonal. This diagonal is not zero in that special case.
I tried with as.dist
w<-as.dist(w, diag = TRUE)
> w
1 2 3 4 5
1 0
2 2 0
3 3 8 0
4 4 9 14 0
5 5 10 15 20 0
but found no way to keep the diagonal that is in the
2007 Nov 23
4
PCA with NA
Dear all,
(Mac OS X 10.4.11, R 2.6.0)
I have a quantitative dataset with a lot of Na?s in it. So many, that
it is not possible to delete all rows with NA?s and also not
possible, to delete all variables with NA?s.
Is there a function for a principal component analysis, that can deal
with so many NA?s.
Thanks in advance
Birgit
Birgit Lemcke
Institut f?r Systematische Botanik