Displaying 20 results from an estimated 20000 matches similar to: "What is behind class coercion of a factor into a character"
2011 Dec 02
2
Unexplained behavior of level names when using ordered factors in lm?
Hello dear all,
I am unable to understand why when I run the following three lines:
set.seed(4254)
a <- data.frame(y = rnorm(40), x = ordered(sample(1:5, 40, TRUE)))
summary(lm(y ~ x, a))
The output I get includes factor levels which are not relevant to what I am
actually using:
Call:
lm(formula = y ~ x, data = a)
Residuals:
Min 1Q Median 3Q Max
2012 Jan 31
2
question of merging two dataframes
Suppose I have two data frames A and B
A has three variables and B also has three variables.
I would like to merge these two data frames, with the requirement that
the value of the second column in data frame A is less than the
value of the second column in data frame B. Is there R code to do
this? Thanks
Dataframe A: V1a V2a V3a
1 2 3 5
2012 Oct 18
3
how to concatenate factor vectors?
How do I concatenate two vectors of factors?
--8<---------------cut here---------------start------------->8---
> a <- factor(5:1,levels=1:9)
> b <- factor(9:1,levels=1:9)
> str(c(a,b))
int [1:14] 5 4 3 2 1 9 8 7 6 5 ...
> str(unlist(list(a,b),use.names=FALSE))
Factor w/ 9 levels "1","2","3","4",..: 5 4 3 2 1 9 8 7 6 5 ...
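A minimal sketch of one workaround, assuming both factors should end up sharing the union of their levels: convert to character, concatenate, and rebuild the factor. (In R 4.1.0 and later, c() on factors returns a factor directly; the integer result shown above comes from an older version.) The name ab is illustrative only.

```r
a <- factor(5:1, levels = 1:9)
b <- factor(9:1, levels = 1:9)

# Concatenate the labels (not the integer codes), then rebuild the factor
# over the union of both level sets so no label is lost.
ab <- factor(c(as.character(a), as.character(b)),
             levels = union(levels(a), levels(b)))
str(ab)
```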
2010 Mar 07
2
Why can't "apply" be used with "as.factor" on a data.frame ?
Hi all,
Let's say I have a data.frame and want to turn each of its columns into a
factor.
My instinct would be to use as.factor with apply. But this won't work, and
results in a data.frame of characters.
I found another solution for how to achieve this, but I would also like to
understand - *WHY* does it work this way?
Here is an example script:
a <- data.frame(x1 = rnorm(100),
2010 Dec 11
5
Why do we have to turn factors into characters for various functions?
Hello dear R-help mailing list,
My question is *not* about how factors are implemented in R (which is, if I
understand correctly, that factors keep numbers and assign levels to them).
My question *is* about why so many functions that work on factors don't
treat them as characters by default?
Here are two simple examples:
Example one: turning the characters inside a factor into numeric:
x
2013 Apr 20
1
Assigning factor to character vector
Hi!
Yesterday I accidentally discovered this:
> a <- LETTERS[1:5]
> a
[1] "A" "B" "C" "D" "E"
>
> a[1] <- factor(a[1])
> a
[1] "1" "B" "C" "D" "E"
BUT:
> b <- factor(LETTERS[1:5])
> b
[1] A B C D E
Levels: A B C D E
> b[1] <- factor(b[1])
> b
[1] A B C D E
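The "1" in the first case looks like the factor's underlying integer code being coerced to character during assignment. A hedged sketch of the usual way around it, assuming the label is what should be stored: convert with as.character() before assigning. The name f is illustrative.

```r
a <- LETTERS[1:5]
f <- factor(a[1])         # a factor with a single level, "A"

# Assign the label explicitly rather than relying on implicit coercion,
# which would store the integer code instead.
a[1] <- as.character(f)
print(a)
```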
2012 Feb 23
5
cor() on sets of vectors
suppose I have two sets of vectors: x1,x2,...,xN and y1,y2,...,yN.
I want N correlations: cor(x1,y1), cor(x2,y2), ..., cor(xN,yN).
my sets of vectors are arranged as data frames x & y (vector=column):
x <- data.frame(a=rnorm(10),b=rnorm(10),c=rnorm(10))
y <- data.frame(d=rnorm(10),e=rnorm(10),f=rnorm(10))
cor(x,y) returns a _matrix_ of all pairwise correlations:
cor(x,y)
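One possible way to get only the N column-wise correlations, sketched under the assumption that x and y have the same number of columns in matching order: iterate over the column pairs with mapply(). The name pairwise is illustrative; taking diag(cor(x, y)) gives the same values but computes the full matrix first.

```r
set.seed(1)
x <- data.frame(a = rnorm(10), b = rnorm(10), c = rnorm(10))
y <- data.frame(d = rnorm(10), e = rnorm(10), f = rnorm(10))

# A data frame is a list of columns, so mapply() pairs the first column of x
# with the first column of y, the second with the second, and so on.
pairwise <- mapply(cor, x, y)
print(pairwise)
```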
2011 Jun 13
1
In rpart, how is "improve" calculated? (in the "class" case)
Hi all,
I apologize in advance if I am missing something very simple here, but since
I failed at resolving this myself, I'm sending this question to the list.
I would appreciate any help in understanding how the rpart function is
(exactly) computing the "improve" (which is given in fit$split), and how it
differs when using the split='information' vs split='gini'
2015 May 18
2
A "bug" in plot.dendrogram - can't plot lty with character color
The problem:
===========
Once a dendrogram has a branch with both a line type AND a color (given as a
character string), the plot.dendrogram function fails to plot and returns
an error.
I say this is a bug because (I believe), we would like a dendrogram to be
able to use character colors, while also allowing control over line types.
This e-mail includes an example, and what I think a solution
2009 Nov 18
2
Importing tRNA data into R ?
Hello dear R help group,
I would like to download the tRNA data on:
http://gtrnadb.ucsc.edu/download.html
And then import it into R.
Can anyone direct me as to how to do so?
Thanks,
Tal
----------------------------------------------
My contact information:
Tal Galili
E-mail: Tal.Galili@gmail.com
Phone number: 972-52-7275845
FaceBook: Tal Galili
My Blogs:
http://www.talgalili.com (Web and
2012 Aug 28
5
variable scope
At the end of a for loop its variables are still present:
for (i in 1:10) {
x <- vector(length=100000000)
}
ls()
will print "i" and "x".
this means that at the end of the for loop body I have to write
rm(x)
gc()
is there a more elegant way to handle this?
Thanks.
--
Sam Steingold (http://sds.podval.org/) on Ubuntu 12.04 (precise) X 11.0.11103000
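One possible approach, sketched under the assumption that nothing created inside the loop is needed afterwards: wrap the loop in local() (or in a function) so that i and x live in a temporary environment and disappear with it. The vector is shortened here only to keep the example light, and the names before and leftover are illustrative.

```r
before <- ls()

# Everything assigned inside local() stays in its own environment,
# so i and x do not leak into the calling workspace.
local({
  for (i in 1:10) {
    x <- vector(length = 1000)
  }
})

# Names that appeared in the workspace since the snapshot was taken
leftover <- setdiff(ls(), c(before, "before"))
print(leftover)
```

The memory held by x becomes eligible for garbage collection once the local environment is no longer referenced, so an explicit rm() is not needed in this sketch.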
2011 Dec 21
4
qqnorm & huge datasets
Hi,
When I run qqnorm on a vector of length 10M+ I get a huge pdf file which
cannot be loaded by acroread or evince.
Any suggestions? (apart from sampling the data).
Thanks.
--
Sam Steingold (http://sds.podval.org/) on Ubuntu 11.10 (oneiric) X 11.0.11004000
2010 Jun 15
2
Graphics question: How to create a changing "smudge factor" for overlapping lines?
Hello all,
I am trying to create a Clustergram in R.
(More about it here: http://www.schonlau.net/clustergram.html)
And to produce a picture similar to what is seen here:
http://www.schonlau.net/images/clustergramexample.gif
I was able (more or less) to write the R code for creating the image, but
there is one thing I can't seem to figure out, that is the
*changing* "smudge factor"
2009 Nov 20
6
How to: highlight R syntax on webpages ?
My question is in the subject, but to expand: I am specifically curious
about WordPress blogs. But any solution will give me a lead.
Thanks,
Tal
----------------------------------------------
Contact me: Tal.Galili@gmail.com | 972-52-7275845
Read me: www.talgalili.com (Hebrew) | www.biostatistics.co.il (Hebrew) |
www.r-statistics.com/ (English)
2012 Feb 08
4
"unsparse" a vector
Suppose I have a vector of strings:
c("A1B2","A3C4","B5","C6A7B8")
[1] "A1B2" "A3C4" "B5" "C6A7B8"
where each string is a sequence of <column><value> pairs
(fixed width, in this example both value and name are 1 character, in
reality the column name is 6 chars and value is 2 digits).
I need to
2009 Oct 14
2
Scatter plot using icons (from a gif) instead of points - is it possible ?
Hello dear R-help group.
I wish to plot a scatter plot using icons (or images) instead of points.
Is it possible? and how so?
Thanks,
Tal
2009 Mar 10
5
Changing factor to numeric
Dear Users,
I have a variable in my dataset which is of type factor. But it actually
contains numeric entries like 5.735 4.759 ..... This is because the
data was read from a CSV file into R and this variable contained other
characters which were not numeric. I have now dropped the records with the
characters which are not numeric for this variable and want to change it to
numeric storage
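A minimal sketch of the commonly used conversion, assuming every remaining level is a valid number written with a decimal point: go through as.character() first, since as.numeric() applied directly to a factor returns the internal integer codes rather than the values. The names f and x are illustrative.

```r
# Illustrative factor whose labels are numbers stored as text
f <- factor(c("5.735", "4.759", "5.735"))

# Convert labels to character, then to numeric
x <- as.numeric(as.character(f))
print(x)

# An equivalent form that converts each level only once
x2 <- as.numeric(levels(f))[f]
```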
2009 Jul 25
2
labelling points plotted in a 2D plan
I created a 2D plane:
> plot(range(-2.5,0.95),range(0.00,1.00),type="n",axes=TRUE)
I made a projection of points with their coordinates (X,Y) in that plane
> fullpoints=read.csv2("fullpoints.csv",h=T)
> plot(fullpoints)
The points are listed in that .csv file, which is organized this way:
0,48875 0,142857143
0,409 0,142857143
0,45611 0,25
0,49833 0,222222222
0,61158
2010 May 22
2
Capturing R console output into a file (sink+savehistory ??)
After reading more, I understand I didn't formulate my last question
correctly, so please allow me to rephrase:
What I am looking for is a way to save the R console session output.
That is, a command that would combine the results of using:
?sink # And
?savehistory
My motivation for this is that doing it will allow someone who is a blind
user of R to be able to easily export his results to
2010 May 08
3
Count cases in a list
Hi everybody, I would like to count how many times names in list L,
nombreL, appear in list C, nombreC.
Can I improve the following program?
cuenta <- 0
topL <- length(nombreL)
topC <- length(nombreC)
for (i in 1:topL) {
for (j in 1:topC) {
k <- grep(noquote(nombreL[i]), nombreC[j])