Displaying 20 results from an estimated 1000 matches similar to: "data.frame transformation"
2010 Dec 10
3
(no subject)
Hi R-help,
I am trying to find a way to select the five highest values in a data frame
according to some variable. I will demonstrate:
c
X1 X2
1 1 1
2 1 2
3 1 3
4 1 4
5 1 5
6 1 6
7 1 7
8 1 8
9 1 9
10 1 10
11 2 11
12 2 12
13 2 13
14 2 14
15 2 15
16 2 16
17 2 17
18 2 18
19 2 19
20 2 20
21 2 21
22 2 22
23 2 23
24 2 24
25 2 25
So I
2011 Dec 04
3
RODBC connect to Excel (64-bit Windows 7)
Hi to all.
I have a problem connecting to an Excel database using RODBC.
Namely, I am using 64-bit R 2.14.0 under Windows 7, and I tried the following:
library(RODBC)
> channel <- odbcConnectExcel("results.xlsx")
Error in odbcConnectExcel("results.xlsx") :
odbcConnectExcel is only usable with 32-bit Windows # ok this is clear why it doesn't work
> channel
2013 Mar 20
3
highlight overlapping region of two densities
Hi all.
I would like to highlight overlapping regions of two densities and I could
not find a way to do it.
Here is the sample code:
myd <- c(2,4,5, 4,3,2,2,3,3,3,2,3,3,4,2,4,3,3,3,2,2.5,
2, 3,3, 2.3, 3, 3, 2, 3)
myd1 <- myd-2
plot(range(density(myd)$x, density(myd1)$x), range(density(myd)$y,
density(myd1)$y), type = "n")
lines(density(myd), col=1, lwd=4)
2011 Mar 16
2
export list to csv
Hi everybody.
I have a list like this:
l<-list(data.frame(q1=c(1,2,"check"),q2=c(3,"check",5)),
data.frame(q1=c("check",1),q2=c(4,5)))
names(l)<-c("A","B")
rownames(l[[1]])<-c("aa","bb","cc")
rownames(l[[2]])<-c("aa","bb")
Every object has the same number of columns but different number
2011 Apr 12
2
Converting a categorical variable to multiple dichotomous variables
I have a categorical variable in a dataframe similar to the following...
cat
1
1
3
2
4
I need to convert it to 4 dichotomous variables for each observation, like...
cat1 cat2 cat3 cat4
1 0 0 0
1 0 0 0
0 0 1 0
0 1 0 0
0 0 0 1
Thanks in advance!
Shane
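One possible approach to the question above, as a minimal sketch: it assumes the codes are stored in a data frame column named cat (the data frame name df is made up here) and that the four levels are 1 to 4. It builds the indicator columns with model.matrix() from base R.

```r
# Hypothetical data frame mirroring the example in the post
df <- data.frame(cat = c(1, 1, 3, 2, 4))

# Treat the codes as a factor with explicit levels 1..4
f <- factor(df$cat, levels = 1:4)

# Dropping the intercept (- 1) gives one 0/1 indicator column per level
dummies <- model.matrix(~ f - 1)
colnames(dummies) <- paste0("cat", levels(f))

result <- as.data.frame(dummies)
print(result)
```

Fixing the levels explicitly keeps a column for every category even if one of them does not occur in the data.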
2011 Aug 10
1
subqueries in sqlQuery function (package RODBC)
Hi R users.
Sorry for the missing example, and if the question is too general, but I am
wondering whether it is possible to execute subqueries in the sqlQuery function
(package RODBC) with an open connection to Excel or SQL Server 2000. I couldn't
find any example of this.
And if it is possible, what would be the correct syntax for this query:
SELECT ct,COUNT(*) as n
FROM (SELECT COUNT(*) AS ct FROM children
2010 Dec 11
2
package sampling
Hi R users.
I have a problem with the strata function in the sampling package.
> st0 = strata(dom, stratanames="stratas", size=sample.size,
method="systematic",pik, FALSE)
Error in sort.list(y) : 'x' must be atomic for 'sort.list'
Have you called 'sort' on a list?
In a previous version of R (2.9.1) and a previous version of the sampling
package, this code worked
2010 Dec 19
1
package survey
Hi R users,
could someone help me find out which formulas are used for the standard error
calculation in the following example:
a=data.frame(weights=rep(c(10,1),c(4,1)),fpc=rep(41,5),uk=rep(1,5))
srs<-svydesign(id=~1, weights=~weights, data=a)
srs1<-svydesign(id=~1, weights=~weights,fpc=~fpc, data=a)
svytotal(~uk,srs)
total SE
uk 41 9
svytotal(~uk,srs1)
total SE
uk 41
2011 Dec 08
1
sum of deviations from the weighted mean
Hi all. I tried to calculate the sum of deviations from the weighted mean and I
didn't get what I expected, which is 0. Here is an example:
> wt <- c(10,25,38,22,5)
> x <- 6:10
> wm <- weighted.mean(x,wt)
> (x-wm)*wt
[1] -18.70 -21.75 4.94 24.86 10.65
> sum((x-wm)*wt)
[1] -1.24345e-14
With the simple mean I got 0:
> sum(x-mean(x))
[1] 0
Could someone explain to me why we
2013 Apr 02
2
speedometer charts in R
Hi useRs.
Does anybody know if there is a function that creates a speedometer chart
in R? Or does anybody have suggestions on where to start looking and which
functions I could modify in order to create this kind of chart?
Thanks for any help
Andrija
2011 Jan 25
1
NA replacing
Hello R users,
I have the following data frame:
df=data.frame(id=c(1:10),strata=rep(c(1,2),c(5,5)),y=c(
10,12,10,NA,15,70,NA,NA,55,100),x=c(3,4,5,7,4,10,12,8,3,15))
and I would like to replace the NAs as follows. Instead of the first NA:
tapply(na.exclude(df)$y,na.exclude(df)$strata,sum)[1]*7/tapply(na.exclude(df)$x,na.exclude(df)$strata,sum)[1]
where 7 is the value of x (id=4) in strata 1 where y=NA
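A minimal sketch of one way to apply the rule above to every NA at once, assuming the same ratio (sum of observed y over the sum of the matching x, within the stratum) should be used for all missing values and not only the first one. The helper names obs, miss and ratio are made up for illustration.

```r
df <- data.frame(id = 1:10,
                 strata = rep(c(1, 2), c(5, 5)),
                 y = c(10, 12, 10, NA, 15, 70, NA, NA, 55, 100),
                 x = c(3, 4, 5, 7, 4, 10, 12, 8, 3, 15))

# Rows where y is observed (the same rows na.exclude(df) would keep here)
obs <- !is.na(df$y)

# Per-stratum ratio: sum of observed y divided by sum of the matching x
ratio <- tapply(df$y[obs], df$strata[obs], sum) /
  tapply(df$x[obs], df$strata[obs], sum)

# Replace each missing y by its stratum ratio times that row's x
miss <- !obs
df$y[miss] <- ratio[as.character(df$strata[miss])] * df$x[miss]
print(df)
```

Indexing ratio by the stratum label as a character string looks the ratio up by name, so the strata do not have to be coded 1, 2, ... in order.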
2012 Apr 14
1
Error: R for Windows GUI front-end has stopped working
Hi all.
I found one situation, on my OS (Windows 7), where R stops working
with the reported error "R for Windows GUI front-end has stopped working".
Here is the example:
library(plyr)
DF <- data.frame(x=c(1:3, NA, NA), y=factor(sample(1:3,5,rep=T),levels=1:5))
DF[DF$x<3, ]
#this works properly
ddply(DF, .(y), nrow, .drop=FALSE)
#this causes the problem
ddply(DF[DF$x<3, ], .(y), nrow,
2011 Dec 04
4
equating approximate values
Hello List,
I am having trouble finding the command for my problem.
I have two arrays x and y. Now I would like to compare the values of x and y
and then get the indices of x that are exactly or approximately (+/- some
value) equal to the values in y.
x <- runif(100,min=0,max=5)
y <- runif(10,min=0,max=5)
The threshold (+/-) can vary; for this example let's take it to be 0.5
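A minimal sketch of one way to do this in base R, assuming "approximately equal" means an absolute difference of at most the threshold. The names tol, close_to, idx_per_y and idx_any are made up for illustration, and the seed is only there to make the sketch reproducible.

```r
set.seed(1)  # assumption: fixed seed, only for reproducibility
x <- runif(100, min = 0, max = 5)
y <- runif(10, min = 0, max = 5)
tol <- 0.5

# close_to[i, j] is TRUE when x[i] lies within tol of y[j]
close_to <- abs(outer(x, y, "-")) <= tol

# For each value in y, the indices of x that fall within the tolerance
idx_per_y <- lapply(seq_along(y), function(j) which(close_to[, j]))

# Indices of x that are close to at least one value in y
idx_any <- which(rowSums(close_to) > 0)
```

outer() builds the full 100 x 10 difference matrix, which is fine at this size; for very long vectors a sorted search such as findInterval() may be worth considering.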
2010 Dec 12
1
list manipulation
Hi R users!
Does anyone know a command similar to cbind for adding a column to an element
of a list? For example, on this list:
> c1
$`1`
x11 x22
1 1 1
2 1 2
3 1 3
4 1 4
5 1 5
$`2`
x11 x22
6 2 6
7 2 7
8 2 8
9 2 9
10 2 10
11 2 11
I would like to add a column named random, created with runif(5), to the
first element in order to obtain something like this:
> c1
$`1`
x11 x22 random
1 1 1
2012 Apr 03
5
R equivalent for SQL query
Hi,
I have a query which I would like to translate into R, but I do not know how to do it in an easy way.
Assume a data frame has columns A, B and C:
A B C
1 1 3
1 1 4
1 1 5
1 2 6
1 2 7
1 3 8
The query is as follows:
select A, B, count(*)
from data.frame
group by A, B
order by count(*) desc
How do I translate this into R statements in such a way that the result is a data frame structured as
2010 Mar 11
4
Forecast
Sample report data that I want to forecast
quarter quarter_index Revenue
2007 Q1 1 $3,856,799
2007 Q2 2 $4,243,328
2007 Q3 3 $4,930,369
2007 Q4 4 $5,443,579
2008 Q1 5 $5,164,830
2008 Q2 6 $5,104,413
2008 Q3 7
2012 Feb 03
3
Cannot get "==" operator to return TRUE
I have a data.frame named "df". The dput of df is at the bottom of this e-mail.
What I'd like to do is replace the "n/a " values with NA. On Mac OSX, it works
to do this:
df[df == "n/a"] <- NA
However, it does not work on Ubuntu. See below.
Thanks in advance,
Garrett
> x <- df[27, 4] # complete data.frame dput is below
> dput(x)
"n/a?"
2012 Apr 12
4
Definition of "lag" is opposite in ts and xts objects!
Example:
Will ts objects be obsolete or modified?
> a
               [,1]
1983 Q1 2.747365190
1983 Q2 2.791594762
1983 Q3 -0.009953715
1983 Q4 -0.015059485
1984 Q1 -1.190061246
1984 Q2 -0.553031799
1984 Q3 0.686874720
1984 Q4 0.953911035
> lag(a,4)
               [,1]
1983 Q1 NA
1983 Q2 NA
1983 Q3 NA
1983 Q4 NA
1984 Q1 2.747365190
1984 Q2
2010 Mar 10
3
see the example and help me
Sample report data that I want to forecast
quarter quarter_index Revenue
2007 Q1 1 $3,856,799
2007 Q2 2 $4,243,328
2007 Q3 3 $4,930,369
2007 Q4 4 $5,443,579
2008 Q1 5 $5,164,830
2008 Q2 6 $5,104,413
2008 Q3 7
2009 Feb 19
2
table with 3 variables
I have the initial matrix:
> data.frame(Subject=rep(100:101, each=4), Quarter=rep(paste("Q",1:4,
sep=""),2), Boolean = rep(c("Y","N"),4))
Subject Quarter Boolean
1 100 Q1 Y
2 100 Q2 N
3 100 Q3 Y
4 100 Q4 N
5 101 Q1 Y
6 101 Q2 N
7 101 Q3 Y
8 101