Displaying 20 results from an estimated 50 matches for "cecilia".
2013 Jun 10
2
please check this
...indx10<- sort(c(indx11,indx00))
nrow(res10Percent[-indx10,])
#[1] 452
res10PercentNew<-res10Percent[-indx10,]
nrow(subset(res10PercentNew,dummy==1))
#[1] 226
nrow(subset(res10PercentNew,dummy==0))
#[1] 226
nrow(unique(res10PercentNew))
#[1] 452
A.K.
----- Original Message -----
From: Cecilia Carmo <cecilia.carmo at ua.pt>
To: arun <smartpink111 at yahoo.com>
Cc:
Sent: Monday, June 10, 2013 10:19 AM
Subject: RE: please check this
But I don't want it like this.
Once a firm is paired with another, these two firms should not be paired again.
Could you solve this?
Thanks,...
2013 Jun 07
4
matched samples, dataframe, panel data
Hi R-helpers
#I have a data panel of thousands of firms, by year and industry, and
#one dummy variable that separates the firms into two categories: 1 if the firm has an auditor; 0 if not
#and another variable that represents the firm dimension (total assets in thousands of euros)
#I need to create two separate samples with the same number of firms where
#one firm in the first has a corresponding
2009 May 04
4
Creating a variable which is the sum of equal rows in a dataframe
...aframe:
Firm numberofbanks
500600700 3
500600800 2
500600900 1
This is a question of counting the times each firm
appears, but I don't know which function does this. If
anyone could help me, I would appreciate it.
Thank you in advance for the help you could give me,
Cecilia Carmo (Portugal)
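A minimal sketch of one way to count how often each firm appears, using base R only. The input data frame `banks` below is hypothetical (the original input is elided in the snippet); only the column names `Firm` and `numberofbanks` come from the post.

```r
# Hypothetical input: one row per firm-bank relationship
banks <- data.frame(
  Firm = c(500600700, 500600700, 500600700,
           500600800, 500600800,
           500600900)
)

# One row per firm with the number of times it appears
counts <- as.data.frame(table(Firm = banks$Firm),
                        responseName = "numberofbanks")

# Alternatively, attach the count to every row of the original data
banks$numberofbanks <- ave(seq_along(banks$Firm), banks$Firm, FUN = length)

counts
```

`table()` gives the summary table, while `ave()` keeps the original row layout, which may be more convenient when the count is needed as a regressor.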
2009 Jun 01
1
Fwd: subset dataframe/list
--- the forwarded message follows ---
-------------- next part --------------
An embedded message was scrubbed...
From: "Cecilia Carmo" <cecilia.carmo at ua.pt>
Subject: Re: [R] subset dataframe/list
Date: Mon, 01 Jun 2009 21:33:15 +0100
Size: 3657
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20090601/921f7638/attachment-0002.mht>
2011 Oct 02
2
subset in dataframes
I need help in subsetting a dataframe:
data1<-data.frame(year=c(2001,2002,2003,2004,2001,2002,2003,2004,
2001,2002,2003,2004,2001,2002,2003,2004),
firm=c(1,1,1,1,2,2,2,2,3,3,3,3,4,4,4,4),x=c(11,22,-32,25,-26,47,85,98,
101,14,87,56,12,43,67,54),
y=c(110,220,302,250,260,470,850,980,1010,140,870,560,120,430,670,540))
data1
I want to keep the firms where all x>0 (where there are
2009 Jun 28
2
simple loop
Hi everyone!
I have this dataframe:
firm<-c(rep(1,4),rep(2,4),rep(3,4),rep(4,4),rep(5,4),rep(6,4))
year<-c(rep(2000:2003,6))
industry<-c(rep(10,4),rep(20,4),rep(30,4),rep(10,4),rep(20,4),rep(30,4))
X1<-c(10,14,18,16,20,45,23,54,24,67,98,58,16,32,57,12,54,0,0,22,11,3,5,6)
data<-data.frame(firm, industry,year,X1)
data
I need a loop that calculates the mean of X1 by year and
by
2010 Aug 01
3
remove extreme values or winsorize – loop - dataframe
Hi everyone!
#I need a loop or a function that creates an X2 variable
that is X1 without the extreme values (or X1 winsorized)
by industry and year.
#My reproducible example:
firm<-sort(rep(1:1000,10),decreasing=F)
year<-rep(1998:2007,1000)
industry<-rep(c(rep(1,10),rep(2,10),rep(3,10),rep(4,10),rep(5,10),rep(6,10),rep(7,10),rep(8,10),rep(9,10),
rep(10,10)),1000)
X1<-rnorm(10000)
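A possible sketch of winsorizing X1 within each industry-year cell without an explicit loop. The data frame `d`, the helper `winsorize()`, and the 5%/95% cut-offs are assumptions made for illustration; the post does not say which limits are wanted.

```r
set.seed(42)

# Small hypothetical panel: 20 firms, 5 years, 2 industries
d <- data.frame(
  firm     = rep(1:20, each = 5),
  year     = rep(2001:2005, times = 20),
  industry = rep(rep(1:2, each = 5), times = 10),
  X1       = rnorm(100)
)

# Clamp values to the chosen lower and upper quantiles (assumed 5% / 95%)
winsorize <- function(v, probs = c(0.05, 0.95)) {
  lim <- quantile(v, probs = probs, na.rm = TRUE, names = FALSE)
  pmin(pmax(v, lim[1]), lim[2])
}

# ave() applies the function separately within each industry-year group
d$X2 <- ave(d$X1, d$industry, d$year, FUN = winsorize)

head(d)
```

To drop the extreme values instead of clamping them, the helper could return `NA` for values outside the limits.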
2010 Aug 21
3
problems with merge() - the output has many repeated lines
Hi everyone,
I have been merging many big dataframes (about 80000 rows
each) and I never had this problem, but now it happened to
me and I want to know if someone knows what could be
happening.
The final dataframe has many rows, an impossible number! I
have done edit(dataframe) and I saw that there are many
repeated rows (all equal).
Thanks for any help,
Cecília Carmo
Universidade de
2011 Sep 22
2
the opposite of lag() in panel data
Hi R-helpers
I want a function that performs the opposite of lag() with panel data.
I have transformed my data before with pdata.frame(mydata,
index=c("groupindex", "timeindex"))
And then I've done lag(mydata, -1) but it doesn't work.
The error message was:
Error in rep(1, ak) : invalid 'times' argument
Thank you in advance,
Cecília Carmo
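A minimal base-R sketch of a "lead" (next-period value) computed within each group, as an alternative that does not depend on the panel object. The data frame `d` and its column names are hypothetical, and the sketch assumes the rows of each group are consecutive periods with no gaps.

```r
# Hypothetical panel: 2 firms, 3 years each
d <- data.frame(
  firm = rep(1:2, each = 3),
  year = rep(2001:2003, times = 2),
  x    = c(10, 20, 30, 40, 50, 60)
)

# Sort so that each firm's rows are in time order
d <- d[order(d$firm, d$year), ]

# Shift values up by one position within each firm; the last year gets NA
d$x_lead <- ave(d$x, d$firm, FUN = function(v) c(v[-1], NA))

d
```

If some years are missing for a firm, a positional shift like this would pair non-adjacent years, so the gaps would need to be handled first.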
2009 Jun 02
1
R: subset dataframe/list
...=T,dec=",")
Then, subset worked fine
coeficientesWanted<-subset(coeficientes,b1>0)
Thanks again,
Cecília Carmo (Universidade de Aveiro - Portugal)
--- the forwarded message follows ---
-------------- next part --------------
An embedded message was scrubbed...
From: "Cecilia Carmo" <cecilia.carmo at ua.pt>
Subject: Fwd: [R] subset dataframe/list
Date: Mon, 01 Jun 2009 21:34:33 +0100
Size: 4560
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20090602/c8422ca1/attachment-0002.mht>
2009 Jun 16
2
save the output of summary(lmList(x)) into a dataframe
Hi r-helpers!
I need to save the output of the summary() function that I've
run like this:
z<- lmList(y~x1+x2| x3,
na.action=na.omit,data1,subset=year==1999)
w<-summary(z)
The output (w) is something like this:
Call:
Model: y ~ x1 + x2 | x3
Data: data1
Coefficients:
(Intercept)
Estimate Std. Error t value Pr(>|t|)
1 0.081110514 1.141352e-01
2009 Jun 01
2
subset dataframe/list
Hi R-helpers!
I have the following object:
> head(coeficientes)
caedois b1 b2 b3
1 1 0,033120395 -20,29478338 -0,274638864
2 2 -0,040629634 74,54239889 -0,069958424
3 5 -0,001116816 35,2398622 0,214327185
4 10 0,171875
5 14 0,007288399 40,06560548 -0,081828338
6 15 0,027530346 0,969969409 0,102775555
2010 Aug 20
5
paired samples, matching rows, merge()
Hi everyone!
I'm matching two samples to create one sample that has
pairs of observations equal for the k1 variable. merge()
doesn't work because I don't want to recycle the values.
x <- data.frame(k1=c(1,1,2,3,3,5),
k2=c(20,21,22,23,24,25))
x
y <- data.frame(k1=c(1,1,2,2,3,4,5,5),
k2=c(10,11,12,13,14,15,16,17))
y
merge(x,y,by="k1")
k1 k2.x k2.y
1 1 20
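One possible reading of "without recycling" is that the first row with a given k1 in x should pair only with the first such row in y, the second with the second, and so on. Under that assumption, a sketch is to number the occurrences within each k1 and merge on both columns. The helper column `occ` is hypothetical.

```r
x <- data.frame(k1 = c(1, 1, 2, 3, 3, 5),
                k2 = c(20, 21, 22, 23, 24, 25))
y <- data.frame(k1 = c(1, 1, 2, 2, 3, 4, 5, 5),
                k2 = c(10, 11, 12, 13, 14, 15, 16, 17))

# Occurrence counter within each k1 value (1st, 2nd, ... row with that key)
x$occ <- ave(seq_along(x$k1), x$k1, FUN = seq_along)
y$occ <- ave(seq_along(y$k1), y$k1, FUN = seq_along)

# Merging on key and counter uses each row at most once
paired <- merge(x, y, by = c("k1", "occ"))
paired$occ <- NULL

paired
```

Rows that have no partner with the same key and occurrence number are dropped, so each observation appears in at most one pair.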
2009 Dec 10
0
plm - tests of poolability - error: insufficient number
Hello Cecilia,
nice hearing from you again. I must restate a couple of my old hints,
though ;^)
1) please always put the authors c/c, as we are not guaranteed to browse
through the r-help every day
2) please provide reproducible examples.
As example(pooltest) keeps working fine, as do some other cases I tried...
2009 Apr 27
1
question about adaboost.
Hello,
I would like to know how to obtain the misclassification error when performing a boosting analysis with the adabag package.
With:
> prop.table(Tesis.boostcv$confusion)
I obtain the confusion matrix, but not the overall misclassification error.
Thanks in advance,
BSc. Cecilia Lezama
Facultad de Ciencias - UDELAR
Montevideo - Uruguay.
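Given any square confusion matrix, the overall misclassification error is one minus the share of observations on the diagonal. A minimal sketch with a made-up matrix `cm` follows; it assumes rows and columns list the classes in the same order, and the counts are purely illustrative.

```r
# Hypothetical 2-class confusion matrix (counts)
cm <- matrix(c(40, 5,
               10, 45),
             nrow = 2,
             dimnames = list(Predicted = c("A", "B"),
                             Observed  = c("A", "B")))

# Correct classifications lie on the diagonal
accuracy <- sum(diag(cm)) / sum(cm)
error    <- 1 - accuracy

error
```

The same expression works on a matrix of proportions, such as the output of `prop.table()`, because the total then sums to 1.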
2013 Jun 08
0
data
Hi,
Try this:
final3New<-read.table(file="real_data_cecilia.txt",sep="\t")
dim(final3New)
#[1] 5369    5
#Inside the split within split, dummy==1 for the first row. For lists that have many rows, I selected the row with dummy==0 (from the rest) using the condition that the absolute difference between the dimensions of those rows and the fi...
2007 Feb 12
6
Boxplot: quartiles/outliers
For boxplot(), is it possible to pass in a parameter to change the default
way that the 1st and 3rd quartiles are computed? (specifically, I'd like to
use type 6 described in the quantile function).
Also, what are the options for how outliers are computed, and how can one
change them?
Thank you
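boxplot() takes its hinges from fivenum() (through boxplot.stats()), so one workaround is to compute the box statistics yourself and draw them with bxp(). The sketch below uses quantile(type = 6) for the quartiles and keeps the usual 1.5 x IQR rule for the whiskers; the data `x` and the choice of `coef` are assumptions for illustration.

```r
set.seed(1)
x <- c(rnorm(50), 4, -5)   # hypothetical data with two extreme points

# Quartiles and median using quantile definition 6
q   <- quantile(x, probs = c(0.25, 0.5, 0.75), type = 6, names = FALSE)
iqr <- q[3] - q[1]

# Whiskers: most extreme data points within coef * IQR of the box
coef <- 1.5
lo <- min(x[x >= q[1] - coef * iqr])
hi <- max(x[x <= q[3] + coef * iqr])

# Start from the default statistics, then overwrite box, whiskers, outliers
b <- boxplot(x, plot = FALSE)
b$stats <- matrix(c(lo, q, hi), ncol = 1)
b$out   <- x[x < lo | x > hi]
b$group <- rep(1, length(b$out))

bxp(b)
```

Changing `coef` alters how far the whiskers reach and therefore which points are drawn as outliers; boxplot() exposes the same idea through its `range` argument.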
2011 May 19
2
balanced panel data
I have a dataframe with many firm-year observations and many variables.
Not all firms have information for all the years.
I want another dataframe with only those firms that have information all
years.
That is, I want a balanced panel, but with the maximum number of years.
In my reproducible example I want to keep firms 1, 2 and 3 (period 2000 to
2004).
I need your help to create a
2009 Jun 08
5
if else
Hi R-helpers!
I have the following dataframe:
firm<-c(rep(1:3,4))
year<-c(rep(2001:2003,4))
X1<-rep(c(10,NA),6)
X2<-rep(c(5,NA,2),4)
data<-data.frame(firm, year,X1,X2)
data
So I want to obtain the same dataframe with a variable X3
that is:
X1, if X2 is NA
X2, if X1 is NA
X1+X2, if X1 and X2 are not NA
So my final data is
X3<-c(15,NA,12,5,10,2,15,NA,12,5,10,2)
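A minimal sketch of the rule above using nested ifelse() calls. Note that a comparison such as `X2 == NA` always returns NA in R, so missing values have to be tested with is.na(). The data frame is named `dat` here rather than `data`, purely to avoid masking the base function.

```r
firm <- c(rep(1:3, 4))
year <- c(rep(2001:2003, 4))
X1   <- rep(c(10, NA), 6)
X2   <- rep(c(5, NA, 2), 4)
dat  <- data.frame(firm, year, X1, X2)

# X2 when X1 is missing, X1 when X2 is missing, the sum otherwise;
# rows where both are missing stay NA
dat$X3 <- ifelse(is.na(dat$X1), dat$X2,
                 ifelse(is.na(dat$X2), dat$X1, dat$X1 + dat$X2))

dat
```

An alternative with the same result on these data is `rowSums(dat[, c("X1", "X2")], na.rm = TRUE)` followed by resetting rows where both inputs are NA, since rowSums() would otherwise return 0 there.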
2009 Apr 19
2
importing spreadsheet data - linear regression - panel data
Hi everyone and thank you for the help you could give me.
My data is in a spreadsheet. The 1st column identifies the
firm (with the fiscal number), the columns 2 to 11 have
the variable value for 11 years. I have many variables
(files like this). Each file has about 40.000 firms
(rows). I transformed all the files into txt files. The data
is panel data, like this:
firm revenue2007 revenue2006