thr3ads.net - similar to: "Recreate new dataframe based on condition"

2006 May 23

5

conditional replacement

Hi How can do this in R. >df 48 1 35 32 80 If df < 30 then replace it with 30 and else if df > 60 replace it with 60. I have a large dataset so I cant afford to identify indexes and then replace. Desired o/p: 48 30 35 32 60 Thanx in advance. Sachin

Handling large dataset & dataframe [Broadcast]

2006 Apr 24

1

Handling large dataset & dataframe [Broadcast]

Here's a skeletal example. Embellish as needed: p <- 5 n <- 300 set.seed(1) dat <- cbind(rnorm(n), matrix(runif(n * p), n, p)) write.table(dat, file="c:/temp/big.txt", row=FALSE, col=FALSE) xtx <- matrix(0, p + 1, p + 1) xty <- numeric(p + 1) f <- file("c:/temp/big.txt", open="r") for (i in 1:3) { x <- matrix(scan(f, nlines=100), 100,

Creat new column based on condition

2006 Apr 21

3

Creat new column based on condition

Hi, How can I accomplish this task in R? V1 10 20 30 10 10 20 Create a new column V2 such that: If V1 = 10 then V2 = 4 If V1 = 20 then V2 = 6 V1 = 30 then V2 = 10 So the O/P looks like this V1 V2 10 4 20 6 30 10 10 4 10 4 20 6 Thanks in advance. Sachin

Dataframe modification

2006 Aug 21

1

Dataframe modification

Hi, How can I accomplish this in R. I have a Dataframe with 3 columns. Column B and C have same elements. But column A has more elements than B and C. I want to compare Column A with B and do the following: If A is not in B then insert a new row in B and C and fill these new rows with B = A and C = 0. So finally I will have balanced dataframe with equal no of rows (entries) in

Conditional Row Sum

2006 Apr 20

2

Conditional Row Sum

Hi, How can I accomplish this in R. Example: R1 R2 3 101 4 102 3 102 18 102 11 101 I want to find Sum(101) = 14 - i.e SUM(R1) where R2 = 101 Sum(102) = 25 - SUM(R2) where R2 = 102 TIA Sachin --------------------------------- [[alternative HTML version deleted]]

write.table & csv help

2006 Jun 26

2

write.table & csv help

Hi, How can I produce the following output in .csv format using write.table function. for(i in seq(1:2)) { df <- rnorm(4, mean=0, sd=1) write.table(df,"C:/output.csv", append = TRUE, quote = FALSE, sep = ",", row.names = FALSE, col.names = TRUE) } Current O/p: x 0.287816 -0.81803 -0.15231 -0.25849 x 2.26831 0.863174

vlookup in R

2008 Mar 24

6

vlookup in R

Hi, Is there are function similar to excel vlookup in R. Please let me know. Thanks, Sachin ____________________________________________________________________________________ [[alternative HTML version deleted]]

Handling large dataset & dataframe

2006 Apr 24

6

Handling large dataset & dataframe

Hi, I have a dataset consisting of 350,000 rows and 266 columns. Out of 266 columns 250 are dummy variable columns. I am trying to read this data set into R dataframe object but unable to do it due to memory size limitations (object size created is too large to handle in R). Is there a way to handle such a large dataset in R. My PC has 1GB of RAM, and 55 GB harddisk space running

prediction interval for new value

2006 Sep 15

2

prediction interval for new value

Hi, 1. How do I construct 95% prediction interval for new x values, for example - x = 30000? 2. How do I construct 95% confidence interval? my dataframe is as follows : >dt structure(list(y = c(26100000, 60500000, 16200000, 30700000, 70100000, 57700000, 46700000, 8600000, 10000000, 61800000, 30200000, 52200000, 71900000, 55000000, 12700000 ), x = c(108000, 136000,

Plotting Frequency Distribution in R

2008 May 13

2

Plotting Frequency Distribution in R

Hi, How can plot a frequency distribution curve for the following data. V1 V2 1 1 160.54% 2 1 201.59% 3 1 18.45% 4 1 179.03% 5 1 274.37% 6 1 0.00% 7 1 24.52% 8 1 39.17% 9 3 43.72% 10 1 53.06% 11 1 64.97% 12 1 79.84% 13 1 98.08% 14 1 115.32% 15 1 127.96% 16 1 155.38% 17 1 157.25% 18 1 193.17% 19 1 51.53% 20 15 99.32% 21 1 106.86% 22 1 219.44%

Subset dataframe based on condition

2006 Apr 17

3

Subset dataframe based on condition

Hi, I am trying to extract subset of data from my original data frame based on some condition. For example : (mydf -original data frame, submydf - subset dada frame) >submydf = subset(mydf, a > 1 & b <= a), here column a contains values ranging from 0.01 to 100000. I want to extract only those matching condition 1 i.e a > . But when i execute this command it is

write.csv + appending output (FILE I/O)

2006 May 18

1

write.csv + appending output (FILE I/O)

Hi, How can I write the output to an excel (csv) file without printing row names (i.e without breaks). Here is my code: library( fn <- function() { q <- c(1,2,3) write.csv(q,"C:/Temp/op.xls", append = TRUE, row.names = FALSE,quote = FALSE) } # Function Call for(i in 1:3) { fn() } Present Output : x 1 2 3 x 1 2

dataframe of unequal rows

2006 Aug 18

2

dataframe of unequal rows

Hi, How can I read data of unequal number of observations (rows) as is (i.e. without introducing NA for columns of less observations than the maximum. Example: A B C D 1 10 1 12 2 10 3 12 3 10 4 12 4 10 5 10 Thanks in advance. Sachin --------------------------------- [[alternative HTML version deleted]]

Couldn't find Order with ID=pending_orders

2010 Feb 22

9

Couldn't find Order with ID=pending_orders

I have a Controller named Orders which has a pending_orders method which is expected to fetch some records from the database. If i dont write a route for this method, I get the following error when i call this method. Couldn''t find Order with ID=pending_orders I am using rails 2.3.5, in the previous versions i use to get this I am not getting whether its new version requirement... Help

KPSS test

2006 Jul 06

2

KPSS test

Hi, Am I interpreting the results properly? Are my conclusions correct? > KPSS.test(df) ---- ---- KPSS test ---- ---- Null hypotheses: Level stationarity and stationarity around a linear trend. Alternative hypothesis: Unit root. ---- Statistic for the null hypothesis of level stationarity: 1.089 Critical values: 0.10 0.05 0.025 0.01 0.347 0.463

subsetting a dataframe

2003 Apr 07

4

subsetting a dataframe

How does one remove a column from a data frame when the name of the column to remove is stored in a variable? For Example: colname <- "LOT" newdf <- subset(olddf,select = - colname) The above statement will give an error, but thats what I'm trying to accomplish. If I had used: newdf <- subset(olddf,select = - LOT) then it would have worked, but as I said the column

cdf of weibull distribution

2006 Apr 26

1

cdf of weibull distribution

Hi, I have a data set which is assumed to follow weibull distr'. How can I find of cdf for this data. For example, for normal data I used (package - lmomco) >cdfnor(15,parnor(lmom.ub(c(df$V1)))) Also, lmomco package does not have functions for finding cdf for some of the distributions like lognormal. Is there any other package, which can handle these distributions?

Insert rows - how can I accomplish this in R

2006 Aug 18

1

Insert rows - how can I accomplish this in R

Hi, I have following dataframe. Column A indicates months. DF <- structure(list(A = c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 1, 2, 3, 4, 5, 7, 8, 11, 12, 1, 2, 3, 4, 5, 8), B = c(0, 0, 0, 8, 0, 19, 5, 19, 0, 0, 0, 11, 0, 8, 5, 11, 19, 8, 11, 10, 0, 8, 36, 10, 16, 10, 22)), .Names = c("A", "B"), class = "data.frame", row.names = c("1",

Quickie : unload library

2006 Aug 25

5

Quickie : unload library

Dear list, I know it must be obvious and I did my homework. (In fact I've RSiteSearched with keyword "remove AND library" but got timed out.(why?)....) How do I unload a library? I don't mean getting ride of it permanently but just to unload it for the time being. A related problem : I have some libraries loaded at startup in .First() which I have in .Rprofile. Now, I exited R

Unexpected behavior in factor level ordering

2012 Feb 25

1

Unexpected behavior in factor level ordering

Hello, Everybody: This may not be a "bug", but for me it is an unexpected outcome. A factor variable's levels do not retain their ordering after the levels function is used. I supply an example in which a factor with values "BC" "AD" (in that order) is unintentionally re-alphabetized by the levels function. To me, this is very bad behavior. Would you agree? #

similar to: Recreate new dataframe based on condition