Displaying 17 results from an estimated 17 matches similar to: "Reshape set operations?"
2008 Aug 27
1
Updating a list.
I have a list that is generated from the resape package function 'cast'. It consists of three columns, Sku, DayOfYear, variable it is generated like:
r2007 <- cast(m2008, DayOfYear ~ variable | Sku, sum)
Now DayOfYear can range from 1:365 but there are not necessarily that many rows in the list. What I want to do is make every row in the list of lenght 365 and have the values
2008 Aug 08
2
Length of data.frame column
I have a beginner question. After I finally get the data to a data.frame that I can work with I have the following a data frame that is fairly long:
> length(r2007)
[1] 17409
If I look at the first element:
> r2007[1]
$`100009`
DayOfYear Quantity
1 66 1
2 128 1
3 137 1
4 193 1
Now how do I get the length of this list (actually it is
2008 Aug 05
1
RESHAPE cast help.
I have a set of data that is basically sales figures for a given year. It has columns for Yeaqr, Day Of Year, Sku, SubCatetory, and Category. The first few lines of data look like:
Year DayOfYear Sku Quantity CatId Category SubCategory
1 2007 1 100091 1 10862 HOLIDAY Christmas
2 2007 1 100138 1 11160 PET COSTUMES Famous
2009 Mar 11
1
Reshape question.
This hopefully is trivial. I am trying to reshape the data using the reshape package.
First I read in the data:
a2009 <- read.csv("Total2009.dat", header = TRUE)
Then I trim it so that it only contains the columns that I have interested in:
m2009 <- melt(a2009, id.var=c("DayOfYear","Category","SubCategory","Sku"),
2008 Jul 26
1
Simple vector question.
I have some data that I read in via read.csv:
sales2007 <- read.csv("Total2007.dat", header=TRUE)
The data looks like:
> sales2007[1:605,]
Year DayOfYear Sku Quantity CatId Category SubCategory
1 2007 1 100091 1 10862 HOLIDAY Christmas
2 2007 1 100138 1 11160 PET COSTUMES Famous (Licensed)
3 2007
2008 Aug 11
3
Exporting a list of lists
I have a list
List(Sku=" ", Shape=1, Scale=3, DayOfYear=daylist)
Note: picture daylist as c(2,3,4,3) it is a list with variable length.
Then I have a list of lists
al <- c(al, List(List(Sku=" ", Shape=1, Scale=3, DayOfYear=daylist))
Note: same comment on daylist as above.
So far this creates a list of lists just how I want it. If I do al[1] I get each member and the
2008 May 15
1
lattice histogram problem with integers values and nint
been puzzling over this for a day.
Summary
integer variable to use with histogram, 170,000 rows. Value is day of
year. Hist works, lattice histogram with nint does not work (spurious
spikes in display), lattice histogram using breaks=c(0:365) works
fine. Spike values appear to be sum of two adjacent bins. Want to
know if this is a familiar problem, and what the recommended
work-around is.
2008 Aug 07
1
Where is the archive? Calling a function within a function.
I seem to remember this topic coming up before so I decided to look at the archive and realized that I didn't know where it was. Is there a searchable archive for this list? Thank you.
My question is calling a function from within a function. I have
smerge <- function(d1,d2) {
temp <- merge(d1,d2,all=TRUE,by.x="DayOfYear",by.y="DayOfYear")
return
2008 Aug 08
1
write.csv writing the "index"
I have a simple command to export a data.frame:
write.csv(output, "TotalPredicted2008.dat")
The structure of the data.frame can be seen with:
> head(output)
DayOfYear Sales
1 1 1429
2 2 3952
3 3 3049
4 4 2844
5 5 2219
6 6 2340
But it seems that the index is getting written out to the file. The first few lines look like:
2008 Jul 26
4
Data length mismatch.
I have two vectos (list) that represent a years of data. Each "row" is represented by the day of year and the quantity that was sold for that day. I would like to form a new vector that is the difference between the two years of data. A sample of A (and similarly B) looks like:
> A[1:5,]
DayOfYear x
1 1 1429
2 2 3952
3 3 3049
4 4 2844
5 5
2012 Mar 27
1
Convert day of year back into a date format.
Hello,
I am having trouble figuring out how to convert a Day of Year integer
back into a Date format. For example I have the following:
date <- c('2008-01-01','2008-01-02','2008-01-03','2008-01-04','2008-01-05','2008-01-06','2008-01-07',
2008 Jan 02
2
Windows Shell Command under Vista
Dear R2008,
In Windows 2000, I used the following shell extension to allow right-click
rgui start in a directory (assuming rgui.exe is on the path:
-----
REGEDIT4
[HKEY_CLASSES_ROOT\Directory\shell\Run R]
[HKEY_CLASSES_ROOT\Directory\shell\Run R\command]
@="Rgui.exe --no-restore --no-save --internet2"
----
I switched to Vista this year. For reasons I don't understand, I have
2012 Oct 07
2
gam error message: matrix not +ve definite
Hello,
I'm running a multimodel analysis which involves fitting several GAM models
as implemented in package mgcv. The issue I'm having is that when I try to
fit my model, gam gives me the following error message: 'Error in
initial.sp(w * X, S, off) : S[[2]] matrix is not +ve definite.' The strange
part of this is that the error message stops my model fitting function when
run
2008 Jul 12
1
Assoociative array?
I have search the archive and I could not find what I need so I will try to ask the question here.
I read a table in (read.table)
a <- read.table(.....)
The table has column names like DayOfYear, Quantity, and Category.
The values in the row for Category are strings (characters).
I want to get all of the rows grouped by Category. The number of unique category names could be around 50. Say
2009 Jan 22
1
convergence problem gamm / lme
Hope one of you could help with the following question/problem:
We would like to explain the spatial
distribution of juvenile fish. We have 2135 records, from 75 vessels
(code_tripnr) and 7 to 39 observations for each vessel, hence the random effect
for code_tripnr. The offset (‘offsetter’) accounts for the haul duration and
sub sampling factor. There are no extreme outliers in lat/lon. The model
2011 Jul 07
3
How to make matrix missing data 0
Dear All,
I am trying to analysis traffic data with one timestamp column and speed
column. This data set contains six years data; for each year, I want to make
a matrix in (day of the year) * (hour)
0 1 2 . . . 23
1
2
.
.
365
However random day's record is missing(e.g. there are 40 missing records in
2005 data set), when I tried to use tapply function, matrix came out is
2009 Jan 28
2
t.test in a loop
Hi All,
I've been having a little trouble with creating a loop that will run a a
series of t.tests for inspection,
Below is the code i've tried, and some checks i've looked at.
I've used the get(paste()) idea as i was told previously that the use of the
eval should try and be avoided.
I've run a single syntax to check that my systax is correct and works
without any problems