similar to: I need guidance on better data management in preparation for time series analysis

Displaying 20 results from an estimated 8000 matches similar to: "I need guidance on better data management in preparation for time series analysis"

2000 Mar 01
2
Help please..
Hello R-world, I am facing a peculiar problem and hope someone out there can comment on it. In goodness-of-fit tests for evaluation of distributions, there are three well-known methods: 1. Chi-square 2. Anderson-Darling 3. Kolmogorov-Sminrov I am trying to use the second test. Many researchers have reported results using this test. I wrote programs in C and now in R to do this. I run into
2008 Oct 08
0
Applying an R script to data within MySQL? How to?
I am trying something I haven't attempted before and the available documentation doesn't quite answer my questions (at least in a way I can understand). My usual course of action would be to extract my data from my DB, do whatever manipulation is necessary, either manually or using a C++ program, and then import the data into R. Now I need to try to do it all within R+RMySQL+MySQL. I
2003 Feb 12
1
Na/NaN error in subsampling script
R-help readers, I''m having a problem with an R script (see below), which regularly generates the error message, Error in start:(start + (sample.length - 1)) : NA/NaN argument , for which I am unsure of the cause. In essence, the script (below) generates the start and end points for random subsamples from along a vector (in reality a transect (of a given length,
2008 Sep 25
0
Please help me interpret these results (fitting distributions to real data)
I just thought of a useful metaphore for the problem I face. I am dealing with a problem in business finance, with two kinds of related events. However, imagine you have a known amount of carbon (so many kilograms), but you do not know what fraction is C14 (and thus radioactive). Only the C14 will give decay events (and once that event has occurred, the atom that decayed will never decay
2011 May 21
2
unbalanced anova with subsampling (Type III SS)
Hello R-users, I am trying to obtain Type III SS for an ANOVA with subsampling. My design is slightly unbalanced with either 3 or 4 subsamples per replicate. The basic aov model would be: fit <- aov(y~x+Error(subsample)) But this gives Type I SS and not Type III. But, using the drop() option: drop1(fit, test="F") I get an error message: "Error in
2008 Oct 07
3
Fitting weibull, exponential and lognormal distributions to left-truncated data.
Dear All, I have two questions regarding distribution fitting. I have several datasets, all left-truncated at x=1, that I am attempting to fit distributions to (lognormal, weibull and exponential). I had been using fitdistr in the MASS package as follows: fitdistr<-(x,"weibull") However, this does not take into consideration the truncation at x=1. I read another posting in this
2012 Aug 16
1
Big Data reading subsample csv
Hello, I'm most grateful for your time to read this. I have a uber size 30GB file of 6 million records and 3000 (mostly categorical data) columns in csv format. I want to bootstrap subsamples for multinomial regression, but it's proving difficult even with my 64GB RAM in my machine and twice that swap file , the process becomes super slow and halts. I'm thinking about generating
2010 Jul 08
1
Query about using timestamps returned by SQL as 'factor' for split
I have a simple query as follows: "SELECT m_id,sale_date,YEAR(sale_date),WEEK(sale_date),return_type,DATEDIFF(return_date,sale_date) AS elapsed_time FROM risk_input" I can get, and view, all the data that that query returns. The question is, sale_date is a timestamp, and I need to call split to group this data by m_id and the week in which the sale occurred. Obviously, I would
2008 Oct 15
1
dbAppy questions/clarifications
In the example in the documentation, I see: rs <- dbSendQuery(con, "select Agent, ip_addr, DATA from pseudo_data order by Agent") out <- dbApply(rs, INDEX = "Agent", FUN = function(x, grp) quantile(x$DATA, names=FALSE)) Maybe I am a bit thick, but it took me a while, and a kind hint from Phil, to figure much of this out. It is clear that the SQL
2008 Sep 19
0
Re lative Novice ? "Can I get some explanation of the docs for fitdistr(MASS)?"
In the docs I see: Usage fitdistr(x, densfun, start, ...) Arguments x A numeric vector. densfun Either a character string or a function returning a density evaluated at its first argument. Distributions "beta", "cauchy", "chi-squared", "exponential", "f", "gamma", "geometric", "log-normal", "lognormal",
2011 May 13
0
routine for dependent correlation test with stratified random sample
Dear R-List,   I would like to have a large number of stratified random subsamples drawn from my dataframe and automatically test for correlation differences in every subsample.   Let this be my dataframe   df<-data.frame(group=c(rep(1,5),rep(2,5),rep(3,5)),a=c(3,4,5,6,3,4,5,4,5,4,1,2,1,2,1),b=c(1,2,3,4,5,3,4,3,4,5,6,5,6,2,3),c=c(2,2,3,3,5,1,1,6,6,5,6,1,1,2,1))   Then I would like to have n
2009 Jan 14
1
fitting data
Dear all, I have a set of data which seem to be distributed almost exponentially but only on [0;1]. I guess that the probability distribution in this case would look like \frac{lambda}{1-e^{-\lambda}) e^{-\lambda x} I would like to use fitdistr to estimate the value of \lambda. 1) Would it be correct first to find lambda for the exponential distribution and then to substitute it in the formula
2006 Sep 23
1
Fitdistr() versus nls()
Hello R-Users, I'm new to R so I apologize in advance for any big mistake I might be doing. I'm trying to fit a set of samples with some probabilistic curve, and I have an important question to ask; in particular I have some data, from which I calculate manually the CDF, and then I import them into R and try to fit: I have the x values (my original samples) and the y values
2005 Jan 14
5
subsampling
hi, I would like to subsample the array c(1:200) at random into ten subsamples v1,v2,...,v10. I tried with to go progressively like this: > x<-c(1:200) > v1<-sample(x,20) > y<-x[-v1] > v2<-sample(y,20) and then I want to do: >x<-y[-v2] Error: subscript out of bounds.
2004 Oct 27
1
Warning messages in function fitdistr (library:MASS)
Why the warning messages (2:4)? > x <- rexp(1000,0.2) > fitdistr(x,"exponential",list(rate=1)) rate 0.219824219 (0.006951308) Warning messages: 1: one-diml optimization by Nelder-Mead is unreliable: use optimize in: optim(start, mylogfn, x = x, hessian = TRUE, ...) 2: NaNs produced in: dexp(x, 1/rate, log) 3: NaNs produced in: dexp(x, 1/rate, log) 4: NaNs
2009 Apr 06
3
how to subsample all possible combinations of n species taken 1:n at a time?
Hello I apologise for the length of this entry but please bear with me. In short: I need a way of subsampling communities from all possible communities of n taxa taken 1:n at a time without having to calculate all possible combinations (because this gives me a memory error - using combn() or expand.grid() at least). Does anyone know of a function? Or can you help me edit the combn or
2003 Sep 06
1
Fitting t-Student(mu, sigma, nu)
Dear R users: 1. Is there a function which fits to the data the t-student distribution with parameters mu, sigma, nu. Is the function fitdistr of MASS with the syntax fitdistr(x, "t") appropriate for this? 2. Is there a function which can fit the exponential power distribution? Thanks. --------------------------------- ÁðïêôÞóôå ôçí äùñåÜí óáò@yahoo.gr [[alternative
2008 Feb 09
1
Problem with fitdistr function while estimating parameters
Hello, I am using fitdistr function for parameter estimation. When I use fd<-fitdistr(V2,"gamma") I get following error: Error in optim(x = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, : initial value in 'vmmin' is not finite fd<-fitdistr(V2,"weibull") Error in optim(x = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, :
2012 Jul 02
1
Fitting and Plotting the fitted distributions
Dear all, I have wrote some sample code that would allow me easier fit fast many distributions and check which of the fits performs better. My sample code (that you can of course execute it looks like that) distrList<-list(   "exponential",  "geometric", "log-normal",  "normal", "Poisson") fitfunction<-function(Type,x){     return
2008 Sep 24
0
Simulations / repetitions help!
Dear all, My question concerns using repetitions and simulations (loops?) in R. I am very new R user, so any help that can be offered would be greatly appreciated! I am using fitdistr() to determine the distribution of empirical univariate datasets, and ks.test to assess the goodness of fit. Because the null distribution of the KS statistic is not known when the distribution parameters are