similar to: "Large" data set: performance issue

Displaying 20 results from an estimated 100 matches similar to: ""Large" data set: performance issue"

2003 Feb 24
1
Mass: lda and collinear variables
hello list, when I use method lda of the MASS package I experience a warning: variables are collinear in: lda.default(data[train, ], classes[train]) Is there an easy way to recover from this issue within the MASS package? Or how can I tell how severe this issue is at all? I understand that I shouldn't use lda at all with collinear data and should use "quadratische" (squared?)
2002 Dec 31
1
Selecting variables from a data.frame
Hi all, currently I'm working with physical data stored in a data.frame. I have N observations, typically 100-300 per data set. Each row in a set holds M (typically 2100) variables which represent a curve. For linear discriminant analysis I chose first to do a wavelet transform (because M >> N) and then feed the transformed data (of level L) in lda. This works fine (e.g. error <
2003 Mar 11
3
R-Graphics: Scaling axis
Hi, how can I scale the x- and y-axis of a "plot" to the same scale? My problem: The following command sequence produces the plot in a square. What I want is the x-axis to be 5 times as wide (measured e.g. in pixels) as the y-axis is long (because y ranges from -1 to 1 and x ranges from 0 to 10). x <- seq( from=0, to=10, by=.1) sinx <- sin(x) plot( x, sinx, type="l")
2018 Apr 17
0
Time intervals in a datframe
> On Apr 17, 2018, at 10:10 AM, Allaisone 1 <allaisone1 at hotmail.com> wrote: > > > Hi all > > I have a list of multiple datframes with the same column headers. The last column in each datframe contains a vector of "Interval" class after I have produced this column using "lubridate" package. I needed to convert my list of dataframes to be in a single
2018 Apr 17
2
Time intervals in a datframe
Hi all I have a list of multiple datframes with the same column headers. The last column in each datframe contains a vector of "Interval" class after I have produced this column using "lubridate" package. I needed to convert my list of dataframes to be in a single dataframe for further analysis. I did this using the following syntax : SingleDataframe <- ldply
2005 Mar 25
2
tapply and NA value
Hi, I'm writing for a little help. I have a dataframe with same NA value and I'd like to obtain the means of the value of a coloumn grouped by the levels of a factor coloumn of the datframe. I'm using the function "tapply" but I see that if only a NA value is present the result is NA. There is an option to have the correct result or I must use an other function? Thanks of
2009 Aug 28
2
Pls package
Hi, I have managed to format my data into a single datframe consisting of two AsIs response and predictor dataframes in order to supply the plsr command of the pls package for principal components analysis. When I execute the command, however, I get this error: > fiber1 <- plsr(respmat ~ predmat, ncomp=1, data=inputmat,validation="LOO") Error in model.frame.default(formula =
2004 Jun 29
1
nls fitting problems (singularity)
Hallo! I have a problem with fitting data with nls. The first example with y1 (data frame df1) shows an error, the second works fine. Is there a possibility to get a fit (e.g. JMP can fit also data I can not manage to fit with R). Sometimes I also got an error singularity with starting parameters. # x-values x<-c(-1,5,8,11,13,15,16,17,18,19,21,22) # y1-values (first data set)
2008 Nov 26
1
Creating a vector based on lookup function
I am still searching for a solution to what i think is a simple problem i am having with building a vector in a for loop. I have built a more understandable example so hopefully that will help..help you, help me, if you know what i mean. dev=400 #test location model TAZs to reference cands=c(101,105,109) #Create Object of length of cands candslength=length(cands) #TEST TAZ Vector
2006 Oct 01
3
aggregate function with 'NA'
Dear r-help reader, I have some problems with the aggregate function. My datframe looks like >frame Day Time V1 V2 1 M 0 3 NA 2 M 0 4 NA 3 M 0 5 2 4 M 1 NA 4 5 M 1 10 6 6 T 0 4 45 7 T 1 4 3 8 T 1 3 2 9 T 1 6 1 I used the aggegate function to obtain the mean in V1 and V2 over the grouping variable Time and Day
2018 May 02
2
using apply
Hi I have 3 dataframes, a,b,c with 0/1 values...i have to check a condition for dataframe a and b and then input the rows ids to datframe c . In the if condition, I AND the 2 rows of from a and b and then see if the result is equal to one of them. I have done this using a for loop, however, it takes a long time to execute with larger dataset..Can you help me do it using apply function so that i
2004 Jun 23
1
Fitting function with if-clause (nls; e.g. heaviside)
Hallo! I want to fit a function. The function is e.g.: y = c+m1*x if x<0, c+m2*x if x>=0 where m1, m2 and c is a parameter and x, y are variables of a data frame. I think using nls is appropriate. But I do not know, how to type this formula in nls. Can anybody help? (If there is a possibility to use a Heaviside-function this would be enough.) Karl
2006 Nov 28
0
Consulting request R training and programming
Hello everybody, first I would like to apologize my consulting request on this help list but I couldn't find any ressources about consulties on the net. In our company we are searching on alternatives to SPSS and after a very short test, R could satisfy our needs completly. Unfortunately we are not able to cover all required features, like a little more sophisticated tabularization, an
2018 May 02
0
using apply
Hi Neha, Perhaps merge() from base or join from dplyr is what you are looking for. data. table could also be interesting. Hth Ulrik On Wed, 2 May 2018, 21:28 Neha Aggarwal, <aggarwalneha2000 at gmail.com> wrote: > Hi > > I have 3 dataframes, a,b,c with 0/1 values...i have to check a condition > for dataframe a and b and then input the rows ids to datframe c . In the if >
2009 Jan 02
1
Calculating signicance value
Hi friends, If someone can find out some time to go through my problem would be really grateful. I have a dataset(dataset1) as shown below:-- recmeanC1 recmeanC2 recmeanC3 recmeanC4 i1 i2 i3 i4 i5 i6 i7 i8 i9 i10 i11 1 NA 1 1.00 1.800000 NA 1 NA 1 1 NA 2 2 2 NA 2 2 2 2 1.00
2001 Aug 17
0
making a neat timetable
On Fri, 17 Aug 2001, Patrick Connolly wrote: > |> The data are stored in a MySQL table, and I can read them > |> into R with RMySQL obtaining a MySQLResultSet object (which I > |> suppose is a data frame ?) which looks like this: > > No, it doesn't have any column names. It would be a good idea to get > it into one since dataframes are very good ways of
2011 May 23
1
Applying boxplot.stats to multiple value lists
Hello all R gurus, I have a following problem which I hope someone will help me to solve. I have a data.frame in form similar to below. > testframe<-data.frame("Name"=c("aa","aa","aa","aa","aa","bb","bb","bb","bb","bb"),"Value"=c(1,100,1,1,1,100,100,100,100,1))
2003 Mar 03
1
Q: Best-Practice for Swing-GUI calling R-code on Windows?
org.omegahat.R.Java.REvaluator e = new org.omegahat.R.Java.REvaluator(); Object val = e.eval("objects()"); if(val != null) { String[] objects = (String[])val; for(int i = 0 ; i < objects.length; i++) System.err.println("("+i+") " + objects[i]); } hello, thanks to Philippe Grosjean's work I finally got SJava working (on Windows XP!!), so that I can
2001 May 09
2
[Newbie] Row-Iterator for data.frame??
hello all, for my diploma-thesis i want to statitically analyze near-infrared-spectra. a spectrum is given by the y-values of 1038 equi-distant x-points. in nature, a spectrum is a continuous curve. for analysis, every x-point is seen as a statistical variable. now my problem: first, i read a csv-table in a data.frame called sTable via read.table. besides some meta-data there are 1038 variables
2006 Aug 06
1
ordering by a datframe date
I am hoping for some advice regarding ordering a dataframe, by date. The dataframe is in the format below. $story $datepub story10 1 April 1999 story 90 1 March 2002 story 37 10 July 1985 I want to reorder the entire dataframe so the earliest story is first, and save the reordered dataframe. The command, 'class' (datepub) reveals $datepub is a factor variable. I tried