Displaying 20 results from an estimated 100 matches similar to: ""Large" data set: performance issue"
2003 Feb 24
1
Mass: lda and collinear variables
hello list,
when I use method lda of the MASS package I experience a warning:
variables are collinear in: lda.default(data[train, ], classes[train])
Is there an easy way to recover from this issue within the MASS package?
Or how can I tell how severe this issue is at all?
I understand that I shouldn't use lda at all with collinear data and should
use "quadratische" (squared?)
2002 Dec 31
1
Selecting variables from a data.frame
Hi all,
currently I'm working with physical data stored in a data.frame. I have
N observations, typically 100-300 per data set.
Each row in a set holds M (typically 2100) variables which represent a curve.
For linear discriminant analysis I chose first to do a wavelet transform
(because M >> N) and then feed the transformed data (of level L) in lda.
This works fine (e.g. error <
2003 Mar 11
3
R-Graphics: Scaling axis
Hi,
how can I scale the x- and y-axis of a "plot" to the same scale?
My problem: The following command sequence produces the plot in a square.
What I want is the x-axis to be 5 times as wide (measured e.g. in pixels)
as the y-axis is long (because y ranges from -1 to 1 and x ranges from 0
to 10).
x <- seq( from=0, to=10, by=.1)
sinx <- sin(x)
plot( x, sinx, type="l")
2018 Apr 17
0
Time intervals in a datframe
> On Apr 17, 2018, at 10:10 AM, Allaisone 1 <allaisone1 at hotmail.com> wrote:
>
>
> Hi all
>
> I have a list of multiple datframes with the same column headers. The last column in each datframe contains a vector of "Interval" class after I have produced this column using "lubridate" package. I needed to convert my list of dataframes to be in a single
2018 Apr 17
2
Time intervals in a datframe
Hi all
I have a list of multiple datframes with the same column headers. The last column in each datframe contains a vector of "Interval" class after I have produced this column using "lubridate" package. I needed to convert my list of dataframes to be in a single dataframe for further analysis. I did this using the following syntax :
SingleDataframe <- ldply
2005 Mar 25
2
tapply and NA value
Hi,
I'm writing for a little help.
I have a dataframe with same NA value and I'd like to obtain the means of the
value of a coloumn grouped by the levels of a factor coloumn of the datframe.
I'm using the function "tapply" but I see that if only a NA value is present
the result is NA.
There is an option to have the correct result or I must use an other function?
Thanks of
2009 Aug 28
2
Pls package
Hi,
I have managed to format my data into a single datframe consisting of two AsIs response and predictor dataframes in order to supply the plsr command of the pls package for principal components analysis.
When I execute the command, however, I get this error:
> fiber1 <- plsr(respmat ~ predmat, ncomp=1, data=inputmat,validation="LOO")
Error in model.frame.default(formula =
2004 Jun 29
1
nls fitting problems (singularity)
Hallo!
I have a problem with fitting data with nls. The first
example with y1 (data frame df1) shows an error, the
second works fine.
Is there a possibility to get a fit (e.g. JMP can fit
also data I can not manage to fit with R). Sometimes I
also got an error singularity with starting
parameters.
# x-values
x<-c(-1,5,8,11,13,15,16,17,18,19,21,22)
# y1-values (first data set)
2008 Nov 26
1
Creating a vector based on lookup function
I am still searching for a solution to what i think is a simple problem i am
having with building a vector in a for loop. I have built a more
understandable example so hopefully that will help..help you, help me, if
you know what i mean.
dev=400
#test location model TAZs to reference
cands=c(101,105,109)
#Create Object of length of cands
candslength=length(cands)
#TEST TAZ Vector
2006 Oct 01
3
aggregate function with 'NA'
Dear r-help reader,
I have some problems with the aggregate function.
My datframe looks like
>frame
Day Time V1 V2
1 M 0 3 NA
2 M 0 4 NA
3 M 0 5 2
4 M 1 NA 4
5 M 1 10 6
6 T 0 4 45
7 T 1 4 3
8 T 1 3 2
9 T 1 6 1
I used the aggegate function to obtain the mean in V1 and V2 over the
grouping variable
Time and Day
2018 May 02
2
using apply
Hi
I have 3 dataframes, a,b,c with 0/1 values...i have to check a condition
for dataframe a and b and then input the rows ids to datframe c . In the if
condition, I AND the 2 rows of from a and b and then see if the result is
equal to one of them.
I have done this using a for loop, however, it takes a long time to execute
with larger dataset..Can you help me do it using apply function so that i
2004 Jun 23
1
Fitting function with if-clause (nls; e.g. heaviside)
Hallo!
I want to fit a function. The function is e.g.:
y = c+m1*x if x<0, c+m2*x if x>=0
where m1, m2 and c is a parameter and x, y are
variables of a data frame.
I think using nls is appropriate. But I do not know,
how to type this formula in nls. Can anybody help?
(If there is a possibility to use a Heaviside-function
this would be enough.)
Karl
2006 Nov 28
0
Consulting request R training and programming
Hello everybody,
first I would like to apologize my consulting request on this help list but I couldn't find any ressources about consulties on the net.
In our company we are searching on alternatives to SPSS and after a very short test, R could satisfy our needs completly.
Unfortunately we are not able to cover all required features, like a little more sophisticated tabularization, an
2018 May 02
0
using apply
Hi Neha,
Perhaps merge() from base or join from dplyr is what you are looking for.
data. table could also be interesting.
Hth
Ulrik
On Wed, 2 May 2018, 21:28 Neha Aggarwal, <aggarwalneha2000 at gmail.com> wrote:
> Hi
>
> I have 3 dataframes, a,b,c with 0/1 values...i have to check a condition
> for dataframe a and b and then input the rows ids to datframe c . In the if
>
2009 Jan 02
1
Calculating signicance value
Hi friends,
If someone can find out some time to go through my problem would be really
grateful.
I have a dataset(dataset1) as shown below:--
recmeanC1 recmeanC2 recmeanC3 recmeanC4 i1 i2 i3 i4 i5 i6 i7
i8 i9 i10 i11
1 NA 1 1.00 1.800000
NA 1 NA 1 1 NA 2 2 2 NA 2
2 2 2 1.00
2001 Aug 17
0
making a neat timetable
On Fri, 17 Aug 2001, Patrick Connolly wrote:
> |> The data are stored in a MySQL table, and I can read them
> |> into R with RMySQL obtaining a MySQLResultSet object (which I
> |> suppose is a data frame ?) which looks like this:
>
> No, it doesn't have any column names. It would be a good idea to get
> it into one since dataframes are very good ways of
2011 May 23
1
Applying boxplot.stats to multiple value lists
Hello all R gurus,
I have a following problem which I hope someone will help me to solve.
I have a data.frame in form similar to below. > testframe<-data.frame("Name"=c("aa","aa","aa","aa","aa","bb","bb","bb","bb","bb"),"Value"=c(1,100,1,1,1,100,100,100,100,1))
2003 Mar 03
1
Q: Best-Practice for Swing-GUI calling R-code on Windows?
org.omegahat.R.Java.REvaluator e = new
org.omegahat.R.Java.REvaluator();
Object val = e.eval("objects()");
if(val != null) {
String[] objects = (String[])val;
for(int i = 0 ; i < objects.length; i++)
System.err.println("("+i+") " + objects[i]);
}
hello,
thanks to Philippe Grosjean's work I finally got SJava working (on Windows
XP!!), so that I can
2001 May 09
2
[Newbie] Row-Iterator for data.frame??
hello all,
for my diploma-thesis i want to statitically analyze near-infrared-spectra.
a spectrum is given by the y-values of 1038 equi-distant x-points.
in nature, a spectrum is a continuous curve. for analysis, every x-point
is seen as a statistical variable.
now my problem:
first, i read a csv-table in a data.frame called sTable via read.table.
besides some meta-data there are 1038 variables
2006 Aug 06
1
ordering by a datframe date
I am hoping for some advice regarding ordering a dataframe, by date.
The dataframe is in the format below.
$story $datepub
story10 1 April 1999
story 90 1 March 2002
story 37 10 July 1985
I want to reorder the entire dataframe so the earliest story is first, and
save the reordered dataframe. The command, 'class' (datepub) reveals
$datepub is a factor variable.
I tried