Displaying 20 results from an estimated 11000 matches similar to: "R and Clusters"
2009 Apr 21
2
Changing the binning of collected data
Dear All,
Apologies if this is too simple for this list.
Let us assume that you have an instrument measuring particle distributions.
The output is a set of counts {n_i} corresponding to a set of average
sizes {d_i}.
The set of {d_i} ranges from d_i_min to d_i_max either linearly of
logarithmically.
There is no access to further detailed information about the
distribution of the measured sizes, but
2009 Jul 09
1
Creating and Using Objects in R
Dear All,
I am not very into object-oriented programming, but I would like to
learn the ropes for some R applications.
Quoting from the online R language definition (paragraph 5.1)
> Consider the following simple example. A point in two-dimensional
> Euclidean space can be specified by its Cartesian (x-y) or polar
> (r-theta) coordinates. Hence, to store information about the location
2007 Apr 05
17
Reasons to Use R
Dear All,
The institute I work for is organizing an internal workshop for High
Performance Computing (HPC).
I am planning to attend it and talk a bit about fluid dynamics, but
there is also quite a lot of interest devoted to data post-processing
and management of huge data sets.
A lot of people are interested in image processing/pattern recognition
and statistic applied to geography/ecology, but I
2019 Jul 09
3
[R] Curl4, Quantmod, tseries and forecast
Hi Ralf,
I tried the following
> install.packages("RCurl")
which went OK, but then same story when I tried to install tseries.
> sessionInfo()
R version 3.6.1 (2019-07-05)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux 10 (buster)
Matrix products: default
BLAS: /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.8.0
LAPACK:
2013 Mar 24
3
Parallelizing GBM
Dear All,
I am far from being a guru about parallel programming.
Most of the time, I rely or randomForest for data mining large datasets.
I would like to give a try also to the gradient boosted methods in GBM,
but I have a need for parallelization.
I normally rely on gbm.fit for speed reasons, and I usually call it this
way
gbm_model <- gbm.fit(trainRF,prices_train,
offset = NULL,
misc =
2013 Feb 09
3
Addressing Columns in a Data Frame
Dear All,
Probably a one liner, but I am banging my head against the floor.
Consider the following
DF <- data.frame(
x=1:10,
y=10:1,
z=rep(5,10),
a=11:20
)
mn<-names(DF)
but then I cannot retrieve a column by doing e.g,
DF$mn[2]
I tried to play with the quotes and so on, but so far with no avail.
Any suggestion is welcome.
Cheers
Lorenzo
2013 Jan 19
2
Deformulation and R
Dear All,
I hope this is not too off-topic.
Essentially, I need to know if there is any R package which can help me
with a deformulation project.
Suppose e.g. that you know from a chemical analysis the fat, mineral,
vitamin, energy [and so on] content of a certain food product.
You also know the ingredients of this product (e.g. milk, lactose,
vegetable oil) and you know the chemical
2009 Jul 20
3
Histograms on a log scale
Dear All,
I would like to be able to plot histograms/densities on a semi-log or
log-log scale.
I found several suggestions online
http://tolstoy.newcastle.edu.au/R/help/05/09/12044.html
https://stat.ethz.ch/pipermail/r-help/2002-June/022295.html
http://www.harding.edu/fmccown/R/#histograms
Now, consider the code snippet taken from
http://www.harding.edu/fmccown/R/#histograms
# Get a random
2011 Dec 15
3
From Distance Matrix to 2D coordinates
Dear All,
I am struggling with the following problem: I am given a NxN symmetric
matrix P ( P[i,i]=0, i=1...N and P[i,j]>0 for i!=j) which stands for the
relative distances of N points.
I would like use it to get the coordinates of the N points in a 2D
plane. Of course, the solution is not unique (given one solution, I can
translate or rotate all the points by the same amount and generate
2007 Aug 08
2
Relocating Axis Label/Title --2
Apologies for the previous mail (I sent it off too early by mistake).
This is the correct example:
rm(list=ls())
D_mean<-seq(-5,5,length=100)
y<-exp(-D_mean^2/5)
pdf("my.pdf")
plot(D_mean,y,type="l",yaxt="n",lty=2,lwd=2,col="black",
ylab = list(expression(paste(dN/dlogD[agg]," ["*cm^-3*"]"))),
xlab = expression(paste(D[agg],"
2012 Oct 26
2
Stata Database & R
Dear All,
I am given some data to analyze. The data is in the form of a Stata
database (.dta file).
What is the best way to import it into an R dataframe?
Is there any particular caveat I should be aware of?
Many thanks
Lorenzo
2012 Oct 05
2
Test for Random Points on a Sphere
Dear All,
I implemented an algorithm for (uniform) random rotations.
In order to test it, I can apply it to a unit vector (0,0,1) in Cartesian
coordinates.
The result is supposed to be a set of random, uniformly distributed,
points on a sphere (not the point of the algorithm, but a way to test it).
This is what the points look like when I plot them, but other then
eyeballing them, can anyone
2013 Mar 25
2
Reassign Multiple Factors to same Factor Value
Dear All,
Probably something very easy, but I am looking for the most efficient ways
to achieve this.
Consider the following snippet
y<-c('a','b','c','d','e','f','g')
x<-rnorm(length(y))
df<-data.frame(y,x)
leading to
> df$y
[1] a b c d e f g
Levels: a b c d e f g
Now, I would like to replace levels
2013 Jan 28
1
RandomForest and Missing Values
Dear All,
I would like to use a randomForest algorithm on a dataset.
The set is not particularly large/difficult to handle, but it has some
missing values (both factors and numerical values).
According to what I found
https://stat.ethz.ch/pipermail/r-help/2005-September/078880.html
https://stat.ethz.ch/pipermail/r-help/2007-January/123117.html
the randomForest package has a problem with missing
2016 Apr 19
3
Problem with X11
Dear All,
I have never had this problem before. I run debian testing on my box
and I have recently update my R environment.
Now, see what happens when I try the most trivial of all plots
> plot(seq(22))
Error in (function (display = "", width, height, pointsize, gamma, bg,
:
X11 module cannot be loaded
In addition: Warning message:
In (function (display = "", width,
2007 Apr 14
1
[LLVMdev] "Name that compiler"
Looks like people send many names in the contest,
so shall I do too.
I propose "Lepton", loosely meaning light (featherweight) in Greek.
It is used in particle physics to refer to very light particles
(electrons, muons).
For me it also has the connotations of fastness and
restlessness (in the LLVM sense of post-compile
optimization).
See more at:
2010 Feb 26
3
Plotting a Trivial Matrix
Dear All,
Consider a matrix (N x N) where each entry is either zero or one (can
hardly get any simpler).
Now, I would like to plot it as a 'chessboard' where every matrix entry
is a black (1) or white (0) square.
Whatever tool I use to plot it, it should not try to interpolate the
data at all.
I found some online references
http://www.phaget4.org/R/image_matrix.html
but probably I can
2019 Jul 07
2
Curl4, Quantmod, tseries and forecast
Dear All,
I have just upgraded to Debian stable 10 and rebuilt most of the R
packages.
I use the R backported packages from here
https://cran.r-project.org/bin/linux/debian/#debian-buster-testing
for the core system.
I encounter some issues when updating quantmod, tseries and forecast.
For instance, see the following
> install.packages("tseries")
which finally fails with the
2019 Jul 07
2
Curl4, Quantmod, tseries and forecast
Dear All,
I have just upgraded to Debian stable 10 and rebuilt most of the R
packages.
I use the R backported packages from here
https://cran.r-project.org/bin/linux/debian/#debian-buster-testing
for the core system.
I encounter some issues when updating quantmod, tseries and forecast.
For instance, see the following
> install.packages("tseries")
which finally fails with the
2013 Feb 03
3
RandomForest, Party and Memory Management
Dear All,
For a data mining project, I am relying heavily on the RandomForest and
Party packages.
Due to the large size of the data set, I have often memory problems (in
particular with the Party package; RandomForest seems to use less memory).
I really have two questions at this point
1) Please see how I am using the Party and RandomForest packages. Any
comment is welcome and useful.