Displaying 20 results from an estimated 1000 matches similar to: "Permuting rows of a matrix"
2010 Sep 08
3
Regression using mapply?
Hi,
I have huge matrices in which the response variable is in the first
column and the regressors are in the other columns. What I wanted to do
now is something like this:
#this is just to get an example-matrix
DataMatrix <- rep(1,1000);
Disturbance <- rnorm(900);
DataMatrix[101:1000] <- DataMatrix[101:1000]+Disturbance;
DataMatrix <- matrix(DataMatrix,ncol=10,nrow=100);
#estimate
2010 Dec 07
2
longer object length is not a multiple of shorter object length
In datamatrix[, "y"] == datamatrix[, "y"][-1] :
longer object length is not a multiple of shorter object length
out = c(FALSE,datamatrix[,'y'] == datamatrix[,'y'][-1])
and I do not know why I get that error, the resulting out matrix is somehow
one row larger than datamatrix...
all I try to do is filter matrix by dropping rows where [,'y'][-1] ==
2007 Feb 13
1
Questions about results from PCAproj for robust principal component analysis
Hi.
I have been looking at the PCAproj function in package pcaPP (R 2.4.1) for
robust principal components, and I'm trying to interpret the results. I
started with a data matrix of dimensions RxC (R is the number of rows /
observations, C the number of columns / variables). PCAproj returns a list
of class princomp, similar to the output of the function princomp. In a
case where I can
2008 Mar 03
2
handling big data set in R
Hello R users,
I'm wondering whether it is possible to manage big data set in R? I
have a data set with 3 million rows and 3 columns (X,Y,Z), where X is
the group id. For each X, I need to run 2 regression on the submatrix.
I used the function "split":
datamatrix<-read.csv("datas.csv", header=F, sep=",")
dim(datamatrix)
# [1] 2980523 3
2009 May 14
3
memory usage grows too fast
Hi All,
I have a 1000x1000000 matrix.
The calculation I would like to do is actually very simple: for each row, calculate the frequency of a given pattern. For example, a toy dataset is as follows.
Col1 Col2 Col3 Col4
01 02 02 00 => Freq of ?02? is 0.5
02 02 02 01 => Freq of ?02? is 0.75
00 02 01 01 ?
My code is quite simple as the following to find the pattern ?02?.
2004 Jul 13
5
Help with factanal and missing values
Hi list,
I'm performing a series of confirmatory factor analysis on different
groupings of items from data collected with questionnaires. There are some
missing values.
For those sets with no missing values I call
factanal(datamatrix,factors=n)
where datamatrix is a table of all observations for the items under
investigation.
This call fails when there are missing values.
help(factanal)
2004 Apr 27
1
beginners k means clustering question
Hi all,
I am wandering.. is it possible to cluster data which is in a single
column ?
for example.. I have some data as follows:
4013
7362
7585
9304
11879
14785
21795
30500
30669
30924
33988
36975
40422
42911
50501
51593
53729
54338
55497
57337
61993
62601
66229
69815
69933
70760
71340
75921
83972
90134
91061
.
.
.
is it possible to cluster this data since it is in a single column ?
I have
2009 Jan 20
1
heatmap.2 color issue
Dear All:
I tried to use heatmap.2 to generate hierarchical clustering using the following command:
heatmap.2(datamatrix, scale="row", trace="none", col=greenred(256), labRow=genelist[,1], margins=c(10,10), Rowv=TRUE, Colv=TRUE)
datamatrix is subset of a RMA normalized data subset by a genelist.
The problem is a lot of times, the z-score in key are from, like -5 to 15 or
2012 Jul 24
4
ERROR : cannot allocate vector of size (in MB & GB)
Hi,
Here in R, I need to load a huge file(.csv) , its size is 200MB. [may come
more than 1GB sometimes].
When i tried to load into a variable it taking too much of time and after
that when i do cbind by groups,
getting an error like this
" Error: cannot allocate vector of size 82.4 Mb "
My requirement is, spilt data from Huge-size-file(.csv) to no. of small csv
files.
Here i will give
2012 Aug 10
1
Split CSV as per file size
Hi here i have a code to split a csv file as per group of line.
The code given below,
------------------------------------
SplitCSVByLine <- function(DataMatrix,Destination,NoOfLineToGroup)
{
input <- file(DataMatrix, "r")
fileNo <- 1
repeat
{
myLines <- readLines(input, n=NoOfLineToGroup)
if (length(myLines) == 0) break
2009 Jul 03
1
fix() and edit() not working with Rcmdr and german LANG-variable
Dear Mailinglist,
I just set up an R 2.9.1 environment with Rcmdr 1.4-6 on Ubuntu Jaunty
9.04. As I'm from Germany my $LANG variable is set to "de_DE.UTF-8".
Now, when I open up Rcmdr and try to edit a new datamatrix there is no
edit window appearing:
Datenmatrix <- edit(as.data.frame(NULL))
ERROR: invalid device
In addition, there are plenty of warning messages in the
2011 Nov 21
5
R ignores number only with a nine under 10000
Hello R users,
I'm trying to replace numerical values in a datamatrix with strings. R does
this except for numbers under 10000 starting with a 9 (eg 98, 970, 9504
etc). This is really weird and I wondered whether someone had encountered
such a problem or knows the solution. I'm using the next script:
test_1 <- read.table("5+ref_151111clusters3.csv", header = TRUE, sep =
2005 Mar 10
2
Logistic regression goodness of fit tests
I was unsure of what suitable goodness-of-fit tests existed in R for logistic regression. After searching the R-help archive I found that using the Design models and resid, could be used to calculate this as follows:
d <- datadist(mydataframe)
options(datadist = 'd')
fit <- lrm(response ~ predictor1 + predictor2..., data=mydataframe, x =T, y=T)
resid(fit, 'gof').
I set up a
2006 May 09
1
visualisation of Self organising map
Hello R users,
I'm using SOM() to cluster a gene expression data set
the syntax i used was
dataGrid <- c(somgrid(xdim = 3, ydim = 3, topo = c("rectangular","hexagonal")))
dataClusters <- SOM(dataMatrix, grid = dataGrid)
plot(dataClusters)
it seems that this works just fine but the thing i can't figure out is
how to determine where each data point has been
2009 Jan 14
2
Vectorization of three embedded loops
Dear R-programmer,
I wrote an adapted implementation of the Kennard-Stone algorithm for
sample selection of multivariate data (R 2.7.1 under MacBook Pro,
Processor 2.2 GHz Intel Core 2 Duo, Memory 2 GB 667 MHZ DDR2 SDRAM).
I used for the heart of the script three embedded loops. This makes it
especially for huge datasets very slow. For a datamatrix of 1853*1853
and the selection of 556
2009 Aug 24
2
Creating a simple line graph
Hey everyone,
Sorry for yet another simple question but hopefully it makes whoever
comes up with the answer feel good about helping others. I would like to
simply plot the following two sets of data in a line graph. The one set is
an observed set of points and the latter is the predicted. I have looked
through the documentation (which makes any graphing very complicated to me)
but i havent
2023 Jan 29
2
[REGRESSION] GM20B probe fails after commit 2541626cfb79
On Fri, 27 Jan 2023 at 20:42, Diogo Ivo <diogo.ivo at tecnico.ulisboa.pt> wrote:
>
> On Fri, Jan 27, 2023 at 04:00:59PM +1000, Ben Skeggs wrote:
> > On Fri, 20 Jan 2023 at 21:37, Diogo Ivo <diogo.ivo at tecnico.ulisboa.pt> wrote:
> > >
> > > On Wed, Jan 18, 2023 at 11:28:49AM +1000, Ben Skeggs wrote:
> > > > On Mon, 16 Jan 2023 at 22:27, Diogo
2012 Mar 30
1
Help with the lumi R package
Hi all,
My name is Amy, I am a masters student in Bioinformatics at North Carolina
State University. I am working on a project and I am trying to use the lumi
R package for microarray data analysis. I have shown the sample code here
and have questions about modifying the sample code for my own data.
lumi package in R, example.lumi, the sample data has 8000 features and 4
samples
I have
2007 Oct 22
3
median value dataframe coming from multiple dataframes
Hi all,
I am not a skillful R programmer and has I am handling with large dataframes (about 30000 x 300) I am in need of an efficient function.
I have 4 dataframes with the same dimension. I need to generate other dataframe with the some dimension than the others where in each position it has the median value of the 4 values in the same position coming from the 4 dataframes.
Grateful by your
2012 Dec 05
8
Error: Could not request certificate: Connection refused - connect(2)
Hi all,
I am new at puppet and Im using the puppet learning tutorial. I execute
until Basic agent/master puppet. Everything worked ok but after 2 or 3
days, running the command "puppet agent --test", start throw the error in
subject. That left me with the following questions:
how to puppet agent connect to master without service started on master?
If this connection is possible (