Displaying 20 results from an estimated 1000 matches similar to: "tapply for enormous (>2^31 row) matrices"
2017 Jul 03
0
R memory limits on table(x, y) (and bigtabulate)
Sorry, don't know enough to give you trustworthy answers, but I can
say that crashes due to (or linked to) packages should usually be
reported to the package maintainer, who can be found by the
?maintainer function. That person may not monitor this list.
Cheers,
Bert
Bert Gunter
"The trouble with having an open mind is that people keep coming along
and sticking things into it."
2017 Jul 03
2
R memory limits on table(x, y) (and bigtabulate)
I have two character vectors x and y that have the following characteristics:
length(x) # same as
length(y) # 872099
length(unique(x)) # 47740
length(unique(y)) # 52478
I need to crosstabulate them, which would lead to a table with
47740*52478 # 2505299720
cells, which is more than
2^31 # 2147483648
cells, which seems to be R's limit because I am getting the error message
Error in
2009 Oct 21
0
drawing a line indicating extent of each factored data series in multipanel lattice xyplot
Hi,
Am am plotting aggregated frequency data (extracted from an RDBMS)
relating to DNA sequence features for each of the human chromosomes as
described in the table chromosomes below (the frequency data is in a
table 'hits' that has a value (or not) for each of a set of bins across
each chromosome).
I would like to mark the extent of the chromosome (according to the
length value in
2009 Nov 20
1
how to specify the order of panels with xyplot
> chromosomes
id refseq name length
1 0 NC_000001.9 Homo sapiens chromosome 1 247249719
2 1 NC_000002.10 Homo sapiens chromosome 2 242951149
3 2 NC_000003.10 Homo sapiens chromosome 3 199501827
4 3 NC_000004.10 Homo sapiens chromosome 4 191273063
5 4 NC_000005.8 Homo sapiens chromosome 5 180857866
6 5 NC_000006.10 Homo sapiens chromosome 6
2004 Aug 06
1
questions related to ploting in R
Dear all.
I need to draw a scatter plot of 23 chromosome copy numbers (y axes) against
chromosome and physical location within each chromosome in one plot. The
data matrix looks as below:
chr location copy_num
1 118345 1.320118
1 3776202 1.133879
1 4798845 0.989997
1 5350951 1.100967
. more data here
.
.
2 118345 2.459119
2 157739 1.915919
2 1530065 1.924372
2
2012 Feb 29
0
Question about tables in bigtabulate
I have a large file backed big. matrix, with millions of rows and 20
columns.
The columns contain data that I simply need to tabulate. There are a few
dozen unique
values. and I just want a frequency count
Test code with a small "big" matrix.
library(bigmemory)
library(bigtabulate)
test <- big.matrix(nrow = 100, ncol = 10)
test[,1:3]<- sample(150)
test[,4:6]<-
2009 Dec 04
1
Lattice : Help with changing the labels of x-axis in respective panels
Dear R-Helpers,
I am not very experienced in using lattice and I am still in the learning
stage
I have a data set which looks like this: (I have deleted a few lines in
order to save space)
Chromosome marker Marker.Name Distance
1 1 1 PeMm261 0.0000
2 1 2 Xtxp8 10.1013
..
20 1 20 EbMi148 210.3099
21 1 21 Xtxp25
2008 Jul 09
2
replacing value in column of data frame
Dear all,
Probably a very basic question but I need some help.
I have a data frame (made by read.table from a text file) of microarray data, of which the first column is a factor and the rest of the columns are numeric.
The factor column contains chromosome names, so values 1 through 22 plus X, Y and XY. The numeric columns contain positions or intensity measurements.
What I need to do is
2011 Dec 09
1
minor allele frequency comparison
Hi all,
We are using two methods to identify SNPs. One is based on resequencing
the genome and aligning the reads to the sequenced genome to identify SNPs
(data available for 44 individuals). Another is based on SNP array with
selected loci (30000 loci, 870 individuals). I want to compare the results
from the resequencing based minor allele frequency and Array based minor
allele frequency.
2009 Jan 16
0
R-help Digest, Vol 71, Issue 16
On Fri, 16 Jan 2009, r-help-request at r-project.org wrote:
> Date: Thu, 15 Jan 2009 13:29:03 +0100
> From: Pablo G Goicoechea <pgoikoetxea at neiker.net>
> Subject: Re: [R] How to create a chromosome location map by locus ID
> To: Sake <tlep.nav.ekas at hccnet.nl>
> Cc: r-help at r-project.org
> Message-ID: <496F2C0F.3040304 at neiker.net>
> Content-Type:
2003 Jan 16
1
graphics
Dear R community,
I need to plot the results of some simulations I did using QTL
Cartographer. I am plotting LOD scores over three chromosomes. The three
plot have to be one next to the other.
The procedure I am using is:
par(mfrow=c(1,3))
plot(x$x, x$y, ylim=c(0,35), type="l", col="blue", las=1, xaxs="i",
yaxs="i", xlab="X Chromosome",
2017 Sep 04
1
Merge by Range in R
Hi,?
I have two big data set.?
data _1 :?
> dim(data_1)
[1] 15820 5
> head(data_1)
? ?Chromosome ?????Start????????End????????Feature GroupA_3
1: ? ? ? ????????chr1 521369 ?750000 ????chr1-0001 ? ?????0.170
2: ? ? ? ????????chr1 750001 ?800000 ????chr1-0002 ? ????-0.086
3: ? ? ? ????????chr1 800001 ?850000 ????chr1-0003 ? ?????0.006
4: ? ? ? ????????chr1 850001 ?900000 ????chr1-0004 ?
2007 Apr 02
3
Random number from density()
Hello,
I'm writing some genetic simulations in R where I would like to place
genes along a chromosome proportional to the density of markers in a
given region. For example, a chromosome can be presented as a list of
marker locations:
Chr1<-c(0, 6.5, 17.5, 26.2, 30.5, 36.4, 44.8, 45.7, 47.8, 48.7, 49.2,
50.9, 52.9, 54.5, 56.5, 58.9, 61.2, 64.1)
Where the numbers refer to the locations of
2018 May 15
0
find common regions between two kinds of tests
Dear R community,
For 100 sites at human chromosomes, I ran two tests, one is to consider an experiment measurement as a continuous variable, so doing multiple regression; the other is to compare top 25% samples to bottom 25% samples based on values of the measured variable, so categorical analysis. A total of 16 sites show significance; In the following results, I only show five variables (
2012 Sep 10
3
plot: x and y chromosomes are missing
Hello,
I have a list with gene names, fold changes (=expression level) and
chromosomes.
Names fold change chromosome
hz 1.5 2
If I plot fold change versus chromosome (or vice versa):
plot (ch, fc)
I see only the chromosomes with numbers but not those with letter (x and
y). What can I do?
A second question:
How can I add a single line in that plot at a certain
2010 Jun 01
5
Help barplots
Dear All,
I am newbie to R, and I wanted to plot a barplots with R and in such a way
that It will also show me position which I can plot on the bar line.
Here is my code that I am using to plot,
> chromosome <- c(40.2, 35.6, 36.1, 29.6, 31, 29.6, 31, 29.4, 28.2, 23, 23,
28.2)
>barplot (chromosome, col="purple", xlab="Oryza sativa Chromosomes", border
= NA, space =
2009 Mar 23
1
changing order of lattice plots
Hi,
This is another question relating to my 2 factor figure.
densityplot(~End-Begin | Type * Chromosome, data=Mon, layout=c(5,12),
xlab="Element Length",type="percent", col="grey60",
strip=strip.custom(style=3, bg="grey90", par.strip.text=list(cex=0.5)))
I would like to flip the plot so those at the bottom are at the top and
so on. I have tried using a
2007 Nov 18
1
how to sort a data.frame by ascending some columns
Dear list,
I have a data frame (238304 rows and 6 columns). I want the data frame
sorted by two columns in ascending order. I am showing
the first 5 rows of the data frame
> clones.info[1:5,1:6]
USER_CLONE_ID CHROMOSOME Expr1002 KB_POSITION Allele_A WELL_ID
1 SNP_A-1855402 17 41419603 41419603 C rs17572851
2 SNP_A-4249904 17 41420045 41420045 A rs17572893
3 SNP_A-2174835 18 41407760
2011 Aug 24
0
lodplot help
I have a data frame (narrow) with 431 rows and 6 columns containing information on chromosome, position, lod1, lod2, lod3, lod4, looking like this:
> narrow
chr pos lod1 lod2 lod3 lod4
1 1 3.456 -0.025 -0.003 -0.209 -0.057
2 1 5.697 -0.029 -0.005 -0.200 -0.058
3 1 8.434 -0.049 -0.012 -0.247 -0.092
4 1 9.466 -0.074 -0.025 -0.300 -0.136
5 1 9.706
2010 Nov 07
1
How do I order xyplot line points?
I have the following xyplot figure:
http://img577.imageshack.us/img577/686/filesizeresults12000000.png
The data are organized in a matrix file as follows:
Type Elements Chromosome Time
bedGz 12000000 chr1 14.240
bedGz 12000000 chr2 7.949
bedGz 12000000 chr3 5.103
bedGz 12000000 chr4 5.290
bedGz 12000000 chr5 5.161
...
The x-axis labels in the Chromosome column are ordered