thr3ads.net - similar to: "Idea/package to "linearize a curve" along the diagonal?"

Displaying 20 results from an estimated 8000 matches similar to: "Idea/package to "linearize a curve" along the diagonal?"

Retrieve indexes of the "first occurrence of numbers" in an effective manner

2012 Dec 27

Hi, That sounds simple but I cannot think of a really fast way of getting the following: c(1,1,2,2,3,3,4,4) would give c(1,3,5,7) i.e., a function that returns the indexes of the first occurrences of numbers. Note that numbers may have any order e.g., c(3,4,1,2,1,1,2,3,5), can be very large, and the vectors are also very large (which prohibits any loop). The best I could think of is: tmp =

How to plot diagonal line at any coordinate range in R

2012 Mar 12

Dear expert, how can we plot a diagonal line (from the bottom-left corner to the top-right corner) at any given coordinate range? For example: > plot(c(-2,3), c(-1,5), type = "n", xlab="x", ylab="y", asp = 1) or > plot(c(0,1000), c(0,334), type = "n", xlab="x", ylab="y", asp = 1) I tried abline with the following but failed:

FYI : XML 3.4.2 still breaks odfWeave 0.7.17

2011 Aug 14

Dear list, perusing the GMane archive shows that the issue with XML 3.4.x still bugs odfWeave users. I just checked that the newer XML 3.4.2 version still gives the same problem. Using it to weave a bit of documentation written with LibreOffice 3.3.3 (current in Debian testing) leads me to a 192 Kb, 590-page document, whereas reinstalling XML 3.2.0 gives me a 4-page, 24 Kb file (not

Which non-parametric regression would allow fitting this type of data? (example given).

2012 Mar 11

Hi, I'm wondering which function would allow fitting this type of data: tmp=rnorm(2000) X.1 = 5+tmp Y.1 = 5+ (5*tmp+rnorm(2000)) tmp=rnorm(100) X.2 = 9+tmp Y.2 = 40+ (1.5*tmp+rnorm(100)) X.3 = 7+ 0.5*runif(500) Y.3 = 15+20*runif(500) X = c(X.1,X.2,X.3) Y = c(Y.1,Y.2,Y.3) plot(X,Y) The problem with loess is that distances for the "goodness of

group bunch of lines in a data.frame, an additional requirement

2006 Sep 13

Thanks for pointing out "aggregate" to me, that works fine! There is one complication though: I have mixed types (numerical and character), so the matrix is of the form:

A 1.0 200 ID1
A 3.0 800 ID1
A 2.0 200 ID1
B 0.5 20 ID2
B 0.9 50 ID2
C 5.0 70 ID1

One letter always has the same ID, but one ID can be shared by many letters (like ID1). I just want to keep track of the ID, and get

Finding (swapped) repetitions of numbers pairs across two columns

2012 Dec 27

Hi, I've had this problem for a while and tackled it in a quite dirty way, so I'm wondering if a better solution exists. If we have two vectors: v1 = c(0,1,2,3,4) v2 = c(5,3,2,1,0) How to remove one instance of the "3,1" / "1,3" double? At the moment I'm using the following solution, which is quite horrible: v1 = c(0,1,2,3,4) v2 = c(5,3,2,1,0) ft <-

How to "flatten" a multidimensional array into a dataframe?

2012 Apr 19

Hi, I have a three dimensional array, e.g., my.array = array(0, dim=c(2,3,4), dimnames=list( d1=c("A1","A2"), d2=c("B1","B2","B3"), d3=c("C1","C2","C3","C4")) ) what I would like to get is then a dataframe: d1 d2 d3 value A1 B1 C1 0 A2 B1 C1 0 . . . A2 B3 C4 0 I'm sure there is one function to do

CentOS 6.5: NFS server crashes with list_add corruption errors

2014 Jan 30

Hi, I'm running CentOS 6.5 as an NFS server (v3 and v4) and exporting Ext4 and XFS filesystems. After many months of everything working fine, today the server crashed: Jan 30 09:46:13 qb-storage kernel: ------------[ cut here ]------------ Jan 30 09:46:13 qb-storage kernel: WARNING: at lib/list_debug.c:26 __list_add+0x6d/0xa0() (Not tainted) Jan 30 09:46:13 qb-storage kernel: Hardware name: PowerEdge

which(df$name=="A") takes ~1 second! (df is very large), but can it be speeded up?

2008 Aug 12

Dear All, I have a large data frame ( 2700000 lines and 14 columns), and I would like to extract the information in a particular way illustrated below: Given a data frame "df": > col1=sample(c(0,1),10, rep=T) > names = factor(c(rep("A",5),rep("B",5))) > df = data.frame(names,col1) > df names col1 1 A 1 2 A 0 3 A 1 4 A

Random sampling while keeping distribution of nearest neighbor distances constant.

2009 Aug 12

Dear All, I cannot find a solution to the following problem although I imagine that it is a classic, hence my email. I have a vector V of X values between 1 and N. I would like to get random samples of X values, also between 1 and N, but the important point is: * I would like to keep the same distribution of distances between the X values * For example let's say N=10 and

A particular shuffling on a vector

2007 Apr 20

Hello, I was wondering if anyone can think of a straightforward way (without loops) to do the following shuffling: Let's imagine a vector: c(1,1,1,2,2,3,3,3) I would like to derive shuffled vectors __where the same digits are never separated__, although they can be at both ends (periodicity). So the following shuffled vectors are possible: c(2,2,1,1,1,3,3,3) c(2,1,1,1,3,3,3,2)

(Most efficient) way to make random sequences of random sequences

2007 Aug 21

Hi, I was wondering what would be the (most efficient) way to generate a sequence of sequences. I mean: if I have 1, 2 and 3, I'd like to generate a sequence of length N*3 (N ~ 1,000,000 or more) where random permutations of the sequence 1,2,3 follow each other, i.e. 1,2,3,1,3,2,3,2,1 /!\ The thing is that there should never be the same number twice in the same sub-sequence,

Best HMM package to generate random (protein) sequences?

2011 Mar 22

Dear All, I would like to generate random protein sequences using an HMM. Has anybody done that before, or would you have any idea which package is likely to be best for that? The important facts are that the HMM will be fitted on ~3 million sequential observations, with 20 different states (one for each amino acid). I guess that 2-5 hidden states should be enough, and an order of 3 would

How to re-order clusters of hclust output?

2012 May 11

Hello, The heatmap function conveniently has a "reorder.dendrogram" function so that clusters follow a certain logic. It seems that the hclust function doesn't have such a feature. I can use the "reorder" function on the dendrogram obtained from hclust, but this does not modify the hclust object itself. I understand that the answer should be within the "heatmap"

Mclust problem with mclust1Dplot: Error in to - from : non-numeric argument to binary operator

2008 Oct 20

Dear list members, I am using Mclust in order to deconvolute a distribution that I believe is a sum of two gaussians. First I can make a model: > my.data.model = Mclust(my.data, modelNames=c("E"), warn=T, G=1:3) But then, when I try to plot the result, I get the following error: > mclust1Dplot(my.data.model, parameters = my.data.model$parameters, what = "density")

Quickbooks revisited

2003 Jun 09

I did testing on generating this report from a 130 MB Quickbooks file. Locally, 15 seconds. Samba Server, 1:35. Shared off of my win98 Workstation, 2+ mins. I do not understand the reason it is SOO slow. I know it is a fairly large file. The only explanation I can come up with is the way QuickBooks is accessing the files. Maybe jumping all around the file, requiring many operations on the

Samba + Quickbooks

2002 May 10

This is set up with Win2K boxes sharing a QB file on a SAMBA share. It may be quite different than with 95-98x-ME(Miserable Edition). There are two or more problems - one thing I identified is that it is possible for two computers to connect to the QB(QuickBooks) share using the same quick books logon - something that will cause data corruption. (Something that could have been easily

problem with PDF/postscript, cannot change paper size: "‘mode(width)’ and ‘mode(height)’ differ between new and previous"

2010 Nov 16

Hi, The pdf function would not let me change the paper size and gives me the following warning: pdf("figure.pdf", width="6", height="10") Warning message: ‘mode(width)’ and ‘mode(height)’ differ between new and previous ==> NOT changing ‘width’ & ‘height’ If I use the option paper = "a4r", it does not give me a warning but still prints on a

Slow performance with QuickBooks

2003 May 28

I have spent much of the day today researching performance tuning with samba. I have tried everything that I can find out about how to make performance faster. I checked disk performance with Bonnie, installed FTP and tested a transfer that way, achieving 6-7 MB / second. about 30 seconds for 150 MB file. I was finally able to achieve those speeds on a file transfer to the server through

Directory and File Perms

2009 Apr 29

I have a directory shared out via Samba for Quickbooks and seem to have some issues with permissions. The directory being shared is a subdirectory in an ext3 partition being mounted with the acl option. It has been set up as follows: chown root:DOMAIN\AD_Group /mnt/Intuit_Data/ chmod 2770 /mnt/Intuit_Data/ And the Samba share config has: create mask = 0660 directory mask =
