thr3ads.net - similar to: "Analyzing multiple text files"

Displaying 20 results from an estimated 6000 matches similar to: "Analyzing multiple text files"

2010 Jun 07

exepath error in seqaln

Hi everyone, I am using bio3d package for some sequence analysis because of its nice integrated MUSCLE execution. However, I have been encountering some errors while I was running seqaln function, which calls for installed MUSCLE program. In my case, I stored the MUSCLE program on the desktop with pathway of "./Desktop/muscle/src/muscle". I had no problem running MUSCLE directly from

bio3d package not installing

2010 Jan 03

bio3d package not installing

Is it just me, or is the bio3d package no longer available? Is there another way to do a reasonable nucleic acid sequence alignment in R? Thanks, Jonathan [[alternative HTML version deleted]]

Parsing large XML documents in R - how to optimize the speed?

2012 Aug 10

Parsing large XML documents in R - how to optimize the speed?

Hello everyone, I would like to parse very large xml files from MS/MS experiments and create R objects from their content. (By very large, I mean going up to 5-10Gb, although I am using a 'small' 40M file to test my code.) My first attempt at parsing the 40M file, using the XML package, took more than 2200 seconds and left me quite disappointed. I managed to cut that down to around 40

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

2010 Apr 08

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

Building the current release 2.7 branch on x86_64-apple-darwin10 with r81455 reverted, I get the following Polyhedron 2005 benchmark results (with no test failures)... ================================================================================ Date & Time : 7 Apr 2010 22:24:16 Test Name : llvm_gfortran_lin_p4 Compile Command : llvm-gfortran -ffast-math -funroll-loops -msse3

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

2010 Apr 08

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

On Apr 7, 2010, at 8:41 PM, Jack Howarth wrote: > Building the current release 2.7 branch on x86_64-apple-darwin10 > with r81455 reverted, I get the following Polyhedron 2005 benchmark > results (with no test failures)... Very nice! A 14% speedup on a benchmark we don't tune for isn't bad. I imagine that there are several easy wins you could get on it if you were interested

analyze amino acid sequence (composition)of proteins

2006 Jun 18

analyze amino acid sequence (composition)of proteins

Dear R-helpers: thank your for your attention. i am a newer to R and i am doing some protein category classification based on the amino acid sequence.while i have some questions urgently. 1. any packages for analysis amino acid sequence 2. given two sequences "AAA" and "BBB",how can i combine them into "AAABBB" 3. based on "AAABBB",how can i get some

Two questions about metacharacter in regexprs and function return

2011 Apr 07

Two questions about metacharacter in regexprs and function return

for the script, please kindly see the script below. At line 10 and line 13, my problems occurs. The first one is I try to retrieve the gene official name from a column of a table. The pattern of official name is something starting with gene_name. For detail problems, please see the according lines. Any suggestions are appreciated example of matching source (extract the Nnat, sometime it would

Separator with " | " for read.table

2008 Jun 16

Separator with " | " for read.table

read.csv

2024 Apr 16

read.csv

Dear R-developers, I came to a somewhat unexpected behaviour of read.csv() which is trivial but worthwhile to note -- my data involves a protein named "1433E" but to save space I drop the quote so it becomes, Gene,SNP,prot,log10p YWHAE,13:62129097_C_T,1433E,7.35 YWHAE,4:72617557_T_TA,1433E,7.73 Both read.cv() and readr::read_csv() consider prot(ein) name as (possibly confused by

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

2010 Apr 08

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

On Wed, Apr 07, 2010 at 09:54:36PM -0700, Chris Lattner wrote: > > On Apr 7, 2010, at 8:41 PM, Jack Howarth wrote: > > > Building the current release 2.7 branch on x86_64-apple-darwin10 > > with r81455 reverted, I get the following Polyhedron 2005 benchmark > > results (with no test failures)... > > Very nice! A 14% speedup on a benchmark we don't tune for

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

2010 Apr 08

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

[CCing Dale since this was his change, not mine] The change in 81455 fixes a compiler crash. It doesn't happen very often, but I can't imagine we would want to back that out. Fixing it would be a more reasonable solution. From a quick look at it, the problem is that gcc/config/darwin-c.c is registering va_opt for GC. When you build for Fortran, darwin-c.o is not linked so the GC gets

read.csv

2024 Apr 16

read.csv

?s 11:46 de 16/04/2024, jing hua zhao escreveu: > Dear R-developers, > > I came to a somewhat unexpected behaviour of read.csv() which is trivial but worthwhile to note -- my data involves a protein named "1433E" but to save space I drop the quote so it becomes, > > Gene,SNP,prot,log10p > YWHAE,13:62129097_C_T,1433E,7.35 > YWHAE,4:72617557_T_TA,1433E,7.73 >

read.csv

2024 Apr 16

read.csv

Gene names being misinterpreted by spreadsheet software (read.csv is no different) is a classic issue in bioinformatics. It seems like every practitioner ends up encountering this issue in due time. E.g. https://pubmed.ncbi.nlm.nih.gov/15214961/ https://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-1044-7 https://www.nature.com/articles/d41586-021-02211-4

GUI commands to call for a protein from protein data bank

2010 May 18

GUI commands to call for a protein from protein data bank

What I am trying to do is use GUI function, traitr, and to call for a pdb file and save it and then display it. I want to call for it by taking it from the user and then displaying it on the screen. I am having problems with that. The line pdb <- read.pdb(""ProteinCode) where proteincode should be the name of the protein, for example 1ly2, but it always ends up being protein. My

Separator argument in read.table

2008 May 29

Separator argument in read.table

Plot

2012 May 14

Plot

Hello, I am trying to make a plot of the rates of an enzyme against three different protein concentrations (there are 45 rates in total and split up into 3 groups of 15, each receiving one of the 3 protein concentrations). When I enter the following code I instead get 3 separate boxplots for each of the three different protein concentrations ... plot(rate ~ ferm, data=LDH, col=LDH$rate,

joined R-today

2009 Apr 21

joined R-today

Friends, I am a newbie to R. Just installed and started with R. I installed netcdf library (netcdf-4.0.tar.gz) and then ncdf package of R from CRAN with the following command. R CMD INSTALL --configure-args="-with-netcdf_incdir=/usr/local/netcdf/include -with-netcdf_libdir=/usr/local/netcdf/lib" ncdf_1.6.tar.gz The installation was successful. But when i try to use ncdf inside R, i

Newbie clustering/classification question

2006 Mar 26

Newbie clustering/classification question

My laboratory is measuring the abundance of various proteins in the blood from either healthy individuals or from individuals with various diseases. I would like to determine which proteins, if any, have significantly different abundances between the healthy and diseased individuals. Currently, one of my colleagues is performing an ANOVA on each protein with MS Excel. I would like to analyze

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

2010 Apr 08

[LLVMdev] darwin llvm-gfortran Polyhedron 2005 results

On Thu, Apr 08, 2010 at 08:45:48AM -0700, Bob Wilson wrote: > [CCing Dale since this was his change, not mine] > > The change in 81455 fixes a compiler crash. It doesn't happen very often, but I can't imagine we would want to back that out. Fixing it would be a more reasonable solution. From a quick look at it, the problem is that gcc/config/darwin-c.c is registering va_opt

[LLVMdev] Polyhedron 2005 regressions

2010 Sep 20

[LLVMdev] Polyhedron 2005 regressions

Comparing the Polyhedron 2005 benchmark results for gfortran from llvm-gcc-4.2 of April 7th, 2010 and September 18th, 2010 (from the rc2 2.8 release branch), we seem to be regressing in performance for this release.... ================================================================================ Date & Time : 7 Apr 2010 22:24:16 Test Name : llvm_gfortran_lin_p4 Compile Command :

similar to: Analyzing multiple text files