thr3ads.net - R help - [R] read.table mystery [Mar 2011]

If this information is useful, please help other people find it:
Share via:

Johannes Graumann

2011-Mar-06 10:04 UTC

[R] read.table mystery

Hello,

Please have a look at the code below, which I use to read in the attached 
file. As line 18 of the file reads "1065:>sp|Q9V3T9|ADRO_DROME 
NADPH:adrenodoxin oxidoreductase, mitochondrial OS=Drosophila melanogaster 
GN=dare PE=2 SV=1", I expect the code below to produce a 3 column data
frame
with most of the last column empty and line 18 to produce a data.frame row 
like so:

V1
	1065
V2	
	>sp|Q9V3T9|ADRO_DROME NADPH
V3
	adrenodoxin oxidoreductase, mitochondrial OS=Drosophila 
melanogaster GN=dare PE=2 SV=1

Why is that not so?

Thanks for any hint.

Sincerely, Joh

read.table(
  "/tmp/testfile.txt",
  sep=":",
  header=FALSE,
  quote="",
  fill=TRUE
)[19,]
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: testfile.txt
URL:
<https://stat.ethz.ch/pipermail/r-help/attachments/20110306/6bbe6a74/attachment.txt>

Sarah Goslee

2011-Mar-06 15:19 UTC

head link

[R] read.table mystery

Not so much a mystery. read.table() only looks at the first 5 lines when
decided how many columns your file has (as described in the Details
section of the help).

The easiest solution is to add a col.names argument to read.table() with
the correct number of names.

You may want to also include as.is=TRUE if you don't want your data to
be imported as factors. If you expect character but have factor you may
get unexpected results later.

Sarah

On Sun, Mar 6, 2011 at 5:04 AM, Johannes Graumann
<johannes_graumann at web.de> wrote:> Hello,
>
> Please have a look at the code below, which I use to read in the attached
> file. As line 18 of the file reads "1065:>sp|Q9V3T9|ADRO_DROME
> NADPH:adrenodoxin oxidoreductase, mitochondrial OS=Drosophila melanogaster
> GN=dare PE=2 SV=1", I expect the code below to produce a 3 column data
frame
> with most of the last column empty and line 18 to produce a data.frame row
> like so:
>
> V1
>        1065
> V2
>        >sp|Q9V3T9|ADRO_DROME NADPH
> V3
>        adrenodoxin oxidoreductase, mitochondrial OS=Drosophila
> melanogaster GN=dare PE=2 SV=1
>
> Why is that not so?
>
> Thanks for any hint.
>
> Sincerely, Joh
>
> read.table(
>  "/tmp/testfile.txt",
>  sep=":",
>  header=FALSE,
>  quote="",
>  fill=TRUE
> )[19,]
---
Sarah Goslee
http://www.functionaldiversity.org

Maybe Matching Threads

Search for more reasonably related threads

R help - Mar 2011 - read.table mystery

[R] read.table mystery

[R] read.table mystery

Maybe Matching Threads