I've tried colClasses="character", fill=T, as.is=T, header=F,
sep="\t", read.csv; read.delim, read.csv2, read.delim2.... don't know
what else to try.

On Sat, May 9, 2015 at 11:13 AM, MacQueen, Don <macqueen1 at llnl.gov> wrote:
> Some indication of what you have tried would be useful. Assuming you are
> using read.table(), then the "fill" argument of read.table() might be what
> you need. If you look at the help for read.table you will find:
>
> From ?read.table:
>     fill: logical. If 'TRUE' then in case the rows have unequal length,
>           blank fields are implicitly added. See 'Details'.
>
> --
> Don MacQueen
>
> Lawrence Livermore National Laboratory
> 7000 East Ave., L-627
> Livermore, CA 94550
> 925-423-1062
>
> On 5/9/15, 7:59 AM, "Kate Ignatius" <kate.ignatius at gmail.com> wrote:
>
>>I have some data that I've trouble importing...
>>
>>A B C D E
>>A 1232 0.565
>>B 2323 0.5656 0.5656 0.5656
>>C 2323 0.5656
>>D 2323 0.5656
>>E 2323 0.5656
>>F 2323 0.5656
>>G 2323 0.5656
>>G 2323 0.5656 0.5656 0.5656
>>
>>When I input the data it seems to go like this:
>>
>>SampleID ItemB ItemC ItemD ItemE
>>A 1232 0.565
>>B 2323 0.5656
>>0.5656 0.5656
>>C 2323 0.5656
>>D 2323 0.5656
>>E 2323 0.5656
>>F 2323 0.5656
>>G 2323 0.5656
>>G 2323 0.5656 0.5656 0.5656
>>
>>with the last two columns (or the two columns with vast amounts of
>>missing data which are usually the last two = see SampleB) wrapping
>>around - is there a way to prevent this?
>>
>>Thanks!
>>
>>______________________________________________
>>R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>>https://stat.ethz.ch/mailman/listinfo/r-help
>>PLEASE do read the posting guide
>>http://www.R-project.org/posting-guide.html
>>and provide commented, minimal, self-contained, reproducible code.

txt <- c("A B C D E", "A 1232 0.565", "B 2323 0.5656 0.5656 0.5656",
   "C 2323 0.5656", "D 2323 0.5656", "E 2323 0.5656", "F 2323 0.5656",
   "G 2323 0.5656", "G 2323 0.5656 0.5656 0.5656")
z <- read.table(text=txt, fill=TRUE, header=TRUE)
str(z)
#'data.frame': 8 obs. of 5 variables:
# $ A: Factor w/ 7 levels "A","B","C","D",..: 1 2 3 4 5 6 7 7
# $ B: int 1232 2323 2323 2323 2323 2323 2323 2323
# $ C: num 0.565 0.566 0.566 0.566 0.566 ...
# $ D: num NA 0.566 NA NA NA ...
# $ E: num NA 0.566 NA NA NA ...
If your file contains extra spaces or tabs at the ends of the lines it
is possible that using sep="\t" may mess things up: using an explicit
'sep' argument means that you have to use it consistently as a
separator.
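
One way to see the effect of an explicit 'sep' is to count the fields on each line before importing. The following is only a sketch: the three TAB-delimited lines and the temporary file are made up for illustration (they are not the poster's data), and the second line deliberately ends in a stray TAB.

```r
# Hypothetical TAB-delimited lines; the second one ends in a stray TAB.
txt <- c("A\tB\tC", "a\t1\t2\t", "b\t3\t4")

# A temporary file stands in for the real data file.
path <- tempfile(fileext = ".txt")
writeLines(txt, path)

# With an explicit sep, every TAB starts a new field, so the trailing TAB
# gives the second line one more (empty) field than the others.
n_tab <- count.fields(path, sep = "\t")

# With the default sep = "", any run of white space is a single separator
# and trailing white space does not start a new field.
n_ws <- count.fields(path)

print(n_tab)
print(n_ws)
```

Running count.fields() on the real file with the same 'sep' that is passed to read.table() shows which lines have more or fewer fields than the header.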
Bill Dunlap
TIBCO Software
wdunlap tibco.com
On Sat, May 9, 2015 at 8:22 AM, Kate Ignatius <kate.ignatius at gmail.com> wrote:
> I've tried colClasses="character", fill=T, as.is=T, header=F,
> sep="\t", read.csv; read.delim, read.csv2, read.delim2.... don't know
> what else to try.

Hi Kate,
The following:

x<-read.table(text="A B C D E
A 1232 0.565
B 2323 0.5656 0.5656 0.5656
C 2323 0.5656
D 2323 0.5656
E 2323 0.5656
F 2323 0.5656
G 2323 0.5656
G 2323 0.5656 0.5656 0.5656",header=TRUE,fill=TRUE)

works fine for me:

x
  A    B      C      D      E
1 A 1232 0.5650     NA     NA
2 B 2323 0.5656 0.5656 0.5656
3 C 2323 0.5656     NA     NA
4 D 2323 0.5656     NA     NA
5 E 2323 0.5656     NA     NA
6 F 2323 0.5656     NA     NA
7 G 2323 0.5656     NA     NA
8 G 2323 0.5656 0.5656 0.5656

I just copied your example text into a text editor and added the
"x<-read.table...". Do you really have TAB delimiters in the file you
are trying to import? If so, can you do a global replace of TAB ->
space in the incoming file?

Jim

On Sun, May 10, 2015 at 2:43 AM, William Dunlap <wdunlap at tibco.com> wrote:
> If your file contains extra spaces or tabs at the ends of the lines it
> is possible that using sep="\t" may mess things up: using an explicit
> 'sep' argument means that you have to use it consistently as a separator.
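
The global TAB -> space replacement can also be done inside R rather than in a text editor. This is a minimal sketch, assuming the file really is TAB-delimited and that the missing values only occur at the ends of rows; the temporary file and its four lines are hypothetical stand-ins for the real data.

```r
# Hypothetical input: a few of the example rows, TAB-delimited, written to
# a temporary file that stands in for the real data file.
path <- tempfile(fileext = ".txt")
writeLines(c("A\tB\tC\tD\tE",
             "A\t1232\t0.565",
             "B\t2323\t0.5656\t0.5656\t0.5656",
             "C\t2323\t0.5656"), path)

# Read the raw lines and replace every TAB with a single space.
raw <- readLines(path)
clean <- gsub("\t", " ", raw, fixed = TRUE)

# Default white-space separation; fill = TRUE pads the short rows with NA.
x <- read.table(text = clean, header = TRUE, fill = TRUE)
print(x)
```

With white space as the separator, two adjacent TABs (an empty field in the middle of a row) would collapse into one separator and shift the later values left, so this only suits files where the blanks are trailing, as in the example data.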