Displaying 1 result from an estimated 1 matches for "comp215489_c0_seq40".
2012 Dec 04
2
problem with factor levels
Hi
I have a data.frame with 371,718 obs. of 12 variables (see below for
an str). My problem is with V1, a Factor w/ 93144 levels, there should
actually be 93994 levels. Each entry looks like:
comp[number]_c[number]_seq[number]
for example
comp215489_c0_seq40
R is grouping as though the last number is a decimal for some reason, in
other words comp215489_c0_seq40 and comp215489_c0_seq4 are considered to be
the same. My problem is that they are not the same so when I group by this
factor I am losing 800 levels.
Here is an str
'data.frame': 37171...