Displaying 20 results from an estimated 7000 matches similar to: "str(data.frame) after subsetting reflects original structure, not subsetted structure?"
2007 Dec 09
2
adjusting "levels" after subset a table
An embedded text with no specified character set was attached...
Name: not available
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20071208/5409f1a7/attachment.pl
2009 Nov 10
3
drop unused levels in subset.data.frame
Dear list,
subset has a 'drop' argument that I had often mistaken for the one in
[.factor which removes unused levels.
Clearly it doesn't work that way, as shown below,
d <- data.frame(x = factor(letters[1:15]), y = factor(LETTERS[1:3]))
s <- subset(d, y=="A", drop=TRUE)
str(s)
'data.frame': 5 obs. of 2 variables:
$ x: Factor w/ 15 levels
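A minimal sketch of one common way to get rid of the unused levels, assuming R 2.12.0 or later, where `droplevels()` is available in base R. The names `s2` and the level counts below are illustrative; the data frame is the one from the post, and `drop = TRUE` is left out because it does not affect factor levels here.

```r
# Data frame from the post: y is recycled, so rows 1, 4, 7, 10, 13 have y == "A"
d <- data.frame(x = factor(letters[1:15]), y = factor(LETTERS[1:3]))

# subset() keeps all original levels on every factor column
s <- subset(d, y == "A")
nlevels(s$x)   # 15: the unused levels are still there

# droplevels() on a data frame drops unused levels from each factor column
s2 <- droplevels(s)
nlevels(s2$x)  # 5
nlevels(s2$y)  # 1
```

`droplevels()` returns a new object, so the result has to be assigned; it does not modify `s` in place.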
2008 Apr 01
1
Subset: data frames and factor levels
Hi!
I'm doing:
etni <- subset(etni, NAMECOM!="Maniquisito")
where etni is a data.frame, NAMECOM a factor and "Maniquisito" labels
a row that I want to delete.
The problem is that while the row is deleted, the factor level is still
there (I can see "Maniquisito" if I do levels(etni$NAMECOM) ).
I know I can get the appropriate levels by:
2009 Aug 07
3
bug with subset and plot?
Hi!
I have the following problem that I believe is a bug:
I have a dataframe with one categorical and one numerical vector. The
categorical vector has three levels (uc, up and vc). A plot of the
vectors with the categorical vector on the x-axis gives a boxplot with
three boxes - exactly as expected.
If I then use the subset function to make a dataset that only includes
two of the levels of
2009 May 27
2
Factor level with no cases shows up in a plot
Consider this data structure (df1) ...
      Group Year PctProf FullYr
1  Never RF 2004      87     88
2  Cohort 1 2004      83     84
3  Cohort 2 2004      84     86
4  Cohort 3 2004      87     87
5  Cohort 4 2004      73     74
6  Never RF 2005      85     86
7  Cohort 1 2005      81     82
8  Cohort 2 2005      81     81
9  Cohort 3 2005      78     79
10 Cohort 4 2005      72     74
11
2009 Jul 30
4
edit.row.names taking row names from the edited dataframe
Hi all,
I am struggling to work out how to use the rownames from an edited dataframe
rather than the row names from the original dataframe. In my data set I'm
trying to extract several rows of data on particular individuals. I don't
doubt I'm using the long way round, but what I have in the way of a script is
this:
##selecting the IDs from the dataframe individually
2009 Nov 16
4
Where are usages like "== 2L" documented?
Gurus:
I keep seeing other people's code that contains ideas like
If (x == 2L)
X[-1L]
X - 1L
I have some idea of what's going on, but where is the use of concepts like
"2L" documented?
Thanks, Bryan
*************
Bryan Hanson
Acting Chair
Professor of Chemistry & Biochemistry
DePauw University, Greencastle IN USA
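A short sketch of what the `L` suffix does, for readers landing on this thread. In base R the suffix marks an integer constant (it is described on the `?NumericConstants` help page); the variable names below are only for illustration.

```r
# A bare number is a double; the L suffix makes it an integer constant
typeof(2)          # "double"
typeof(2L)         # "integer"

# The two compare equal by value but are not identical objects
2L == 2            # TRUE
identical(2L, 2)   # FALSE

# Integer arithmetic stays integer; mixing with a double promotes to double
x <- 5L
typeof(x - 1L)     # "integer"
typeof(x - 1)      # "double"
```

Code that writes `x == 2L` or `X[-1L]` is usually just avoiding an unnecessary double-to-integer conversion; the results are the same as with `2` and `-1`.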
2008 Mar 30
2
Definition of "wrapper"?
I think I more or less understand what a "wrapper" is, but I'd like to hear
how more experienced R users define it, and especially I'd like to know if
there is a formal definition. In my reading, it seems like there are a
fairly wide range of meanings, but they are all conceptually similar.
I've looked in a couple of the classic R texts, the extensions and
developers' manuals, and R
2007 Feb 14
1
se.contrast confusion
Hello,
I've got what I'd expect to be a pretty simple issue: I fit an aov object
using multiple error strata, and would like some significance tests for the
contrasts I specified.
In this contrived example, I model some test score as the interaction of a
subject's gender and two emotion variables (angry, happy, neutral), measured
at entry to the experiment (entry) and later
2010 Nov 29
3
Replacing several rows of a matrix at once
Hello Folks. This must be a silly question with a (not) obvious (to me)
answer.
Consider this:
tmp <- matrix(1:200, nrow = 20)
vec <- 300:309
tmp[9,] <- vec # replacing one row works fine
p <- c(3, 11, 17)
tmp[p,] <- vec
# replacing multiple rows pastes the values down a column and recycles vec.
What I want to do is replace multiple rows simultaneously. I
suppose I can
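One possible approach, sketched under the assumption that every selected row should receive the same copy of `vec`. Assignment into `tmp[p, ]` fills the selected block column by column, so the replacement values need to be arranged in that order; `rep(vec, each = length(p))` does this. This is an illustration, not necessarily what the thread settled on.

```r
tmp <- matrix(1:200, nrow = 20)   # 20 rows, 10 columns
vec <- 300:309                    # one value per column
p <- c(3, 11, 17)                 # rows to overwrite

# Repeat each element of vec once per target row, so that the
# column-major fill puts vec[j] into column j of every selected row
tmp[p, ] <- rep(vec, each = length(p))

tmp[3, ]    # same values as vec
tmp[11, ]   # same values as vec
```

An equivalent form is `tmp[p, ] <- matrix(vec, nrow = length(p), ncol = length(vec), byrow = TRUE)`, which makes the intended layout explicit.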
2010 Mar 14
2
Why doesn't vec[-real.number] give an error or warning? Kids do the darndest things!
Hi all...
My students were conflating grepping for a value in a vector to get the
index, and then removing it with [-index], for instance like this:
set.seed(17)
v <- rnorm(20)
s <- v[-1.18]
They were trying to remove the 12th value in v, which is -1.18 or so.
But the result is, as documented in ?Extract, to coerce 1.18 to the next
lowest integer, and remove the 1st value of v, not the
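A small sketch of the behaviour described above, plus one way to remove an element by (approximate) value instead. `?Extract` states that non-integer numeric indices are truncated towards zero, so `-1.18` acts like `-1`. The names `target` and `s2` are hypothetical, and the nearest-value lookup is only one of several reasonable choices.

```r
set.seed(17)
v <- rnorm(20)

# The fractional index is truncated towards zero, so this drops element 1
s <- v[-1.18]
identical(s, v[-1])   # TRUE

# To remove the element whose value is closest to -1.18,
# find its position first and then use that position as the index
target <- which.min(abs(v - (-1.18)))
s2 <- v[-target]
length(s2)            # 19
```

Comparing with `==` is unreliable for doubles here, which is why the sketch looks for the nearest value instead of an exact match.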
2009 Sep 07
1
xyplot {lattice} are different types possible for each panel?
Hello R Folks...
Using the example below, I'd like two of the panels to be plotted with type
= "p" but the third to be done with type = "h". I can't use type = c("p",
"p", "h") because this syntax applies all given types to every panel. I
don't think I can use groups and distribute.type because these are intended
for different styles of plotting within a single panel. As you can see, I
tried
2008 Apr 17
2
Suggestions: Terminology & Pkgs for following spectra over time
Hi Folks... No code to troubleshoot here. I need some suggestions about the
right terminology to use in further searching, and any suggestions about R
pkgs that might be appropriate.
I am in the planning stages of a project in which IR, NMR and other spectra
(I'm a chemist) would be collected on various samples, and individual
samples would be followed over time. The spectra will be feature
2011 May 20
3
Downloading a csv from Dropbox using the shareable link
Hello Kindred R Spirits...
I'm trying to get a file (csv) from Dropbox using their shareable link
concept. They issue a short URL that goes to a web page where you see
a button that says "Download File". They don't really give you the
URL of the file itself, just this page. Is there a way to coax R into
getting such a file? I don't even really want the file per
2009 Dec 29
1
ggplot2, building a simple formula interface
I'm trying to build a simple formula interface to work with a function using
ggplot2. The following scheme "works" up until the plot(p) request, at which
point there are complaints about xlim's and a blank graphics window.
Looking at str(p) I do see the limits are NULL, plus layer 1 claims to have
an empty data frame (but df is reproduced correctly). I'm sure I'm missing
something really
2005 Nov 07
1
reduce levels
Hi all:
I have an example that shows my problem:
> test <- data.frame(c("a", "b", "c"))
> colnames(test) <- "mm"
> sub <- subset(test, mm=="b")
> sub$mm
[1] b
Levels: a b c
> levels(sub$mm)
[1] "a" "b" "c"
How can I reduce the levels to exclusively those which
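A minimal sketch of one way to do this that also works in older versions of R: rebuild the factor from its own values, which keeps only the levels that actually occur. The `stringsAsFactors = TRUE` argument is an addition to the posted code, needed because R 4.0.0 and later no longer convert strings to factors by default.

```r
# Same data as in the post, with the column named directly
test <- data.frame(mm = c("a", "b", "c"), stringsAsFactors = TRUE)
sub <- subset(test, mm == "b")
levels(sub$mm)            # "a" "b" "c": all original levels survive

# Re-creating the factor discards the levels with no cases
sub$mm <- factor(sub$mm)
levels(sub$mm)            # "b"
```

For a single factor, `sub$mm[, drop = TRUE]` has the same effect.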
2010 Jul 12
2
findInterval and data resolution
Hello Wise Ones...
I need a clever way around a problem with findInterval. Consider:
vec1 <- 1:10
vec2 <- seq(1, 10, by = 0.1)
x1 <- c(2:3)
a1 <- findInterval(x1, vec1); a1 # example 1
a2 <- findInterval(x1, vec2); a2 # example 2
In the problem I'm working on, vec* may be either integer or numeric, like
vec1 and vec2. I need to remove one or more sections of this vector;
2009 Aug 11
1
Selecting/Accessing the last vector in a list of a list of data.frames
Hello Again R Folks:
I'm trying to clean up some code. Suppose I have an object like this:
> str(test)
List of 2
$ G:List of 2
..$ cls:'data.frame': 101 obs. of 2 variables:
.. ..$ V1: num [1:101] -0.0019 -0.0019 -0.00189 -0.00188 -0.00186 ...
.. ..$ V2: num [1:101] 0.000206 0.000247 0.000288 0.000329 0.000371 ...
..$ rob:'data.frame': 101 obs. of 2
2009 Oct 23
1
ggplot2: stat_bin ..count.. with geom_text when NA is present
One for the ggplot2 gurus...
I have a function which makes a plot just fine if the response vector (res
in the example; fac1 is a factor) has no NA in it. It plots the data, then
makes a little annotation at the bottom with the data counts using:
p <- p + geom_text(aes(x = fac1, y = min(res) - 0.1 * diff(range(res)),
label = paste("n = ", ..count.. , sep =
2009 Aug 10
3
Need Advice: Considering Converting a Package from S3 to S4
Hello R Folks...
Not a technical question, but I need some advice and perspective.
I've got a set of functions I'm planning to put together into a package.
The main hunk of data that gets used by different functions is currently an
S3 list. I've been reading about S4 objects, and I see the (numerous)
advantages of them. I have seen the recommendation that all new packages be
done with S4.