thr3ads.net - R help - [R] Odd behavior of a function within apply [Aug 2022]

If this information is useful, please help other people find it:
Share via:

Erin Hodgess

2022-Aug-08 17:22 UTC

[R] Odd behavior of a function within apply

Hello!

I have the following data.frame
 dput(test1.df[1:10,8:10])
structure(list(X1_1_HZP1 = c(48160L, 48198L, 80027L, 48161L,
NA, 48911L, NA, 48197L, 48021L, 11355L), X1_1_HBM1_mon = c("December",
"June", "August", "", "",
"August", "April", "February", "",
"December"), X1_1_HBM1_yr = c(2014L, 2018L, 2016L, NA, NA, 1985L,
2019L, 1993L, NA, 1990L)), row.names = c(NA, 10L), class =
"data.frame")

And the following function:> dput(count1a)function (x)
{
    if (typeof(x) == "integer")
        y <- sum(is.na(x))
    if (typeof(x) == "character")
        y <- sum(x == "")
    return(y)
}
When I use the apply function with count1a, I get the following:
 apply(test1.df[1:10,8:10],2,count1a)
    X1_1_HZP1 X1_1_HBM1_mon  X1_1_HBM1_yr
           NA             3            NA
However, when I do use columns 8 and 10, I get the correct response:
 apply(test1.df[1:10,c(8,10)],2,count1a)
   X1_1_HZP1 X1_1_HBM1_yr
           2            3>I am really baffled.  If I use count1a on a single column, it works fine.

Any suggestions much appreciated.
Thanks,
Sincerely,
Erin


Erin Hodgess, PhD
mailto: erinm.hodgess at gmail.com

	[[alternative HTML version deleted]]

Andrew Simmons

2022-Aug-08 17:29 UTC

head link

[R] Odd behavior of a function within apply

When you use apply, it converts the input to an array, so all of the
integer columns in your data frame are converted to character class. I
would use lapply, sapply, or vapply instead to do this, I think you just
need to remove the "MARGIN" argument.

On Mon, Aug 8, 2022, 13:25 Erin Hodgess <erinm.hodgess at gmail.com>
wrote:
> Hello!
>
> I have the following data.frame
>  dput(test1.df[1:10,8:10])
> structure(list(X1_1_HZP1 = c(48160L, 48198L, 80027L, 48161L,
> NA, 48911L, NA, 48197L, 48021L, 11355L), X1_1_HBM1_mon =
c("December",
> "June", "August", "", "",
"August", "April", "February", "",
> "December"), X1_1_HBM1_yr = c(2014L, 2018L, 2016L, NA, NA, 1985L,
> 2019L, 1993L, NA, 1990L)), row.names = c(NA, 10L), class =
"data.frame")
>
> And the following function:
> > dput(count1a)
> function (x)
> {
>     if (typeof(x) == "integer")
>         y <- sum(is.na(x))
>     if (typeof(x) == "character")
>         y <- sum(x == "")
>     return(y)
> }
> When I use the apply function with count1a, I get the following:
>  apply(test1.df[1:10,8:10],2,count1a)
>     X1_1_HZP1 X1_1_HBM1_mon  X1_1_HBM1_yr
>            NA             3            NA
> However, when I do use columns 8 and 10, I get the correct response:
>  apply(test1.df[1:10,c(8,10)],2,count1a)
>    X1_1_HZP1 X1_1_HBM1_yr
>            2            3
> >
> I am really baffled.  If I use count1a on a single column, it works fine.
>
> Any suggestions much appreciated.
> Thanks,
> Sincerely,
> Erin
>
>
> Erin Hodgess, PhD
> mailto: erinm.hodgess at gmail.com
>
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
	[[alternative HTML version deleted]]

R help - Aug 2022 - Odd behavior of a function within apply

[R] Odd behavior of a function within apply

[R] Odd behavior of a function within apply