thr3ads.net - similar to: "Identifying and Removing NA Columns and factor Columns with more than x Levels"

Displaying 20 results from an estimated 10000 matches similar to: "Identifying and Removing NA Columns and factor Columns with more than x Levels"

Bug in "transform"?

2008 Dec 02

Bug in "transform"?

Dear useRs, Here is a weird behavior of transform function: mtcars1<-matcars transform(mtcars1,t1=3,t2=4) Error in data.frame(`_data`, e[!matched]) : arguments imply differing number of rows: 32, 1 instead, this works: mtcars1$t1<-0 transform(mtcars1,t1=3,t2=4) also works if applied in turn: transform(mtcars1,t1=3) transform(mtcars1,t2=4) I often need to use this

How to perform clustering without removing rows where NA is present in R

2013 Dec 07

How to perform clustering without removing rows where NA is present in R

I have a data which contain some NA value in their elements. What I want to do is to **perform clustering without removing rows** where the NA is present. I understand that `gower` distance measure in `daisy` allow such situation. But why my code below doesn't work? __BEGIN__ # plot heat map with dendogram together. library("gplots") library("cluster")

Ggplot barchart drops factor levels: how to show them with zero counts?

2012 Mar 15

Ggplot barchart drops factor levels: how to show them with zero counts?

Hello, When plotting a barchart with ggplot it drops the levels of the factor for which no counts are available. For example: library(ggplot) mtcars$cyl<-factor(mtcars$cyl) ggplot(mtcars[!mtcars$cyl==4,], aes(cyl))+geom_bar() levels(mtcars[!mtcars$cyl==4,]) This shows my problem. Because no counts are available for factorlevel '4', the label 4 dissapears from the plot. However, I

List of Levels for all Factor variables

2012 Oct 16

List of Levels for all Factor variables

Hi, I want to get a clean succinct list of all levels for all my factor variables. I have a dataframe that's something like #1 below. This is just an example subset of my data and my actual dataset has 70 variables. I know how to narrow down my list of variables to just my factor variables by using #2 below (thanks to Bert Gunter). I can also get list of all levels for all my factor

Notation for previous observation in a data frame

2012 Jul 08

Notation for previous observation in a data frame

I've created a data frame in R, but in order to clean up some of the data, I need to set certain variable observations equal to the value of their previous observation (it would be conditional, but that part's less important right now). In Stata, I would simply set var = var[_n-1] in those cases. What is the R equivalent? [[alternative HTML version deleted]]

Why copying columns of a data.frame becomes numeric?

2013 Apr 12

Why copying columns of a data.frame becomes numeric?

Dear list, I want the 1st, 2nd, 5th, and 6th columns of mtcars. After copying them, the columns become numeric class rather than data frame. But, when I copy rows, they data frame retains its class. Why is this? I don't see why copying rows vs columns is so different. > class(mtcars) [1] "data.frame" > head(mtcars) mpg cyl disp hp drat wt qsec vs

Selecting A List of Columns

2013 May 17

Selecting A List of Columns

Dear R Helpers, I need help with a slightly unusual situation in which I am trying to select some columns from a data frame. I know how to use the subset statement with column names as in: x=as.data.frame(matrix(c(1,2,3, 1,2,3, 1,2,2, 1,2,2, 1,1,1),ncol=3,byrow=T)) all.cols<-colnames(x) to.keep<-all.cols[1:2] Kept<-subset(x,select=to.keep) Kept

loop for regression

2012 Jul 04

loop for regression

---------- Forwarded message ---------- From: Akhil dua <akhil.dua.12@gmail.com> Date: Wed, Jul 4, 2012 at 10:33 AM Subject: To: r-help@r-project.org Hi everyone I have data on stock prices and market indices and I need to run a seperate regression of every stock on market so I want to write a "for loop" so that I wont have to write codes again and again to run the

Bug in by() function which works for some FUN argument and does not work for others

2016 Apr 15

Bug in by() function which works for some FUN argument and does not work for others

Dear All, Thanks for your help. However, I would like to draw your attention to the following: Actually, I was replicating the Example 2.3, using the dataset "brainsize.txt" given in Section 2.3.3 ("Summarize by group") at page 55, of a famous book "R by Example" written by "Jim Albert and Maria Rizzo" published in Springers (2012) in a Use R! Series. The

Changing NA to 0 in selected columns of a dataframe

2012 Oct 11

Changing NA to 0 in selected columns of a dataframe

I've been beating my head on the table for hours now and don't understand why this doesn't work. I have a dataframe that I want to change NAs to 0 for some of the columns and not others. Consider this... >#create dataframe > A = c(1:5) > B = c(6, 7, NA, NA, NA) > C = c(NA, NA, 13, 14, 15) > D = c(16:20) > E = c(21, NA, NA, NA, 25) > data = as.data.frame ( cbind (

Bug in by() function which works for some FUN argument and does not work for others

2016 Apr 15

Bug in by() function which works for some FUN argument and does not work for others

> On Apr 15, 2016, at 1:16 AM, Akhilesh Singh <akhileshsingh.igkv at gmail.com> wrote: > > Dear All, > > Thanks for your help. However, I would like to draw your attention to the > following: > > Actually, I was replicating the Example 2.3, using the dataset > "brainsize.txt" given in Section 2.3.3 ("Summarize by group") at page 55, > of a

Objective: to create a Joint Probability Table and save as a csv file (with headers).Input: A csv file (with headers) containing raw data, arranged by columns.Variable: number of bins for each column of data, default to 5 bins per column.Data

2010 May 04

Hi Roscoe, I couldn't make it though that excel worksheet without my eyes glazing over... but do you want something like data(mtcars) tab <- xtabs(~cyl + gear, data=mtcars) tab/sum(tab) ? Best, Isat On Mon, May 3, 2010 at 10:08 PM, Roscoe <Roscoe at exemail.com.au> wrote: > Dear R-Help, > > First post, be gentle... > > I am new to R: I can perform basic functions

Bug in by() function which works for some FUN argument and does not work for others

2016 Apr 14

Bug in by() function which works for some FUN argument and does not work for others

I think you are not using the best function for what your intentions are. Try: > by(data=mtcars, INDICES=list(as.factor(mtcars$am)), FUN=colMeans) : 0 mpg cyl disp hp drat wt qsec vs 17.1473684 6.9473684 290.3789474 160.2631579 3.2863158 3.7688947 18.1831579 0.3684211 am gear carb 0.0000000

deleting columns from a dataframe where NA is more than 15 percent of the column length

2012 Aug 06

deleting columns from a dataframe where NA is more than 15 percent of the column length

I have a dataframe of 10 different columns (length of each column is the same). I want to eliminate any column that has 'NA' greater than 15% of the column length. Do i first need to make a function for calculating the percentage of NA for each column and then make another dataframe where i apply the function? Whats the best way to do this.

Bug in by() function which works for some FUN argument and does not work for others

2016 Apr 16

Bug in by() function which works for some FUN argument and does not work for others

Dear All, I have got your core message, that it is my responsibility to determine whether any particular function in my version of R satisfies the language requirements at the time of your use. Jim Albert and Maria Rizzo must have used their code, which was permitted in the R-code of their time (2012). Therefore, I have now modified my R-code, as per R-3..2.4 version, according to my requirement

how to plot a graph with different pch

2007 Oct 01

how to plot a graph with different pch

I am trying to plot a graph but the points on the graph should be different symbols and colors. It should represent what is in the legend. I tried using the points command but this does not work. Is there another command in R that would allow me to use different symbols and colors for the points? Thank you kindly. data(mtcars) plot(mtcars$wt,mtcars$mpg,xlab= "Weight(lbs/1000)",

How to add points to two plots parallelly ?

2011 Dec 13

How to add points to two plots parallelly ?

I am looking for ways to add points to three different plots in parallel. I generate three scatter plots and name them as s3d1, s3d2 and s3d3 s3d1<-scatterplot3d(mtcars[,3],mtcars[,4],mtcars[,5],main="common",pch=20) s3d2<-scatterplot3d(mtcars[,3],mtcars[,4],mtcars[,5],main="common",pch=20)

the first name of the first column

2018 May 10

the first name of the first column

Dear all; I need to run heatmap. Because my first column in my data is alphanumeric, I can not run as.matrix(scale(my_data)). So I need to make my data readable as in data(mtcars). In *mtcars *data the first column is alphanumeric and has no name. Thanks, Greg [[alternative HTML version deleted]]

Bug in by() function which works for some FUN argument and does not work for others

2016 Apr 17

Bug in by() function which works for some FUN argument and does not work for others

> On Apr 16, 2016, at 2:03 AM, Akhilesh Singh <akhileshsingh.igkv at gmail.com> wrote: > > Dear All, > > I have got your core message, that it is my responsibility to determine whether any particular function in my version of R satisfies the language requirements at the time of your use. Jim Albert and Maria Rizzo must have used their code, which was permitted in the R-code

suggestion: "." in [lsv]apply()

2020 Apr 16

suggestion: "." in [lsv]apply()

I'm sure this exists elsewhere, but, as a trade-off, could you achieve what you want with a separate helper function F(expr) that constructs the function you want to pass to [lsv]apply()? Something that would allow you to write: sapply(split(mtcars, mtcars$cyl), F(summary(lm(mpg ~ wt,.))$r.squared)) Such an F() function would apply elsewhere too. /Henrik On Thu, Apr 16, 2020 at 9:30 AM

similar to: Identifying and Removing NA Columns and factor Columns with more than x Levels