Displaying 20 results from an estimated 20000 matches similar to: "Obtaining the internal integer codes of a factor XXXX"
2011 Sep 01
2
Including only a subset of the levels of a factor XXXX
Hello everyone,
I have the following factor:
levels(pp_income)
[1] "" "1" "2" "3" "4" "5" "6" "7"
[9] "8" "9" "Renter"
I want to subset so that only values 1:9 are included. I have the following:
> income<-pp_income[pp_income %in%
2011 Aug 24
3
Importing data from MS EXCEL (.xls) to R XXXX
Hello everyone,
What is the simplest, most RELIABLE way to import data from MS EXCEL (.xls)
format to R? In the past I have used the read.xls() function from the
xlsReadWrite package; however, I have been wrestling with it all afternoon
long with no success. I continue to receive the following error message:
> {widge<-read.xls("F:\\Classes\\Z1.Data\\stat.3010\\WidgeOne.xls",
+
2012 Mar 02
3
Memory issue. XXXX
Hi everyone,
Any ideas on troubleshooting this memory issue:
> d1<-read.csv("arrears.csv")
Error: cannot allocate vector of size 77.3 Mb
In addition: Warning messages:
1: In class(data) <- "data.frame" :
Reached total allocation of 1535Mb: see help(memory.size)
2: In class(data) <- "data.frame" :
Reached total allocation of 1535Mb: see
2011 Nov 28
1
2 Y-AXIS labels on the same (left-hand side) Y-AXIS XXXX
Hello everyone,
Is it possible to specify a 2 line y-axis label on the same left-hand side
y-axis? I am using the \n escape sequence, but only the 2nd line appears
(I assume the 1st line is printed off the page...)
plot(PRE_SHB,R1,
main="Figure 1.1: Scatterplot of Residualized
Post Score",
xlab = "Pre Score",
ylab = "Residualized Post Score \n
(Adjusted for Age
2011 Aug 27
2
Placing a column name in a variable XXXX
Hi everyone,
How does one place an object name (in this case a vector name) into another
object (while essentially masking the values of the first object)?
For example:
> JOBSAT<-rnorm(40)
>
> CI<-function(x,alpha){
+ result<-cbind(x,mean=mean(x),alpha)
+ print(result)
+ }
> CI(JOBSAT,.05)
I want this to return:
Variable mean alpha
JOBSAT 0.02844131 0.05
2012 Jan 17
3
Using !is.na() in a HAVING clause in sqldf() XXXX
Hi everyone,
I have the following:
sqldf("select Premie,count(tpounds) N,avg(tpounds) Avg_Weight,
stddev_samp(tpounds) StdDev
from children
group by Premie
having !is.na(Premie)")
sqldf() does not like the !is.na(Premie) specification. How does one
exclude a "missing" group in an aggregated query using sqldf()?
Thanks!
Dan
2012 Mar 07
2
Remove a word from a character vector value XXXX
Hi everyone,
What is the easiest way to remove the word Average and strip leading
and trailing blanks from the character vector (d5.Region) below?
.nrow.d5. d5.Region
1 1 Central Average
2 2 Coastal Average
3 3 East Average
4 4 Metro East Average
5 5 Metro North Average
6 6 Metro South Average
7
2012 Jan 04
3
Using a mathematical expression in sapply() XXXX
Hello everyone,
I have the following call to sapply() and error message. Is the most
efficient way to deal with this to make sum(!is.na(x)) a function in a
separate line prior to this call? If not, please advise.
N.Valid=sapply(x,sum(!is.na(x)))
Error in match.fun(FUN) :
'sum(!is.na(x))' is not a function, character or symbol
Thanks!
Dan
2012 Jan 24
2
Writing a function to return column position XXXX
Hello everyone,
I am writing my own function to return the column index of all variables
(these are currently character vectors) in a data frame that contain a
dollar sign($). A small piece of the data look like this:
can_sta can_zip ind_ite_con ind_uni_con
AL 36106 $251,895.80 $22,874.43
AL 35802 $141,373.60 $7,100.00
AL 35201 $273,208.50 $18,193.66
AR 72404 $186,918.00 $25,391.00
AR
2012 Jan 19
2
Reading in tab (and space) delimited data within a script XXXX
Hello everyone,
I use Bob Muenchen's approach for reading in "in-stream" (to use SAS
parlance) delimited data within a script. This works great:
mystring <-
"id,workshop,gender,q1,q2,q3,q4
1,1,f,1,1,5,1
2,2,f,2,1,4,1
3,1,f,2,2,4,3
4,2, ,3,1, ,3
5,1,m,4,5,2,4
6,2,m,5,4,5,5
7,1,m,5,3,4,4
8,2,m,4,5,5,5"
mydata <- read.table( textConnection(mystring),
2013 Oct 08
1
Summary functions in sqldf() XXXX
Hi everyone,
Is it possible to obtain the 1st & 3rd quartiles & the median in a sqldf()
select statement? If so, can you please provide the summary fn code?
Thanks!
Dan
2011 Nov 10
1
R package for segmentation with both continuous and categorical input variables XXXX
Hello everyone,
Can anyone suggest a decently documented (with good examples in the
documentation) R package/function that performs segmentation (cluster,
mixture modeling) of a population using both continuous and categorical
input variables?
Thank you,
Dan
2011 Dec 30
2
Applying mode() or class() to each column of a data.frame XXXX
Hi everyone,
I am attempting to use the apply() function to obtain the mode and class of
each column in a data frame; however, I am encountering unexpected results.
I have the following example data:
v13<-1:6
v14<-c(1,2,3,3,NA,1)
v15<-c("Good","Bad",NA,"Good","Bad","Bad")
2012 Jan 08
2
Convert components of a list to separate columns in a data frame or matrix XXXX
Hello everyone,
What is the most efficient & simplest way to convert all components of a
list to separate columns in a matrix?
Is there an easy way to programmatically "pad" the length of the resulting
shorter character vectors so that they can be easily combined into a data
frame?
I have the following code that stores the 2 components (of differing
lengths) in the same character
2011 May 04
3
SAPPLY function XXXX
Hello everyone,
I am attempting to write a function to count the number of non-missing
values of each column in a data frame using the sapply function. I have the
following code which is receiving the error message below.
> n.valid<-sapply(data1,sum(!is.na))
Error in !is.na : invalid argument type
Ultimately, I would like for this to be 1 component in a larger function
that will produce
2011 May 08
3
%in% operator - NOT IN
Hello everyone,
I am attempting to use the %in% operator with the ! to produce a NOT IN type
of operation. Why does this not work? Suggestions?
> data2[data1$char1 %in% c("string1","string2"),1]<-min(data1$x1)
> data2[data1$char1 ! %in% c("string1","string2"),1]<-max(data1$x1)+1000
Error: unexpected '!' in "data2[data1$char1
2016 Apr 16
2
Equivalent in R of the Contains operator in SAS
Hi all,
I want to select all variables in the data.frame with a name that
includes a certain string. Something like the following:
merge3[,names(merge3) %in% c("Email","Email.x")]
But there are too many variations on the Email variable names to list them all.
Can anyone advise?
Thanks!
Dan
2017 Jul 14
2
Efficient Binning
Hi all,
I have a situation where I have 16 bins. I generate a random number and
then want to know which bin number the random number falls in. Right now, I
am using a series of 16 if() else {} statements which get very complicated
with the embedded curly braces. Is there a more efficient (i.e., easier)
way to go about this?
boundaries<-(0:16)/16
rand<-runif(1)
Which bin number (1:16)
2012 May 07
3
Using expression() in plot() XXXX
Hello everyone,
I am trying to add the following text (in proper notation) to a
graphic using expression().
X-bar (with a subscript of cv) = XX.
Note: Ideally "cv" would be a subscript, but it doesn't have to be.
I have the following code:
> text(625,.012,expression(bar(X)cv = 552.01))
Error: unexpected symbol in "text(625,.012,expression(bar(X)cv"
I have also
2012 Jan 13
2
Merging data XXXX
Hello everyone,
I have 1 data frame (just a vector in the example below) with 12
individuals listed and a separate vector of 36 days (in week intervals).
What is the best way to merge these together so that each individual
("specialist" here) has all 36 days matched with their specialist number (a
one to many merge in SAS; essentially resulting in long format data).