similar to: number of distinct values in a dataframe

Displaying 20 results from an estimated 1000 matches similar to: "number of distinct values in a dataframe"

2003 Feb 04
1
test for two samples
Hi R-users, My question is more a methodological one than a technical one. I have two samples representing residuals from two measurement techniques (resid1, resid2; n=69). I need to compare the two samples, reject one technique (the worse one), and keep the one which gave lower residuals (the better one). What should I look for? What should I analyse? Means, variances, standard deviations? Based on
2002 Aug 30
2
postscript() problem
Hi R-users, I am using "postscript("xx.eps", horizontal=FALSE, onefile=FALSE, height=6, width=10, pointsize=10)" and "barplot(xx.dat)" in R to produce EPS graphics for a LaTeX document. When checking the graphics with "gv", I see that they were not produced in full detail. The main graphic is there, but axis labels and titles are missing. While checking the
2002 Nov 26
4
how to identify the outliers
Hello R-users, Is there a more sophisticated way to identify outliers in a dataset than spotting them in a boxplot? I want to exclude them from further analysis, and I am interested in their positions in my data vector. Rado -- Radoslav Bonk M.S. Dept. of Physical Geography and Geoecology Faculty of Sciences, Comenius University Mlynska Dolina 842 15, Bratislava, SLOVAKIA tel: +421 2 602
2002 Nov 07
4
negative and positive values in diff. colors
Hi R-users, I have a set of elevation residuals as geodata points. I would like to display them in the following way: negative values using pch=20 (filled circle) and positive values using pch=1 (empty circle), while using cex.max=5, cex.min=0.1 for points() to represent the residual values. Basically, I would like to distinguish negative and positive values on my map. How can I do this? Thanks
2003 Sep 04
2
documents for writing functions
Hi, Does anybody know of suitable documents (manuals) on writing user functions (covering loops, conditions ...) in R, other than the usually available manuals? Thanks, Rado -- Radoslav Bonk M.S. Dept. of Physical Geography and Geoecology Faculty of Sciences, Comenius University Mlynska Dolina 842 15, Bratislava, SLOVAKIA tel: +421 905 968 127 e-mail: rbonk at host.sk
2002 Jul 08
2
matrix: rows to columns conversion
Hi, I'm new to "R". I have a 6x12 matrix. I need to produce a 12x6 matrix, such that the rows of the new matrix are the columns of the original matrix, and the columns of the new matrix are the rows of the original one. I was playing with rbind() and cbind() with no success. Regards, -Rado -- Radoslav Bonk M.S. Dept. of Physical Geography and Geoecology Faculty of Natural Sciences, Comenius
2004 Mar 22
4
R commands formating for LaTeX
Dear R users, I have to include typical UNIX-formatted R commands and outputs in an article using LaTeX. What is the easiest way to include R prompt commands and text outputs (including messages) in LaTeX documents? Is there any template or library? So far I'm trying to produce R commands in LaTeX using LaTeX's general formatting options. Thanks, Rado
2004 Jul 14
3
ROracle - fetch gives empty dataframe
Dear R-users, I was able to get the ROracle package to connect to the DB (Oracle9i, 64bit, on Solaris). But after executing a simple SQL query, the "fetch" command gives me an empty dataframe. ### RORACLE INSTALLATION PROCEDURE ### R CMD INSTALL --configure-args='--enable-extralibs' --enable-oracle32=no ~/tmp/ROracle_0.5-5.tar.gz #since we have 64bit Oracle9i installation PROBLEM:
2004 Jul 05
4
extract columns from a dataframe
Dear R users, I'm coming back to R after a while. I have a data frame with 200 columns, and each column has a name. How can I extract all columns except the ones specified by name into a new dataset? I was playing with that for a little bit using the vector syntax but got several syntax errors. Thanks, Rado
2003 Feb 20
3
outliers/interval data extraction
Dear R-users, I have two outlier-related questions. I. I have a vector consisting of 69 values. mean = 0.00086 SD = 0.02152 The shape of the EDA graphics (boxplots, density plots) is heavily distorted due to outliers. How should I define the interval for excluding outliers? Is the interval <mean - 2SD, mean + 2SD> a correct approach? Or should I define 95% (or 99%) limits of agreement for the data interval,
2004 Jul 05
1
R and databases (Oracle)
Dear R users, I'm working on the implementation of a hydrometeorological DB in Oracle9i and would like to use R as a plotting engine for graphs and maps, and also to perform some non-trivial statistics on the DB data. Thus any links with examples or efforts on: - general DB and R cooperation - Oracle9i and R cooperation - Oracle PL/SQL and R are welcome. Thanks in advance, Rado --
2006 May 18
2
matrix transformation into 3 columns II.
Dear R-users, Sorry for posting the previous message before I had finished it. I have a matrix (mprecip) with headers: > dim(mprecip) [1] 6268 170 > mprecip date GilzeRijen Eindhoven Volkel ZuidLimburg Arcen Ubachsberg 1 01/01/1978 NA 0.0 NA 0.1 NA NA 2 01/02/1978 NA 0.0 NA 0.0 NA NA 3
2004 Jul 12
1
ROracle package error
Dear R-users, In order to use ROracle (v. 0.5-5) package I compiled R-1.9.1 on Linux (2.4.20-28.9smp) and installed it. I need to use Oracle9i on Solaris. When executing "library(ROracle)" I got the following error message: > library(ROracle) Error in dyn.load(x, as.logical(local), as.logical(now)) : unable to load shared library
2004 Sep 10
3
Improving on Rice coding
Hello, I am the author of the Bonk audio compression program... I've just been looking at your comparison table, and I noticed bonk gets marginally better compression than Flac on some files (actually I was rather surprised to see bonk on the list at all, it's not exactly high profile :-) ). Bonk in lossless mode is a pretty naive implementation of a predictive coder, so the main
2011 Apr 07
3
Correlation Matrix
Listers, I have a question regarding correlation matrices. It is fairly straightforward to build a correlation matrix of an entire data frame. I simply use the command cor(MyDataFrame). However, what I would like to do is construct a smaller correlation matrix using just three of the variables in my data set. When I run this: cor(MyDataFrame$variable1,
2009 Dec 08
1
problem with split eating giga-bytes of memory
I'm having trouble using split on a very large data set with ~1400 levels of the factor to be split. Unfortunately, I can't reproduce it with the simple self-contained example below. As you can see, splitting the artificial dataframe of size ~13MB results in a split dataframe of ~144MB, an increase in memory allocation of ~10-fold for the split object. If split scales linearly, then my
2007 Oct 24
3
Partial aggregate on sorted data
Hi All, I'm looking for ways to compute aggregate statistics (with the aggregate function) but with an option for sorting and selecting a subset of the data frame. For example, I would like to turn this: aggregate(myDataframe$TargetValue,list(SomeFactor = myDataframe$SomeFactor),mean) into something like aggregate(myDataframe$TargetValue,list(SomeFactor =
2010 Mar 11
2
Can't convert list to matrix properly
Hi guys, here is a list of names that I have: MyList: > myList<-list("A", "B","C","D") > myList [[1]] [1] "A" [[2]] [1] "B" [[3]] [1] "C" [[4]] [1] "D" I want to turn this list into a matrix of 1 row and 4 columns with those four components (A, B, C, D) so here is what I do: myDataFrame <-
2010 Apr 26
1
help with code
I am new to R and have tried for a good while to figure out how to code this in R. The dataset below: FTIStandKey State County FTITract CoverType Ver_CT V_Origin V_SpGrp NAH6005-001 Texas Jasper NAH6005 PPLB-2000-U PPLB-2000-U P P NAH6005-002 Texas Jasper NAH6005 NHHX-1950-O NHHX-1950-I N H NAH6253-001 Texas Tyler NAH6253 PPLB-2001-U PPLB-2001-U P P
2012 Jul 19
2
problem with using apply for dataframe
Dear people, I am including an example of a dataframe: mydataframe<-data.frame(X=c(1:4),total_bill=c(16.99,10.34,21.01,23.68),tip=c(1.01,1.66,3.50,3.31),sex=c("Male","Male","Male","Female")) When I use the sapply function getting the information about the factors works: sapply(mydataframe,function(x)is.factor(x)) X total_bill tip