thr3ads.net - similar to: "number of distinct values in a dataframe"

Displaying 20 results from an estimated 1000 matches similar to: "number of distinct values in a dataframe"

2003 Feb 04

test for two samples

Hi R-users, My question is more methodological one, rather than technical. I have to samples representing residuals based on two measurements techniques (resid1,resid2; n=69). I need to compare two samples, to reject one technique (the worse one), and to keep the one which gave lower residuals (better one). What to look for? What should I analyse? Means, variance, std. deviations? Based on

postscript() problem

2002 Aug 30

postscript() problem

Hi R-users, I am using "postscript("xx.eps", horizontal=FALSE, onefile=FALSE, height=6, width=10, pointsize=10)" and "barplot(xx.dat)" in R to produce EPS graphics for LATeX document. When checking the graphics with "gv" I see that it was not produced with full details. The main graphic is there, but axes labels and titles are missing. While checking the

how to identify the outliers

2002 Nov 26

how to identify the outliers

Hello R-users, Is there any more sophisticated way how to identify the dataset outliers other then seeing them in boxplot? I wanna exclude them from further analysis and I am interested in their position in my vector data. Rado -- Radoslav Bonk M.S. Dept. of Physical Geography and Geoecology Faculty of Sciences, Comenius University Mlynska Dolina 842 15, Bratislava, SLOVAKIA tel: +421 2 602

negative and positive values in diff. colors

2002 Nov 07

negative and positive values in diff. colors

Hi R-users, I have a set of elevation residuals as geodata points. I would like to display them in the following way: -negative values using pch=20 (filled circle) -positive values using pch=1 (empty circle) while using the cex.max=5, cex.min=0.1 for points() to represent the residuals value. Basically I would like to distinguish neagtive and positive values at my map. How to do this. Thanks

documents for writing functions

2003 Sep 04

documents for writing functions

Hi, Does anybody know suitable documents (manuals) on writing user functions (covering loops, conditions ...) in R? Other than the usually available manuals. Thanks, Rado -- Radoslav Bonk M.S. Dept. of Physical Geography and Geoecology Faculty of Sciences, Comenius University Mlynska Dolina 842 15, Bratislava, SLOVAKIA tel: +421 905 968 127 e-mail: rbonk at host.sk

matrix: rows to columns conversion

2002 Jul 08

matrix: rows to columns conversion

Hi, I'm new to "R". I have a matrix of 6x12. I need to produce matrix of 12x6, such that rows in new matrix are columns from original matrix, and columns in a new matrix are rows from the original one. I was playing with rbind(), and cbind() with no success. Regards, -Rado -- Radoslav Bonk M.S. Dept. of Physical Geography and Geoecology Faculty of Natural Sciences, Comenius

R commands formating for LaTeX

2004 Mar 22

R commands formating for LaTeX

Dear R users, I have to include typical UNIX formated R commands and outputs into the article using LaTeX. What is the easiest way to include R prompt commands and text outputs (including messages) into LaTeX documents? Is there any template or library? So far I'm trying to produce R commands in LaTeX using LaTeX general format options. Thanks, Rado

ROracle - fetch gives empty dataframe

2004 Jul 14

ROracle - fetch gives empty dataframe

Dear R-users, I was able to make ROracle package to connect to the DB (Oracle91, 64bit, on Solaris). But after executing siple SQL query, "fetch" commaned gives me an empty dataframe. ### RORACLE INSTALATION PROCEDURE ### R CMD INSTALL --configure-args='--enable-extralibs' --enable-oracle32=no ~/tmp/ROracle_0.5-5.tar.gz #since we have 64bit Oracle9i instalation PROBLEM:

extract columns from a dataframe

2004 Jul 05

extract columns from a dataframe

Dear R users, I'm coming back to R after while. I have a data frame with 200 columns, each column has a name. How to extract all columns to a new dataset, but the specified (by names) ones? I was playing with that for a little bit using the vector syntax but got several syntax errors. Thanks, Rado

outliers/interval data extraction

2003 Feb 20

outliers/interval data extraction

Dear R-users, I have two outliers related questions. I. I have a vector consisting of 69 values. mean = 0.00086 SD = 0.02152 The shape of EDA graphics (boxplots, density plots) is heavily distorted due to outliers. How to define the interval for outliers exception? Is <2SD - mean + 2SD> interval a correct approach? Or should I define 95% (or 99%) limit of agreement for data interval,

R and databases (Oracle)

2004 Jul 05

R and databases (Oracle)

Dear R users, I'm working on implementation of hydrometeorological DB in Oracle9i and would like to use R, as a plotting engine for graphs and maps and also to perform some non trivial statistics on DB data using R. Thus any links with examples or efforts on: - general DB and R cooperation - Oracle91 and R cooperation - Oracle PL/SQL and R are welcomed. Thanks in advance, Rado --

matrix transformation into 3 columns II.

2006 May 18

matrix transformation into 3 columns II.

Dear R-users Sorry for posting the previous message too soon before I have finished it. I have matrix (mprecip) with headers: > dim(mprecip) [1] 6268 170 > mprecip date GilzeRijen Eindhoven Volkel ZuidLimburg Arcen Ubachsberg 1 01/01/1978 NA 0.0 NA 0.1 NA NA 2 01/02/1978 NA 0.0 NA 0.0 NA NA 3

ROracle package error

2004 Jul 12

ROracle package error

Dear R-users, In order to use ROracle (v. 0.5-5) package I compiled R-1.9.1 on Linux (2.4.20-28.9smp) and installed it. I need to use Oracle9i on Solaris. When executing "library(ROracle)" I got the following error message: > library(ROracle) Error in dyn.load(x, as.logical(local), as.logical(now)) : unable to load shared library

Improving on Rice coding

2004 Sep 10

Improving on Rice coding

Hello, I am the author of the Bonk audio compression program... i've just been looking at your comparrison table, and i noticed bonk gets marginally better compression than Flac on some files (actually i was rather surprised to see bonk on the list at all, it's not exactly high profile :-) ). Bonk in lossless mode is a pretty naive implementation of a predictive coder, so the main

Correlation Matrix

2011 Apr 07

Correlation Matrix

Listers, I have a question regarding correlation matrices. It is fairly straight forward to build a correlation matrix of an entire data frame. I simply use the command cor(MyDataFrame). However, what I would like to do is construct a smaller correlation matrix using just three of the variable out of my data set. When I run this: cor(MyDataFrame$variable1,

problem with split eating giga-bytes of memory

2009 Dec 08

problem with split eating giga-bytes of memory

I'm having trouble using split on a very large data-set with ~1400 levels of the factor to be split. Unfortunately, I can't reproduce it with the simple self-contained example below. As you can see, splitting the artificial dataframe of size ~13MB results in a split dataframe of ~ 144MB, with an increase memory allocation of ~10 fold for the split object. If split scales linearly, then my

Partial aggregate on sorted data

2007 Oct 24

Partial aggregate on sorted data

Hi All, I'm looking for ways to compute aggregate statistics (with the aggregate function) but with an option for sorting and selecting a subset of the data frame. For example, I have would like to turn this : aggregate(myDataframe$TargetValue,list(SomeFactor = myDataframe$SomeFactor),mean) into something like aggregate(myDataframe$TargetValue,list(SomeFactor =

Can't convert list to matrix properly

2010 Mar 11

Can't convert list to matrix properly

Hi guys, here is a list of names that I have: MyList: > myList<-list("A", "B","C","D") > myList [[1]] [1] "A" [[2]] [1] "B" [[3]] [1] "C" [[4]] [1] "D" I want to turn this list into a matrix of 1 row and 4 columns with those four components (A, B, C, D) so here is what I do: myDataFrame <-

help with code

2010 Apr 26

help with code

I am new to R and have tried for a good while to figure out how to code this in R. The dataset below: FTIStandKey State County FTITract CoverType Ver_CT V_Origin V_SpGrp NAH6005-001 Texas Jasper NAH6005 PPLB-2000-U PPLB-2000-U P P NAH6005-002 Texas Jasper NAH6005 NHHX-1950-O NHHX-1950-I N H NAH6253-001 Texas Tyler NAH6253 PPLB-2001-U PPLB-2001-U P P

problem with using apply for dataframe

2012 Jul 19

problem with using apply for dataframe

Dear people, I am including an example of a dataframe: mydataframe<-data.frame(X=c(1:4),total_bill=c(16.99,10.34,21.01,23.68),tip=c(1.01,1.66,3.50,3.31),sex=c("Male","Male","Male","Female")) When I use the sapply function getting the information about the factors works: sapply(mydataframe,function(x)is.factor(x)) X total_bill tip

similar to: number of distinct values in a dataframe