similar to: Summarization

Displaying 20 results from an estimated 4000 matches similar to: "Summarization"

2011 May 31
2
Text Summarization
Is there a text mining/ NLP package in R that could do text summarization? For example, take a huge text as input and provide a summary of the text. In package tm, summarization is defined more as high frequency terms which is not what I want. I actually want a summary of what is present in the huge volume of text. Any help on a R package would be helpful. Thank you. Ravi -- View this message
2013 Feb 14
3
list of matrices --> array
i'm somehow embarrassed to even ask this, but is there any built-in method for doing this: my_list <- list() my_list[[1]] <- matrix(1:20, ncol = 5) my_list[[2]] <- matrix(20:1, ncol = 5) now, knowing that these matrices are identical in dimension, i'd like to unfold the list to a 2x4x5 (or some other permutation of the dim sizes) array. i know i can initialize the array, then
2008 Jul 11
1
data summarization etc...
Hello, I am trying to do some fairly straightforward data summarization, i.e., the kind you would do with a pivot table in excel or by using SQL queires. I have a moderately sized data set of ~70,000 records and I am trying to compute some group averages and sum values within groups. the code example below shows how I am trying to go about doing this pti <-rnorm(70000,10) fid <-
2012 Mar 01
2
Problems downloading file
I am running the following line to download data from the US Energy Information Administration. This function has worked successfully for me in the past but yesterday gave the error/warning messages below. If I simply type "http://ir.eia.gov/wpsr/psw09.xls" (no quotes) into a browser, the file is available to download. I am running R 2.13.0 on Windows XP. # Download File Attempt
2011 May 26
2
zoo column names
I have a zoo object that contains 2 time series named "A-B" and "V1". When I create a third series "V2", the name of the "A-B" series is changed to "A.B". Although I could recreate the names for the 3 series I am wondering if there is a way of preventing the name change from happening ( ... maybe an equivalent of the keep.names=TRUE statement
2008 Sep 14
3
Using R from Java
Hello, I am interesting in using R from a web application, for basic statistics and plots. The server is Java-based (tomcat). The simplest solution is a system call that generates the text or the image, then the servlet forwards the output. This can be done from any language, but it is quite inelegant and slow for the initialization time. Is there any package or approach for accessing R from a
2011 Dec 22
2
Renaming Within A Function
I am trying to rename column names in a dataframe within a function. I am seeing an error (listed below) that I don't understand. Would be grateful of an explanation of what I am doing wrong and how I should rewrite the function to allow me to be able to rename my variables. Thanks. # Test Function myfunc <-function(var){ d = c(1,2,3,4,5) dts =
2002 Dec 12
4
sum a list of vectors
In Mathematica there is a neat feature, where you can change the head of a list from "list" to say "+" and obtain a sum of the list elements. I can't find a way to sum a list of vectors of same length or list of matrices of the same dimension and was curious if something like that exists in R. do.call("+",list) doesn't work because "+" accepts only
2012 Dec 07
1
how to add a column from another dataset with "merge"
kiotoqq wrote > I want to add a shorter column to my dataset with the function "merge", > it > should be filled with NAs wo be as long as the other colums, like this: > > id age > 9 46 > 8 56 > 6 52 > 5 NA > 4 NA > 3 NA > 1 NA > > i did this: > pa1 <- merge(pa1, an1, by="mergeid") > > and it says
2003 Aug 20
5
Interlacing two vectors
I want to interlace two vectors. This I can do: > x <- 1:4 > z <- x+0.5 > as.vector(t(cbind(x,z))) [1] 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 but this seems rather inelegant. Any suggestions? Murray -- Dr Murray Jorgensen http://www.stats.waikato.ac.nz/Staff/maj.html Department of Statistics, University of Waikato, Hamilton, New Zealand Email: maj at waikato.ac.nz
2007 Jan 11
1
Matching on multiple columns
Am I correct in believing that one cannot match on multiple columns? One can indeed subset on multiple criteria from different variables (or columns) but not from unique combinations thereof. I need to exclude about 10000 rows from 108000 rows of data based on several unique combinations of identifiers in two columns. Only merge() seems to be able to do that. Merge would allow me to positively
2009 Jun 11
2
Optimization Question
Hi All Apologies if this is not the correct list for this question. The Rglpk package offers the following example in its documentation library(Rglpk) ## Simple mixed integer linear program. ## maximize: 3 x_1 + 1 x_2 + 3 x_3 ## subject to: -1 x_1 + 2 x_2 + x_3 <= 4 ## 4 x_2 - 3 x_3 <= 2 ## x_1 - 3 x_2 + 2 x_3 <= 3 ## x_1, x_3 are non-negative integers ## x_2 is a non-negative real
2003 Apr 02
19
Combining the components of a character vector
Dear Help, Suppose I have a character vector. x <- c("Bob", "loves", "Sally") I want to combine it into a single string: "Bob loves Sally" . paste(x) yields: paste(x) [1] "Bob" "loves" "Sally" The following function combines the character vector into a string in the way that I want, but it seems somewhat inelegant.
2003 Mar 31
4
Convert char vector to numeric table
I'm a great fan of read.table(), but this time the data had a lot of cruft. So I used readLines() and editted the char vector to eventually get something like this: " 23.4 1.5 4.2" " 19.1 2.2 4.1" and so on. To get that into a 3 col numeric table, I first just used: writeLines(data,"tempfile")
2011 Mar 29
3
passing arguments via "..."
I would like to do something like the following: Fancyhist<-function(x,...) { # first, process x into xprocess somehow, then ... if (is.null(breaks)) { # yes, I know this is wrong # define the histogram breaks somehow, then call hist: hist(xprocess,breaks=breaks,...) } else { # use breaks give in calling argument hist(xprocess,...) } } But, those of you who know R better
2011 Jan 15
2
Rounding variables in a data frame
Hi All I am trying to use the round function on some columns of a dataframe while leaving others unchanged. I wish to specify those columns to leave unchanged. My attempt is below - here, I would like the column d3 to be left but columns d1, d2 and d4 to be rounded to 0 decimal places. I would welcome any suggestions for a nicer way of doing this. d1= rnorm(10,10) d2= rnorm(10,6) d3=
1999 Aug 02
2
zero replacement
AARRGGHH! Sometimes it's the simple things that are particularly frustrating, especially late at night.... Can anyone suggest a simple means for replacing all of the zero values in a matrix with NANs? I ended up writing an awk script to massage the input file, which works, of course, but is rather an inelegant blunt instrument. I'd prefer an R operation. I'm certain that
2008 Apr 14
2
Plotting with exact axis limits
Hello, If I make a plot, say something simple like plot( runif(100) ) then the origin (0,0) is not at the bottom-left corner of the box surrounding the plot. The axis limits are "padded" slightly. This is ordinarily a good feature, because it makes plots look better. But now I would like to make a plot with the origin exactly on the bottom left. Through trial and error, I have
2013 Mar 25
2
Ordering a matrix by row value in R2.15
fitz_ra wrote > I know this is posted a lot, I've been through about 40 messages reading > how to do this so let me apologize in advance because I can't get this > operation to work unlike the many examples shown. > > I have a 2 row matrix >> temp > [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] > [,9] [,10] > [1,] 17.000
2005 Apr 02
2
An exercise in the use of 'substitute'
I would like to create a method for the generic function "with" applied to a class of fitted models. The method should do two things: 1. Substitute the name of the first argument for '.' throughout the expression 2. Evaluate the modified expression using the data argument to the fitted model as the first element of the search list. The second part is relatively easy. The