Displaying 13 results from an estimated 13 matches for "aajit75".
2011 Oct 31
3
How to get Quartiles when data contains both numeric variables and factors
When data contains both factor and numeric variables, how to get quartiles
for all numeric variables?
n <- 100
x1 <- runif(n)
x2 <- runif(n)
x3 <- x1 + x2 + runif(n)/10
x4 <- x1 + x2 + x3 + runif(n)/10
x5 <- factor(sample(c('a','b','c'),n,replace=TRUE))
x6 <- factor(1*(x5=='a' | x5=='c'))
data1 <- cbind(x1,x2,x3,x4,x5,x6)
data
2011 Nov 15
2
Putting directory path as a parameter
Hi List,
I am new to R, this may be simple.
I want to store directory path as parameter which in turn to be used while
reading and writing data from csv files.
How I can use dir defined in the below mentioned example while reading the
csv file.
Example:
dir <- "C:/Users/Desktop" #location of file
temp_data <- read.csv("dir/bs_dev_segment_file.csv")
If I run this
2012 Oct 30
2
Java Exception error while reading large data in R from DB using RJDBC.
Dear List,
Java Exception error while reading large data in R from DB using RJDBC.
I am trying to read large data from DB table(Vectorwise), using RJDBC
connection.
I have tested the connection with small size data and was able to fetch DB
tables using same connection(conn as in my code).
Please suggest where am i going wrong or alternate option to solve such
issues while reading large DB
2011 Oct 20
2
How to remove multiple outliers
Hi All,
I am working on the dataset in which some of the variables have more than
one observations with outliers .
I am using below mentioned sample script
library(outliers)
x1 <- c(10, 10, 11, 12, 13, 14, 14, 10, 11, 13, 12, 13, 10, 19, 18, 17,
10099, 10099, 10098)
outlier_tf1 = outlier(x1,logical=TRUE)
find_outlier1 = which(outlier_tf1==TRUE, arr.ind=TRUE)
beh_input_ro1 =
2011 Nov 04
1
Decision tree model using rpart ( classification
Hi Experts,
I am new to R, using decision tree model for getting segmentation rules.
A) Using behavioural data (attributes defining customer behaviour, ( example
balances, number of accounts etc.)
1. Clustering: Cluster behavioural data to suitable number of clusters
2. Decision Tree: Using rpart classification tree for generating rules for
segmentation using cluster number(cluster id) as target
2011 Oct 19
1
Subsetting data by eliminating redundant variables
Dear All,
I am new to R, I have one question which might be easy.
I have a large data with more than 250 variable, i am reducing number of
variables by redun function as in the example below,
n <- 100
x1 <- runif(n)
x2 <- runif(n)
x3 <- x1 + x2 + runif(n)/10
x4 <- x1 + x2 + x3 + runif(n)/10
x5 <- factor(sample(c('a','b','c'),n,replace=TRUE))
x6 <-
2012 Feb 15
1
Passing date as parameter while retrieving data from database using dbGetQuery
Hi All,
This might be simple question, I need to retrive data for modelling from the
databases. Eveytime date values changes so I countnot fix date value in the
code, it is required to pass as parameter.
When I pass the date as parameter, it throws error.
(ERROR: column "start_dt" does not exist Position: 285)
My script is as below, please guide me where am I going wrong?
All parameters
2011 Oct 22
1
Data frame manipulation by eliminating rows containing extreme values
Dear All,
I have got the limits for removing extreme values for each variables using
following function .
f=function(x){quantile(x, c(0.25, 0.75),na.rm = TRUE) - matrix(IQR(x,na.rm =
TRUE) * c(1.5), nrow = 1) %*% c(-1, 1)}
#Example:
n <- 100
x1 <- runif(n)
x2 <- runif(n)
x3 <- x1 + x2 + runif(n)/10
x4 <- x1 + x2 + x3 + runif(n)/10
x5 <-
2012 Aug 10
1
Solving binary integer optimization problem
Hi,
I am new to R for solving optimization problems, I have set of communication
channels with limited capacity with two types of costs, fixed and variable
cost. Each channel has expected gain for a single communication.
I want to determine optimal number of communications for each channel
maximizing ROI)return on investment) with overall budget as constraint.60000
is the budget allocated.
2012 May 24
1
Issues while using “lift.chart” and “adjProbScore” function from ”BCA” library
Dear List,
Couple of issues while using functions from ?BCA? library:
1. I am trying to use ?lift.chart? function from ?BCA? library, but facing
issues while using model where model formula is passed as formula object in
glm.
When model formula is written as text, then it works fine. In my case input
variables and target variables are going to change dynamically, so have to
used formula as
2011 Dec 19
1
Calculating the probability of an event at time "t" from a Cox model fit
Dear R-users,
I would like to determine the probability of event at specific time using
cox model fit. On the development sample data I am able to get the
probability of a event at time point(t).
I need probability score of a event at specific time, using scoring scoring
dataset which will have only covariates and not the response variables.
Here is the sample code:
n = 1000
beta1 = 2; beta2 =
2011 Nov 29
0
Any function\method to use automatically Final Model after bootstrapping using boot.stepAIC()
Hi List,
Being new to R, I am trying to apply boot.stepAIC() for Model selection by
bootstrapping the stepAIC() procedure. I had gone through the discussion in
various thread on the variable selection methods. Understood the pros and
cons of various method, also going through the regression modelling
strategies in rms.
I want to read Final model or Formula or list of variables automatically
2012 Feb 09
0
Java heap space Error while reading table from postgres database using RJDBC
Hi List,
I am reading table from postgres database into R session using RJDBC, table
contains 150 columns and 200000 rows.
Sample code is as below, which works fine with smaller tables.
########################################################################
db_driver <- mydir$db_driver
db_jar_file <- mydir$db_jar_file
db_server <- mydir$db_server
db_server_lgn <- mydir$db_server_lgn