Displaying 20 results from an estimated 700 matches similar to: "Approximating discrete distribution by continuous distribution"
2011 Jul 15
2
Convert continuous variable into discrete variable
Dear all,
I have a continuous variable that can take on values between 0 and 100, for
example: x<-runif(100,0,100)
I also have a second variable that defines a series of thresholds, for
example: y<-c(3, 4.5, 6, 8)
I would like to convert my continuous variable into a discrete one using the
threshold variables:
If x is between 0 and 3 the discrete variable should be 1
If x is between 3
2009 Mar 16
2
FW: Select a random subset of rows out of matrix
Dear all,
I have a large dataset (N=100,000 with 89 variables per subject). This dataset is stored in a 100.000 x 89 matrix where each row describes one individual and each column one variable.
What is the easiest way of selecting a subset of let's say 1.000 individuals out of that whole matrix?
Thanks,
Michael
Michael Haenlein
Associate Professor of Marketing
ESCP-EAP European School of
2011 Apr 12
2
Testing equality of coefficients in coxph model
Dear all,
I'm running a coxph model of the form:
coxph(Surv(Start, End, Death.ID) ~ x1 + x2 + a1 + a2 + a3)
Within this model, I would like to compare the influence of x1 and x2 on the
hazard rate.
Specifically I am interested in testing whether the estimated coefficient
for x1 is equal (or not) to the estimated coefficient for x2.
I was thinking of using a Chow-test for this but the Chow
2011 May 13
6
Powerful PC to run R
Dear all,
I'm currently running R on my laptop -- a Lenovo Thinkpad X201 (Intel Core
i7 CPU, M620, 2.67 Ghz, 8 GB RAM). The problem is that some of my
calculations run for several days sometimes even weeks (mainly simulations
over a large parameter space). Depending on the external conditions, my
laptop sometimes shuts down due to overheating.
I'm now thinking about buying a more
2010 Nov 11
2
predict.coxph and predict.survreg
Dear all,
I'm struggling with predicting "expected time until death" for a coxph and
survreg model.
I have two datasets. Dataset 1 includes a certain number of people for which
I know a vector of covariates (age, gender, etc.) and their event times
(i.e., I know whether they have died and when if death occurred prior to the
end of the observation period). Dataset 2 includes another
2016 Apr 16
1
Social Network Simulation
Dear all,
I am trying to simulate a series of networks that have characteristics
similar to real life social networks. Specifically I am interested in
networks that have (a) a reasonable degree of clustering (as measured by
the transitivity function in igraph) and (b) a reasonable degree of degree
polarization (as measured by the average degree of the top 10% nodes with
highest degree divided by
2010 Aug 03
2
Collinearity in Moderated Multiple Regression
Dear all,
I have one dependent variable y and two independent variables x1 and x2
which I would like to use to explain y. x1 and x2 are design factors in an
experiment and are not correlated with each other. For example assume that:
x1 <- rbind(1,1,1,2,2,2,3,3,3)
x2 <- rbind(1,2,3,1,2,3,1,2,3)
cor(x1,x2)
The problem is that I do not only want to analyze the effect of x1 and x2 on
y but
2008 Apr 18
1
spdep question - Moran's I
Dear all,
I would like to calculate a Moran's I statistic using the moran function in the spdep package. The problem I'm having deals with how to create the listw object.
My data stems from the area of social network analysis. I have list of poeple and for each pair of them I have a measure of their relationship strength. So my dataset looks like: Jim; Bob; 0.5
This measure of
2011 Mar 26
1
Effect size in multiple regression
Dear all,
is there a convenient way to determine the effect size for a regression
coefficient in a multiple regression model?
I have a model of the form lm(y ~ A*B*C*D) and would like to determine
Cohen's f2 (http://en.wikipedia.org/wiki/Effect_size) for each predictor
without having to do it manually.
Thanks,
Michael
Michael Haenlein
Associate Professor of Marketing
ESCP Europe
Paris,
2010 Apr 22
1
Convert character string to top levels + NAN
Dear all,
I have several character strings with a high number of different levels.
unique(x) gives me values in the range of 100-200.
This creates problems as I would like to use them as predictors in a coxph
model.
I therefore would like to convert each of these strings to a new string
(x_new).
x_new should be equal to x for the top n categories (i.e. the top n levels
with the highest
2011 Feb 22
1
System of related regression equations
Dear all,
I would like to estimate a system of regression equations of the following
form:
y1 = a1 + b1 x1 + b2x2 + e1
y2 = a2 + c1 y1 + c2 x2 + c3 x3 + e2
Specifically the dependent variable in Equation 1 appears as an independent
variable in Equation 2. Additionally some independent variables that appear
in Equation 1 are also included in Equation 2.
I assume that I cannot estimate these two
2011 May 27
1
Help to improve existing R-Code
Dear all,
I have written a relatively brief R-Code to run a series of simulations.
Currently the code runs for a very long time (up to several days, depending
on the conditions) and I expect this to be the case because it might not be
very efficiently written. I am, for example, relying on several for(...)
loops which could probably be done much faster using a different way of
programming.
I am
2010 Jul 28
1
Time-dependent covariates in survreg function
Dear all,
I'm asking this question again as I didn't get a reply last time:
I'm doing a survival analysis with time-dependent covariates. Until now,
I have used a simple Cox model for this, specifically the coxph function
from the survival library. Now, I would like to try out an accelerated
failure time model with a parametric specification as implemented for
example in the survreg
2011 May 11
1
Total effect of X on Y under presence of interaction effects
Dear all,
this is probably more a statistics question than an R question but probably
there is somebody who can help me nevertheless.
I'm running a regression with four predictors (a, b, c, d) and all their
interaction effects using lm. Based on theory I assume that a influences y
positively. In my output (see below) I see, however, a negative regression
coefficient for a. But several of the
2011 Sep 19
1
Binary optimization problem in R
Dear all,
I would like to solve a problem similar to a multiple knapsack problem and
am looking for a function in R that can help me.
Specifically, my situation is as follows: I have a list of n items which I
would like to allocate to m groups with fixed size. Each item has a certain
profit value and this profit depends on the type of group the item is in. My
problem is to allocate the items
2011 Sep 27
2
Pearson chi-square test
Dear all,
I have some trouble understanding the chisq.test function.
Take the following example:
set.seed(1)
A <- cut(runif(100),c(0.0, 0.35, 0.50, 0.65, 1.00), labels=FALSE)
B <- cut(runif(100),c(0.0, 0.25, 0.40, 0.75, 1.00), labels=FALSE)
C <- cut(runif(100),c(0.0, 0.25, 0.50, 0.80, 1.00), labels=FALSE)
x <- table(A,B)
y <- table(A,C)
When I calculate the test statistic by hand
2011 Sep 21
2
Cannot allocate vector of size x
Dear all,
I am running a simulation in which I randomly generate a series of vectors
to test whether they fulfill a certain condition. In most cases, there is no
problem. But from time to time, the (randomly) generated vectors are too
large for my system and I get the error message: "Cannot allocate vector of
size x".
The problem is that in those cases my simulation stops and I have to
2012 Apr 12
2
Curve fitting, probably splines
Dear all,
This is probably more related to statistics than to [R] but I hope someone
can give me an idea how to solve it nevertheless:
Assume I have a variable y that is a function of x: y=f(x). I know the
average value of y for different intervals of x. For example, I know that
in the interval[0;x1] the average y is y1, in the interval [x1;x2] the
average y is y2 and so forth.
I would like to
2010 Jul 14
1
Printing status updates in while-loop
Dear all,
I'm using a while loop in the context of an iterative optimization
procedure. Within my while loop I have a counter variable that helps me to
determine how long the loop has been running. Before the loop I initialize
it as counter <- 0 and the last condition within my loop is counter <-
counter + 1.
I'd like to print out the current status of "counter" while the
2017 Dec 14
1
Aggregation across two variables in data.table
Dear all,
I have a data.frame that includes a series of demographic variables for a
set of respondents plus a dependent variable (Theta). For example:
Age Education Marital Familysize
Income Housing Theta
1: 50 Associate degree Divorced 4
70K+ Owned with mortgage 9.147777
2: 65