thr3ads.net - similar to: "Removing values from a string"

Displaying 20 results from an estimated 4000 matches similar to: "Removing values from a string"

Using the effects package to plot logit probabilities

2012 Aug 13

Using the effects package to plot logit probabilities

I'm trying to run a logit model and plot the probability curve for a number of the important predictors. I'm trying to do this with the Effects package. df=data.frame(income=c(5,5,3,3,6,5), won=c(0,0,1,1,1,0), age=c(18,18,23,50,19,39), home=c(0,0,1,0,0,1)) str(df) md1 = glm(factor(won) ~ income + age + home, data=df,

Error message with glm

2011 Dec 22

Error message with glm

I'm working on a logistic regression in R with the car package but keep getting the following error message. It's only and warning and not an error, but I'm just not sure how to resolve the issues. glm.fit: algorithm did not converge glm.fit: fitted probabilities numerically 0 or 1 occurred d1 = data.frame(mwin=c(mwin), mbid=c(mbid)) m1 = zelig(mwin ~ mbid, data=d1,

Plotting the probability curve from a logit model with 10 predictors

2012 Jul 05

Plotting the probability curve from a logit model with 10 predictors

I have a logit model with about 10 predictors and I am trying to plot the probability curve for the model. Y=1 = 1 / 1+e^-z where z=B0 + B1X1 + ... + BnXi If the model had only one predictor, I know to do something like below. mod1 = glm(factor(won) ~ as.numeric(bid), data=mydat, family=binomial(link="logit")) all.x <- expand.grid(won=unique(won), bid=unique(bid)) y.hat.new

Naive Bayes in R

2012 Aug 02

Naive Bayes in R

I'm developing a naive bayes in R. I have the following data and am trying to predict on returned (class). dat = data.frame(home=c(0,1,1,0,0), gender=c("M","M","F","M","F"), returned=c(0,0,1,1,0)) str(dat) dat$home <- as.factor(dat$home) dat$returned <- as.factor(dat$returned) library(e1071) m <- naiveBayes(returned ~ ., dat) m

Error When Installing the RODBC Package

2011 Nov 17

Error When Installing the RODBC Package

I'm running R in Ubuntu 10.10 and am trying to install the RODBC package. However, I get the following error message: ERROR: configuration failed for package ‘RODBC’ * removing ‘/home/amathew/R/i686-pc-linux-gnu-library/2.13/RODBC’ The downloaded packages are in ‘/tmp/RtmpekzPOQ/downloaded_packages’ Warning message: In install.packages() : installation of package 'RODBC' had

Predicting a linear model for all combinations

2011 Dec 21

Predicting a linear model for all combinations

Lets say I have a linear model and I want to find the average expented value of the dependent variable. So let's assume that I'm studying the price I pay for coffee. Price = B0 + B1(weather) + B2(gender) + ... What I'm trying to find is the predicted price for every possible combination of values in the independent variables. So Expected price when: weather=1, gender=male weather=1,

Grouping together a time variable

2012 Feb 09

Grouping together a time variable

I have the following variable, time, which is a character variable and it's structured as follows. > head(as.character(dat$time), 30) [1] "00:00:01" "00:00:16" "00:00:24" "00:00:25" "00:00:25" "00:00:40" "00:01:50" "00:01:54" "00:02:33" "00:02:43" "00:03:22" [12]

Calculating percentages across multiple columns

2012 Aug 08

Calculating percentages across multiple columns

I have the following data and am trying to find the percentage of bid values purchased for that price. So let's say I have a bid of 5 and it's sold 2 times for $3 and $5. Since the original bid was $5, the percentage of times that that bid value results in a sold purchase AT that specific bid level was 1/3 because of the three time where the bid was three, it ended up being sold for $5

Reordering a numeric variable

2011 Dec 15

Reordering a numeric variable

I'm running a linear model in R using the car package. I have a variable education, which i have recoded and regrouped to my wishes. However, R seems to place each element of that variable in alphabetical order. When I am running the model, don't I need the model order from lowest to highest to make an inference that a one unit change in one variable produced a one unit change in

Re-grouping data in R

2012 Aug 07

Re-grouping data in R

I have a data frame with a column of values that I want to bucket (group) into specific levels. > str(dat)'data.frame': 3678 obs. of 39 variables: $ id : int 23 76 129 156 166 180 200 214 296 344 ... $ final_purchase_amount : Factor w/ 32 levels "\\N","1082","1109",..: 1 1 1 1 1 1 1 1 1 1 ... So I ran the following to

Plotting every probability curve

2012 Sep 11

Plotting every probability curve

I don't have a logistic regression model and am trying to generate probability curves for all possible combinations of the variables. My logit model has 5+ variables, and I want to draw curves for every scenario. See code below. When home_owner is 0 and 1, I want curves. The same goes for all other variables categories, so that I have permutations for all possible combinations. I've

Using a function inside a function

2011 Jun 09

Using a function inside a function

I'm trying to run a function inside a function but get an error message. lst <- list(roots = c("car insurance", "auto insurance"), roots2 = c("insurance"), prefix = c("cheap", "budget"), prefix2 = c("low cost"), suffix = c("quote", "quotes"), suffix2 = c("rate", "rates"), suffix3 =

xml2-config issues

2011 Jul 23

xml2-config issues

I'm trying to install the XML package on Ubuntu 10.10, and I keep getting a warning message the XML could not be found and had non-zero exit status. How can I fix this problem? > install.packages() Loading Tcl/Tk interface ... done --- Please select a CRAN mirror for use in this session --- Installing package(s) into ‘/home/amathew/R/i686-pc-linux-gnu-library/2.13’ (as ‘lib’ is

Specifying a response variable in a Bayesian network

2012 Sep 26

Specifying a response variable in a Bayesian network

I'm trying to teach myself about Bayesian Networks and am working with the following data and the bnlearn package. I understand the conceptual aspects of BNs, but I'm not sure how to specify the response variables in R when constructing a dag plot. I've cecked ?hc and done numerous google searches without luck. Can anyone help? library("bnlearn")

Partial Matching

2011 Jun 04

Partial Matching

Let's say that I have a string and I want to know if a single word is present in the string. I've written the following function to see if the word "Geico" is mentioned in the string "Cheap Geico car insurance". However, it doesn't work, and I assume it has something to do with the any() function. Do I need to use regular expressions? (I hope not) main <-

Zellig Error Message

2011 Dec 16

Zellig Error Message

I'm trying to calculate predicted probabilities in R with Zelig and keep getting the following error. Can anyone help? > x.low <- setx(mod, type=1)Error in dta[complete.cases(mf), names(dta) %in% vars, drop = FALSE] : incorrect number of dimensions When I ran the model, I ran everything but the explanatory variable as a numeric variable. Now, I'm trying everything and no

read.xls question

2009 Mar 15

read.xls question

I'm an R newbie and had a question about the read.xls function. I've heard that this is often not a reliable function to use for importing data. However, I have created numerous xls files which contain information about voter turnout and macroeconomic indicators in India. I'm writing a paper on the relationship between economic growth and voter turnout. This is the command I use:

Merge two columns of a data frame

2011 Jun 06

Merge two columns of a data frame

I have the following data: prefix <- c("cheap", "budget") roots <- c("car insurance", "auto insurance") suffix <- c("quote", "quotes") prefix2 <- c("cheap", "budget") roots2 <- c("car insurance", "auto insurance") roots3 <- c("car insurance", "auto

Error: missing values where TRUE/FALSE needed

2011 Jun 09

Error: missing values where TRUE/FALSE needed

I'm writing a function and keep getting the following error message. myfunc <- function(lst) { lst <- list(roots = c("car insurance", "auto insurance"), roots2 = c("insurance"), prefix = c("cheap", "budget"), prefix2 = c("low cost"), suffix = c("quote", "quotes"), suffix2 = c("rate",

Subsetting Data

2011 Apr 28

Subsetting Data

I'm using the subset() function in R. dat <- data.frame(one=c(6,7,8,9,10), Number=c(5,15,13,1,13)) subset(dat, Number >= 10) However, I want to find the number of all rows who meet the Number>=10 condition. I've done this in the past with something like colSums or rowSums or another similar function. But I don't remember how to get the number of elements which meet that

similar to: Removing values from a string