thr3ads.net - similar to: "Help with creating conditional categorical variables in R"

Displaying 20 results from an estimated 1000 matches similar to: "Help with creating conditional categorical variables in R"

Splitting data into test and train (80:20) kepping attributes similar

2012 Apr 24

Splitting data into test and train (80:20) kepping attributes similar

Hi, I am trying to do some predictive modeling around attrition and want to split the dataset into test and train (80:20) and keep the ratio of attritees:non attrites same. In my dataset the attrition indicator is coded as 0(for non-attritees) and 1 (for attritees) and I want to keep the ratio of 0's to 1 similar. I apologize for this trivial question but this is my second week with R.

Validation of logistic models in R 2.12

2012 May 03

Validation of logistic models in R 2.12

Hi everyone, I am trying to validate a logistic model built in R. Not my version of R is 2.12 and I cannot install ROCR. I have gone to a point where I have the predicted values using the code: pred1 = predict(trainlogit1,testdata_1, type = "response") How do I proceed from here? Is there another way in which I can plot lift charts? My model output is: Call: glm(formula =

How to create lift chart and ROC curve in R

2012 Jun 12

How to create lift chart and ROC curve in R

Hi, I have a logit model built in R which I am trying to validate . Could you please help me with creating lift charts and area under ROC for the same. I am trying to use package ROCR The indicative code for this is : newlogit3<- glm(Attrition_ind ~ Time.in.AXP+ Age +as.factor(Increase.in.Rating.in.Current.year) +as.factor(Below.Market.Merit.Budget)+as.factor(Drop.in.AIA)

Increase the word list in the sentiment analysis

2012 Jun 14

Increase the word list in the sentiment analysis

Hi, I'am using the sentient analysis available in "R2.15". Now the challenge that I'am facing is that the Naïve Baye's has an inbuilt list of 6500 words in which it has been trained. So my question that can I increase the number of words on which this algorithm is trained? In case i can increase the list can someone tell me the procedure? Regards Nalin American Express

Rule Based Text Classification

2012 Jun 18

Rule Based Text Classification

Hi, Is there any package in R which can help me do rule based text mining. -Nalin American Express made the following annotations on Mon Jun 18 2012 00:27:17 ****************************************************************************** "This message and any attachments are solely for the intended recipient and may contain confidential or privileged information. If you are not the

FW: Validation of logistic models in R 2.12

2012 May 04

FW: Validation of logistic models in R 2.12

Could someone help me on this please? -----Original Message----- From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf Of Dwaipayan Dasgupta Sent: Thursday, May 03, 2012 1:43 PM To: r-help at r-project.org Subject: [R] Validation of logistic models in R 2.12 Hi everyone, I am trying to validate a logistic model built in R. Not my version of R is 2.12 and I

comparing two strings from data

2017 Oct 13

comparing two strings from data

Combining and completing the advice from Greg and Boris the complete solution is two lines: data_2 <- read.csv("excel_data.csv", stringsAsFactors = FALSE) match_list <- match( data_2$data1, data_2$data2 ) The vector match_list will have the matching position when it exists and NA's otherwise. Its length will be the same as the length of data_2$data1. You should get

How to apply a function to all combinations of the values of 2 or more variables?

2009 Apr 15

How to apply a function to all combinations of the values of 2 or more variables?

Hi, All Forgive me if this is a stupid newbie question. I'm having no luck googling an answer to this, probably because I don't know the right R terminology to frame my question. I want to know how to run an R function on each combination of the values of 2 or more variables. In SAS-speak this is multiple "BY variables" or "CLASS variables". In R I've figured

comparing two strings from data

2017 Oct 12

comparing two strings from data

It's generally a very good idea to examine the structure of data after you have read it in. str(data2) would have shown you that read.csv() turned your strings into factors, and that's why the == operator no longer does what you think it does. use ... data_2 <- read.csv("excel_data.csv", stringsAsFactors = FALSE) ... to turn this off. Also, the %in% operator will achieve

tftp-hpa features?

2016 Jan 11

tftp-hpa features?

On Fri, Jan 8, 2016 at 5:46 PM, Geert Stappers via Syslinux <syslinux at zytor.com> wrote: > On Fri, Jan 08, 2016 at 03:40:37PM -0600, Matt Zagrabelny via Syslinux wrote: >> Greetings, >> >> Is this list (a|the best) place to make feature requests for tftp-hpa? > > Yes, it is. > from email header List-Id: For discussion of Syslinux and tftp-hpa

Merge by Range in R

2017 Sep 04

Merge by Range in R

Hi,? I have two big data set.? data _1 :? > dim(data_1) [1] 15820 5 > head(data_1) ? ?Chromosome ?????Start????????End????????Feature GroupA_3 1: ? ? ? ????????chr1 521369 ?750000 ????chr1-0001 ? ?????0.170 2: ? ? ? ????????chr1 750001 ?800000 ????chr1-0002 ? ????-0.086 3: ? ? ? ????????chr1 800001 ?850000 ????chr1-0003 ? ?????0.006 4: ? ? ? ????????chr1 850001 ?900000 ????chr1-0004 ?

help awk y shells en R

2015 Jun 08

help awk y shells en R

Hola buenas, a veces empleo desde R shells de unix, Existe alguna manera de utilizar estos shelss desde windows o el lenguaje awk. La idea es hacerlo siempre desde R, igual invoncando cygwin desde windows es posible. Pero no me queda claro Un abrazo y gracias por adelntado Javier #_____________________________________________________________________ # EJEMPLO, ¿Que habría que poner en #

comparing two strings from data

2017 Oct 12

comparing two strings from data

Hi, I have two columns that contain numbers along with letters (as shown below) and have different lengths. Each entry in the first column is likely to be found in the second column at most once. For each entry of the first column, if that entry is found in the second column, I would like to get the corresponding index. For instance, if the first entry of the first column is 5th entry in the

How to make a time series object from a data frame containing dates.

2009 Apr 22

How to make a time series object from a data frame containing dates.

Hi, All I have a data frame like this. > names(x) [1] "month" "alloc_gb" One of the columns contains the dates I want to use: > data.class(x$month) [1] "yearmon" I would like to make a time series object out of this. However the ts function seems to require me to spell out the starting date. Is there a way to tell ts to use the date column I already

$ in domain name, Samba 2.2.8a

2003 Oct 30

$ in domain name, Samba 2.2.8a

> -----Original Message----- > From: Gerald (Jerry) Carter [mailto:jerry@samba.org] > Sent: Wednesday, October 01, 2003 11:49 AM > To: Stuckless, Colin 709 778-3815 > Cc: Andrew Bartlett; 'samba@lists.samba.org' > Subject: Re: [Samba] $ in domain name, Samba 2.2.8a > > Andrew Bartlett wrote: > > | Yes, we are stripping it out for security reasons. The >

ocfs2 and Suse 9.2

2005 Oct 13

ocfs2 and Suse 9.2

I'm having some problems with ocfs2 and Suse 9.2. The documentation for ocfs2 says it comes with Suse 9.2. I've installed Suse and used YAST to get to SP2. # SPident -vv Summary (using 612 packages) Product/ServicePack conflict match update (shipped) SLES-9-x86_64 0 0% 293 47.9% 44 (1597 18.3%) SLES-9-x86_64-SP1 0 0% 110

df returns weird values

2008 Dec 10

df returns weird values

Hi, I'm starting to play with glusterfs, and I'm having a problem with the df output. The value seems to be wrong. (on the client) /var/mule-client$ du -sh 584K . /var/mule-client$ df -h /var/mule-client/ Filesystem Size Used Avail Use% Mounted on glusterfs 254G 209G 32G 88% /var/mule-client (on the server) /var/mule$ du -sh 584K . Is it a known

help awk y shells en R

2015 Jun 08

help awk y shells en R

Al final resulto más fácil de lo esperado. Hay que instalar cywin y utilizar los comandos de la siguiente manera system('C:/cygwin/bin/wc -l var_risco_2012.csv') Esto en principio funciona El 8 de junio de 2015, 17:41, Carlos Ortega <cof en qualityexcellence.es> escribió: > Hola, > > Mira esto: > >

extract data

2012 Apr 04

extract data

HI, I would like to extract data in a specific way. For example, the rainfall data 0,0,1.5,0,0, 3,1,2.5,0,0,0,0, 2.3,0,0,0, 2.1,1.4,0,0,0, 3,2,1,0,0,0... data_1: 1.5, 2.3 ( a single nonzero data between zeros data) data_2: 3.1, 2.5, 2.1,1.4 ( two nonzero data between zeros data) data_3: 3,1,2.5, 3,2,1 ( three nonzero data between zeros data) Thank you so much for any

NT domain and valid user problem

2004 Aug 11

NT domain and valid user problem

Hello everybody, I met a problem when I configure the samba for our company. The case is, 1. We have to use Windows Domain to authenticate the user. 2. Only a few domain users can access the samba server. 2. We wanna all the allowed domain users are mapped to 1 unix userid: checkpoint. The Window Domain auth is working perfect. And if we created the unix id same as the domain users,

similar to: Help with creating conditional categorical variables in R