similar to: Help with creating conditional categorical variables in R

Displaying 20 results from an estimated 1000 matches similar to: "Help with creating conditional categorical variables in R"

2012 Apr 24
1
Splitting data into test and train (80:20) kepping attributes similar
Hi, I am trying to do some predictive modeling around attrition and want to split the dataset into test and train (80:20) and keep the ratio of attritees:non attrites same. In my dataset the attrition indicator is coded as 0(for non-attritees) and 1 (for attritees) and I want to keep the ratio of 0's to 1 similar. I apologize for this trivial question but this is my second week with R.
2012 May 03
0
Validation of logistic models in R 2.12
Hi everyone, I am trying to validate a logistic model built in R. Not my version of R is 2.12 and I cannot install ROCR. I have gone to a point where I have the predicted values using the code: pred1 = predict(trainlogit1,testdata_1, type = "response") How do I proceed from here? Is there another way in which I can plot lift charts? My model output is: Call: glm(formula =
2012 Jun 12
0
How to create lift chart and ROC curve in R
Hi, I have a logit model built in R which I am trying to validate . Could you please help me with creating lift charts and area under ROC for the same. I am trying to use package ROCR The indicative code for this is : newlogit3<- glm(Attrition_ind ~ Time.in.AXP+ Age +as.factor(Increase.in.Rating.in.Current.year) +as.factor(Below.Market.Merit.Budget)+as.factor(Drop.in.AIA)
2012 Jun 14
0
Increase the word list in the sentiment analysis
Hi, I'am using the sentient analysis available in "R2.15". Now the challenge that I'am facing is that the Naïve Baye's has an inbuilt list of 6500 words in which it has been trained. So my question that can I increase the number of words on which this algorithm is trained? In case i can increase the list can someone tell me the procedure? Regards Nalin American Express
2012 Jun 18
0
Rule Based Text Classification
Hi, Is there any package in R which can help me do rule based text mining. -Nalin American Express made the following annotations on Mon Jun 18 2012 00:27:17 ****************************************************************************** "This message and any attachments are solely for the intended recipient and may contain confidential or privileged information. If you are not the
2012 May 04
0
FW: Validation of logistic models in R 2.12
Could someone help me on this please? -----Original Message----- From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf Of Dwaipayan Dasgupta Sent: Thursday, May 03, 2012 1:43 PM To: r-help at r-project.org Subject: [R] Validation of logistic models in R 2.12 Hi everyone, I am trying to validate a logistic model built in R. Not my version of R is 2.12 and I
2017 Oct 13
1
comparing two strings from data
Combining and completing the advice from Greg and Boris the complete solution is two lines: data_2 <- read.csv("excel_data.csv", stringsAsFactors = FALSE) match_list <- match( data_2$data1, data_2$data2 ) The vector match_list will have the matching position when it exists and NA's otherwise. Its length will be the same as the length of data_2$data1. You should get
2009 Apr 15
3
How to apply a function to all combinations of the values of 2 or more variables?
Hi, All Forgive me if this is a stupid newbie question. I'm having no luck googling an answer to this, probably because I don't know the right R terminology to frame my question. I want to know how to run an R function on each combination of the values of 2 or more variables. In SAS-speak this is multiple "BY variables" or "CLASS variables". In R I've figured
2017 Oct 12
0
comparing two strings from data
It's generally a very good idea to examine the structure of data after you have read it in. str(data2) would have shown you that read.csv() turned your strings into factors, and that's why the == operator no longer does what you think it does. use ... data_2 <- read.csv("excel_data.csv", stringsAsFactors = FALSE) ... to turn this off. Also, the %in% operator will achieve
2016 Jan 11
2
tftp-hpa features?
On Fri, Jan 8, 2016 at 5:46 PM, Geert Stappers via Syslinux <syslinux at zytor.com> wrote: > On Fri, Jan 08, 2016 at 03:40:37PM -0600, Matt Zagrabelny via Syslinux wrote: >> Greetings, >> >> Is this list (a|the best) place to make feature requests for tftp-hpa? > > Yes, it is. > from email header List-Id: For discussion of Syslinux and tftp-hpa
2017 Sep 04
1
Merge by Range in R
Hi,? I have two big data set.? data _1 :? > dim(data_1) [1] 15820 5 > head(data_1) ? ?Chromosome ?????Start????????End????????Feature GroupA_3 1: ? ? ? ????????chr1 521369 ?750000 ????chr1-0001 ? ?????0.170 2: ? ? ? ????????chr1 750001 ?800000 ????chr1-0002 ? ????-0.086 3: ? ? ? ????????chr1 800001 ?850000 ????chr1-0003 ? ?????0.006 4: ? ? ? ????????chr1 850001 ?900000 ????chr1-0004 ?
2015 Jun 08
2
help awk y shells en R
Hola buenas, a veces empleo desde R shells de unix, Existe alguna manera de utilizar estos shelss desde windows o el lenguaje awk. La idea es hacerlo siempre desde R, igual invoncando cygwin desde windows es posible. Pero no me queda claro Un abrazo y gracias por adelntado Javier #_____________________________________________________________________ # EJEMPLO, ¿Que habría que poner en #
2017 Oct 12
4
comparing two strings from data
Hi, I have two columns that contain numbers along with letters (as shown below) and have different lengths. Each entry in the first column is likely to be found in the second column at most once. For each entry of the first column, if that entry is found in the second column, I would like to get the corresponding index. For instance, if the first entry of the first column is 5th entry in the
2009 Apr 22
1
How to make a time series object from a data frame containing dates.
Hi, All I have a data frame like this. > names(x) [1] "month" "alloc_gb" One of the columns contains the dates I want to use: > data.class(x$month) [1] "yearmon" I would like to make a time series object out of this. However the ts function seems to require me to spell out the starting date. Is there a way to tell ts to use the date column I already
2003 Oct 30
1
$ in domain name, Samba 2.2.8a
> -----Original Message----- > From: Gerald (Jerry) Carter [mailto:jerry@samba.org] > Sent: Wednesday, October 01, 2003 11:49 AM > To: Stuckless, Colin 709 778-3815 > Cc: Andrew Bartlett; 'samba@lists.samba.org' > Subject: Re: [Samba] $ in domain name, Samba 2.2.8a > > Andrew Bartlett wrote: > > | Yes, we are stripping it out for security reasons. The >
2005 Oct 13
1
ocfs2 and Suse 9.2
I'm having some problems with ocfs2 and Suse 9.2. The documentation for ocfs2 says it comes with Suse 9.2. I've installed Suse and used YAST to get to SP2. # SPident -vv Summary (using 612 packages) Product/ServicePack conflict match update (shipped) SLES-9-x86_64 0 0% 293 47.9% 44 (1597 18.3%) SLES-9-x86_64-SP1 0 0% 110
2008 Dec 10
1
df returns weird values
Hi, I'm starting to play with glusterfs, and I'm having a problem with the df output. The value seems to be wrong. (on the client) /var/mule-client$ du -sh 584K . /var/mule-client$ df -h /var/mule-client/ Filesystem Size Used Avail Use% Mounted on glusterfs 254G 209G 32G 88% /var/mule-client (on the server) /var/mule$ du -sh 584K . Is it a known
2015 Jun 08
2
help awk y shells en R
Al final resulto más fácil de lo esperado. Hay que instalar cywin y utilizar los comandos de la siguiente manera system('C:/cygwin/bin/wc -l var_risco_2012.csv') Esto en principio funciona El 8 de junio de 2015, 17:41, Carlos Ortega <cof en qualityexcellence.es> escribió: > Hola, > > Mira esto: > >
2012 Apr 04
2
extract data
HI, I would like to extract data in a specific way.  For example, the rainfall data 0,0,1.5,0,0, 3,1,2.5,0,0,0,0, 2.3,0,0,0, 2.1,1.4,0,0,0, 3,2,1,0,0,0... data_1: 1.5, 2.3               ( a single nonzero data between zeros data) data_2: 3.1, 2.5, 2.1,1.4   ( two nonzero data between zeros data) data_3: 3,1,2.5, 3,2,1       ( three nonzero data between zeros data) Thank you so much for any
2004 Aug 11
0
NT domain and valid user problem
Hello everybody, I met a problem when I configure the samba for our company. The case is, 1. We have to use Windows Domain to authenticate the user. 2. Only a few domain users can access the samba server. 2. We wanna all the allowed domain users are mapped to 1 unix userid: checkpoint. The Window Domain auth is working perfect. And if we created the unix id same as the domain users,