Athmakuru Prasad
2018-May-25 04:08 UTC
[R] Urgent - R help - Multivariate - Naive Bayes code for R
Friends, I am doing a URL classification, based on certain key words whether it contains an executive information or not. I have already went through 50K URL's and identified the key words and made it as 0, 1 ( 0 - do not have the key word and 1 - have the key word) and 0- do not contain executive information 1 - contains executive information. A sample set of data is shown below. DomainID Domain LinkID Raw_Link Cleansed_Link Biz_name Address1 City State PostalCode Address_Page_flag Executive_page_flag collections other_keywords conditions policy story history brand login job career who company people staff Board management team terms privacy shop gallery News location site Sitemap page Content Event blog categories Services Index Product Reviews Testimonials about contact Home_page Link_Len LinkCount ExecWordCount ExecWordRatio Category 250842730 www.aaronwomenscenterhouston.com 250842730-1 http://www.aaronwomenscenterhouston.com aaronwomenscenterhouston.com AARON WOMEN?S CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 28 9 1 0.48% Clinic 250842730 www.aaronwomenscenterhouston.com 250842730-2 http://www.aaronwomenscenterhouston.com/surgical-termination aaronwomenscenterhouston.com/surgical-termination AARON WOMEN?S CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 49 9 1 0.65% Clinic 250842730 www.aaronwomenscenterhouston.com 250842730-3 http://www.aaronwomenscenterhouston.com/non-surgical-termination aaronwomenscenterhouston.com/non-surgical-termination AARON WOMEN?S CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 53 9 1 0.79% Clinic 250842730 www.aaronwomenscenterhouston.com 250842730-4 http://www.aaronwomenscenterhouston.com/birth-control aaronwomenscenterhouston.com/birth-control AARON WOMEN?S CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 9 1 0.59% Clinic 250842730 www.aaronwomenscenterhouston.com 250842730-5 http://www.aaronwomenscenterhouston.com/late-term-termination aaronwomenscenterhouston.com/late-term-termination AARON WOMEN?S CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 9 2 0.71% Clinic 250842730 www.aaronwomenscenterhouston.com 250842730-6 http://www.aaronwomenscenterhouston.com/patient-forms aaronwomenscenterhouston.com/patient-forms AARON WOMEN?S CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 9 1 0.78% Clinic I understand that i need to use Multivariate Bernouli classification to segregate the URL's......I am struggling to get an appropriate R code for the same.... Any help in providing and R code for this would be greatly appreciated. Cheers ALN [[alternative HTML version deleted]]
PIKAL Petr
2018-May-25 06:54 UTC
[R] Urgent - R help - Multivariate - Naive Bayes code for R
Hi Within https://rseek.org/ I got quite a lot pages from query Multivariate Bernouli classification Maybe some of them could help you. Cheers Petr Tento e-mail a jak?koliv k n?mu p?ipojen? dokumenty jsou d?v?rn? a podl?haj? tomuto pr?vn? z?vazn?mu prohl??en? o vylou?en? odpov?dnosti: https://www.precheza.cz/01-dovetek/ | This email and any documents attached to it may be confidential and are subject to the legally binding disclaimer: https://www.precheza.cz/en/01-disclaimer/> -----Original Message----- > From: R-help [mailto:r-help-bounces at r-project.org] On Behalf Of Athmakuru > Prasad > Sent: Friday, May 25, 2018 6:08 AM > To: r-help at r-project.org > Subject: [R] Urgent - R help - Multivariate - Naive Bayes code for R > > Friends, > > I am doing a URL classification, based on certain key words whether it contains > an executive information or not. I have already went through 50K URL's and > identified the key words and made it as 0, 1 ( 0 - do not have the key word and > 1 - have the key word) and 0- do not contain executive information 1 - contains > executive information. > > A sample set of data is shown below. > > DomainID Domain LinkID Raw_Link Cleansed_Link Biz_name Address1 City > State PostalCode Address_Page_flag Executive_page_flag collections > other_keywords conditions policy story history brand login job career who > company people staff Board management team terms privacy shop gallery > News location site Sitemap page Content Event blog categories Services Index > Product Reviews Testimonials about contact Home_page Link_Len LinkCount > ExecWordCount ExecWordRatio Category > 250842730 www.aaronwomenscenterhouston.com 250842730-1 > http://www.aaronwomenscenterhouston.com > aaronwomenscenterhouston.com AARON WOMEN?S CENTER 2505 North > Shepherd Dr Houston TX 77008 0 0 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 28 9 1 0.48% Clinic > 250842730 www.aaronwomenscenterhouston.com 250842730-2 > http://www.aaronwomenscenterhouston.com/surgical-termination > aaronwomenscenterhouston.com/surgical-termination AARON WOMEN?S > CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 49 9 1 0.65% Clinic > 250842730 www.aaronwomenscenterhouston.com 250842730-3 > http://www.aaronwomenscenterhouston.com/non-surgical-termination > aaronwomenscenterhouston.com/non-surgical-termination AARON WOMEN?S > CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 53 9 1 0.79% Clinic > 250842730 www.aaronwomenscenterhouston.com 250842730-4 > http://www.aaronwomenscenterhouston.com/birth-control > aaronwomenscenterhouston.com/birth-control AARON WOMEN?S CENTER > 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 > 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 9 1 0.59% Clinic > 250842730 www.aaronwomenscenterhouston.com 250842730-5 > http://www.aaronwomenscenterhouston.com/late-term-termination > aaronwomenscenterhouston.com/late-term-termination AARON WOMEN?S > CENTER 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 > 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 50 9 2 0.71% Clinic > 250842730 www.aaronwomenscenterhouston.com 250842730-6 > http://www.aaronwomenscenterhouston.com/patient-forms > aaronwomenscenterhouston.com/patient-forms AARON WOMEN?S CENTER > 2505 North Shepherd Dr Houston TX 77008 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 > 0 0 0 0 0 > 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 42 9 1 0.78% Clinic > > > I understand that i need to use Multivariate Bernouli classification to segregate > the URL's......I am struggling to get an appropriate R code for the same.... > > Any help in providing and R code for this would be greatly appreciated. > > Cheers > ALN > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.________________________________ Tento e-mail a jak?koliv k n?mu p?ipojen? dokumenty jsou d?v?rn? a jsou ur?eny pouze jeho adres?t?m. Jestli?e jste obdr?el(a) tento e-mail omylem, informujte laskav? neprodlen? jeho odes?latele. Obsah tohoto emailu i s p??lohami a jeho kopie vyma?te ze sv?ho syst?mu. Nejste-li zam??len?m adres?tem tohoto emailu, nejste opr?vn?ni tento email jakkoliv u??vat, roz?i?ovat, kop?rovat ?i zve?ej?ovat. Odes?latel e-mailu neodpov?d? za eventu?ln? ?kodu zp?sobenou modifikacemi ?i zpo?d?n?m p?enosu e-mailu. V p??pad?, ?e je tento e-mail sou??st? obchodn?ho jedn?n?: - vyhrazuje si odes?latel pr?vo ukon?it kdykoliv jedn?n? o uzav?en? smlouvy, a to z jak?hokoliv d?vodu i bez uveden? d?vodu. - a obsahuje-li nab?dku, je adres?t opr?vn?n nab?dku bezodkladn? p?ijmout; Odes?latel tohoto e-mailu (nab?dky) vylu?uje p?ijet? nab?dky ze strany p??jemce s dodatkem ?i odchylkou. - trv? odes?latel na tom, ?e p??slu?n? smlouva je uzav?ena teprve v?slovn?m dosa?en?m shody na v?ech jej?ch n?le?itostech. - odes?latel tohoto emailu informuje, ?e nen? opr?vn?n uzav?rat za spole?nost ??dn? smlouvy s v?jimkou p??pad?, kdy k tomu byl p?semn? zmocn?n nebo p?semn? pov??en a takov? pov??en? nebo pln? moc byly adres?tovi tohoto emailu p??padn? osob?, kterou adres?t zastupuje, p?edlo?eny nebo jejich existence je adres?tovi ?i osob? j?m zastoupen? zn?m?. This e-mail and any documents attached to it may be confidential and are intended only for its intended recipients. If you received this e-mail by mistake, please immediately inform its sender. Delete the contents of this e-mail with all attachments and its copies from your system. If you are not the intended recipient of this e-mail, you are not authorized to use, disseminate, copy or disclose this e-mail in any manner. The sender of this e-mail shall not be liable for any possible damage caused by modifications of the e-mail or by delay with transfer of the email. In case that this e-mail forms part of business dealings: - the sender reserves the right to end negotiations about entering into a contract in any time, for any reason, and without stating any reasoning. - if the e-mail contains an offer, the recipient is entitled to immediately accept such offer; The sender of this e-mail (offer) excludes any acceptance of the offer on the part of the recipient containing any amendment or variation. - the sender insists on that the respective contract is concluded only upon an express mutual agreement on all its aspects. - the sender of this e-mail informs that he/she is not authorized to enter into any contracts on behalf of the company except for cases in which he/she is expressly authorized to do so in writing, and such authorization or power of attorney is submitted to the recipient or the person represented by the recipient, or the existence of such authorization is known to the recipient of the person represented by the recipient.