Displaying 20 results from an estimated 100 matches similar to: "conditional mean between two data frames with different levels"
2002 Jul 03
2
grouping in scatterplot3d
Hi all,
I've been using the scatterplot3d function (from the scatterplot3d library) to create 3D plots. I was wondering whether there is
anyway to group the points according to a particular group variable. For example I used the plot function as
plot( factor( <GROUPINGVAR> ) , <PLOTVARS> )
to organise the results in a according to the grouping variable. I was wondering
2013 Jun 17
2
SVMREF infinte number of genes
dear all,
I am a student in cs college. I would like to know how to plot infinte
number of genes after using the svm.
the data set i have consists of
x which is a matrix of 39 cancer patients [rows] and 2000 gene names
[colmns]. each cell is the value of the gene for a particular patient.
there are two types of cancer people representedas factor y.
here is the code:
library(e1071)
#load
2011 Sep 02
2
misclassification rate
Hi users
I'm student who is struggling with basic R programming. Would you please
help me with this problem.
"My english is bad" I hope that my question is clear:
I have a matrix in wich there are two colmns( yp, yt)
Yp: predicted values from my model.
yt: true values ( my dependante variable y is a categorical;3 modalities
(0,1,2)
I don't know how to procede to calculate the
2010 Jun 07
3
Very OT: World Cup Statistics
Hello,
Sorry for the very Off TOpic post, but I need some data on past football
(soccer) world cups. I'd like to find (or calculate) the time to the
first goal of each match (where a goal was scored).
I''ve looked at the uefa website and can't find what I want, maybe
someone here can help ?
Thanks
Paul.
2013 May 07
3
Announce: cis-puppet 0.2.0 is now available
Overview
========
This module implements the Center for Internet Security (CIS) Security Configuration Benchmark for Red Hat Enterprise Linux 6 v.1.1.0 (avilable at http://benchmarks.cisecurity.org). Each scored control has been implemented as a class or a custom fact.
Installation
============
Please either:
- Clone git repo from https://github.com/arildjensen/cis-puppet
- Run "puppet
2009 Oct 11
3
passing field name parameter to function
Hi,
I am passing a data frame and field name to a function. I've figured out how
I can create the formula based on the passed in field name, but I'm
struggling to create a vector based in that field.
for example if I hard code with the actual field name
Y = df$Target, everything works fine.
but if I use the passed in parameter name, it doesn't give me what I want,
Y =
2009 Mar 10
3
reliability, scale scores in the psych package
Dear Professor Revelle and R-helpers,
This is a two-part question: the first part is general, and the second
is specific to the psych package.
First question: In the past I've constructed composite variables from
questionnaire data by using rowMeans(), and then correlating items
with the scale using cor() as an informal check for "bad" items. Over
the weekend I decided to take a
2010 Oct 30
1
summary.lm as a post-hoc test?
Let's assume that according to Anova(lm(y~a*b, data=d)) the a:b interaction is significant, and I would like to know if there are specific combinations of a and b levels that differ from the control group. Are there any caveats against simply looking at the p-values in the output generated by summary(lm(y~a*b,data=d))? The documentation for summary.lm says nothing about multiple comparison
2012 Dec 17
1
rms R code
Greetings, useRs.
Does anybody have replication of the examples from the RMS book by Harrell coded in R? I find that most the code does not work and it takes too much time to debug.
For example from p.276
> age.t <- w[,"age"]
> f.full <- lrm(cvd~scored(rx)+rcs(dtime,5)+age.t+wt.t+pf.t+hx+sbp+ekg.t+sz.t+sg.t+ap.t+bm+hg.t,x=T,y=T)
Error in model.frame.default(formula = cvd ~
2009 Oct 10
1
field names as function parameters
Hi,
I am passing a data frame and field name to a function. I've figured out how
I can create the formula based on the passed in field name, but I'm
struggling to create a vector based in that field.
for example if I hard code with the actual field name
Y = df$Target, everything works fine.
but if I use the passed in parameter name, it doesn't give me what I want,
Y = df$mytarget
2009 Jul 26
3
Question about rpart decision trees (being used to predict customer churn)
Hi,
I am using rpart decision trees to analyze customer churn. I am finding that
the decision trees created are not effective because they are not able to
recognize factors that influence churn. I have created an example situation
below. What do I need to do to for rpart to build a tree with the variable
experience? My guess is that this would happen if rpart used the loss matrix
while creating
2017 Jun 28
2
R package for scorecard development
Hello all,
Is there any R package that can develop a scorecard model for a binary
target variable?
More details:
I want to create a scorecard based on the raw data I have.
I have a binary target variable and a few numeric and character input
variables.
I want to bin the variables and assign a score to each of the bins.
Each subject will be scored based on the bin it falls in for each
2004 Dec 08
2
Center for Internet Security "scoring tool"
Hello,
Has anyone tried out the security scoring tool at
http://www.cisecurity.org/bench_freebsd.html?
Any thoughts or opinions?
Regards,
stheg
__________________________________
Do you Yahoo!?
Yahoo! Mail - 250MB free storage. Do more. Manage less.
http://info.mail.yahoo.com/mail_250
2006 Oct 29
3
Best practice - polymorphic or not
Hello I have this scenario (hope it is understandable):
I have a model to support fact messages of different types. This is
sport related and I have currently the following model design.
matchfacts (is the generic table connecting all matchfacts to for
instance a match/game, match has_many matchfacts).
Some matchfact-data is generic, like the time of fact and therefore the
data should be stored
2006 Sep 20
8
Understanding boost ?
Hi,
I''m confused about managing field boosting ...
I have set the :boost for the :name field in my docs to 10, via :boost
=> 10
Then I performed a search for ''keith'' over all fields via with
*:(keith*), expecting a doc with Keith in the :name field to come out on
top. But another doc with Keith mentioned in other fields (:comments,
:address) scored higher.
I
2014 Oct 23
2
Invoking the spam checker on the sieve script
Hi.
TL/DR version:
Is it advisable to invoke an spam checker from the sieve script, and then,
once the message is filtered, decide if should be moved to a certain spam
folder?
A bit more context on why I'm considering this:
I'm just a geek who wants to handle his own mail, but not a great experience
as a system administrator. I don't have any corporate setup, just a simple VPS
2007 Aug 23
4
scoring problem in acts_as_ferret
Hi,
I am using acts_as_ferret and have a problem with scoring. I would like
to organize it in such way that, if any of the searched terms fits, I
get 1.0 score as a result. I will explain it on the example.
I have in index:
a) "one two three four"
b) "one two three"
c) "one two"
d) "one"
When I search for "one" I would like to get 1.0 score for
2006 Feb 21
6
How to sum values across multiple variables using a wildcard?
I have a dataframe called "data" with 5 records (in rows) each of
which has been scored on each of many variables (in columns).
Five of the variables are named var1, var2, var3, var4, var5 using
headers. The other variables are named using other conventions.
I can create a new variable called var6 with the value 15 for each
record with this code:
> var6=var1+var2+var3+var4+var5
2006 Jan 19
8
I can''t figure out how to solve this one
I am creating an application to run a fishing tournament (see http://
www.ruby-forum.com/topic/51209 for a little background)
Now I am running into an issue trying to calculate a leader board.
each participate can enter multiple fish but only the largest fish
per a given species counts towards the overall score. My entries
table looks like this:
2006 Apr 23
1
Comparing GLMMs and GLMs with quasi-binomial errors?
Dear All,
I am analysing a dataset on levels of herbivory in seedlings in an
experimental setup in a rainforest.
I have seven classes/categories of seedling damage/herbivory that I want to
analyse, modelling each separately.
There are twenty maternal trees, with eight groups of seedlings around each.
Each tree has a TreeID, which I use as the random effect (blocking factor).
There are two