thr3ads.net - R help - [R] How does predict.lm work? [Sep 2008]

If this information is useful, please help other people find it:
Share via:

Williams, Robin

2008-Sep-09 14:59 UTC

[R] How does predict.lm work?

Hi, 
  Please could someone explain how this element of predict.lm works?
From the help file 
`
newdata	  
An optional data frame in which to look for variables with which to
predict. If omitted, the fitted values are used.
' 
  Does this dataframe (newdata) need to have the same variable names as
was used in the original data frame used to fit the model? Or will R
just look across consecutive columns of newdata, and apply them to the
call as appropriate?
  For example, if I have fitted a model with four variables
(x1,x2,x3,x4) in my original dataframe, and then have a second dataframe
which I want to supply to the newdata argument in predict.lm with
variable names (x5, x6, x7, x8), do I need to change the variable names
in my newdata dataframe to match those of the original dataframe? Or
will R treat x5 as x1, x6 as x2, etc, when using predict.lm? 
  I would like to know so that I can design the structure of some
somewhat larger dataframes in a manner which will make using predict.lm
straight forward and quick.
Hope this makes sense.
Many thanks for any help. 
   Robin Williams
Met Office summer intern - Health Forecasting
robin.williams@metoffice.gov.uk 
 

	[[alternative HTML version deleted]]

Gabor Grothendieck

2008-Sep-09 15:03 UTC

head link

[R] How does predict.lm work?

Just try it:
> BOD # built in data frame  Time demand
1    1    8.3
2    2   10.3
3    3   19.0
4    4   16.0
5    5   15.6
6    7   19.8> BOD.lm <- lm(demand ~ Time, BOD)
> predict(BOD.lm, list(Time = 10))       1
25.73571> predict(BOD.lm, list(10))Error in eval(expr, envir, enclos) : object "Time" not found


On Tue, Sep 9, 2008 at 10:59 AM, Williams, Robin
<robin.williams at metoffice.gov.uk> wrote:> Hi,
>  Please could someone explain how this element of predict.lm works?
> >From the help file
> `
> newdata
> An optional data frame in which to look for variables with which to
> predict. If omitted, the fitted values are used.
> '
>  Does this dataframe (newdata) need to have the same variable names as
> was used in the original data frame used to fit the model? Or will R
> just look across consecutive columns of newdata, and apply them to the
> call as appropriate?
>  For example, if I have fitted a model with four variables
> (x1,x2,x3,x4) in my original dataframe, and then have a second dataframe
> which I want to supply to the newdata argument in predict.lm with
> variable names (x5, x6, x7, x8), do I need to change the variable names
> in my newdata dataframe to match those of the original dataframe? Or
> will R treat x5 as x1, x6 as x2, etc, when using predict.lm?
>  I would like to know so that I can design the structure of some
> somewhat larger dataframes in a manner which will make using predict.lm
> straight forward and quick.
> Hope this makes sense.
> Many thanks for any help.
>   Robin Williams
> Met Office summer intern - Health Forecasting
> robin.williams at metoffice.gov.uk
>
>
>        [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

Marc Schwartz

2008-Sep-09 15:12 UTC

head link

[R] How does predict.lm work?

on 09/09/2008 09:59 AM Williams, Robin wrote:> Hi, 
>   Please could someone explain how this element of predict.lm works?
>>From the help file 
> `
> newdata	  
> An optional data frame in which to look for variables with which to
> predict. If omitted, the fitted values are used.
> ' 
>   Does this dataframe (newdata) need to have the same variable names as
> was used in the original data frame used to fit the model? 
Yes. Also, see the Note in ?predict.lm:

"Variables are first looked for in newdata and then searched for in the
usual way (which will include the environment of the formula used in the
fit). A warning will be given if the variables found are not of the same
length as those in newdata if it was supplied."


It also says "Variables", not columns.
> Or will R
> just look across consecutive columns of newdata, and apply them to the
> call as appropriate?
No.
>   For example, if I have fitted a model with four variables
> (x1,x2,x3,x4) in my original dataframe, and then have a second dataframe
> which I want to supply to the newdata argument in predict.lm with
> variable names (x5, x6, x7, x8), do I need to change the variable names
> in my newdata dataframe to match those of the original dataframe? 
Yes.
> Or
> will R treat x5 as x1, x6 as x2, etc, when using predict.lm? 
>   I would like to know so that I can design the structure of some
> somewhat larger dataframes in a manner which will make using predict.lm
> straight forward and quick.
> Hope this makes sense.
> Many thanks for any help. 
HTH,

Marc Schwartz

Reasonably Related Threads

Search for more seemingly similar threads

R help - Sep 2008 - How does predict.lm work?

[R] How does predict.lm work?

[R] How does predict.lm work?

[R] How does predict.lm work?

Reasonably Related Threads