Tara Adcock
2017-Jun-01 10:02 UTC
[R] Data import R: some explanatory variables not showing up correctly in summary
Hi, I have a question regarding data importing into R. When I import my data into R and review the summary, some of my explanatory variables are being reported as if instead of being one variable, they are two with the same name. See below for an example; Behav person Behav dog Position **combination : 38 combination : 4** Bank :372 **combination : 7 combination : 4** **Island :119** fast :123 fast : 15 **Island : 11** slow :445 slow : 95 Land : 3 stat :111 stat : 14 Water :230 Also, all of the distances I have imported are showing up in the summary along with a line entitled "other". However, I haven't used any other distances? Distance Distance.dog 2-10m :184 <50m : 35 <50m :156 2-10m : 27 10-20m :156 20-30m : 23 20-30m : 91 30-40m : 16 40-50m : 57 10-20m : 13 **(Other): 82 (Other): 18** I have checked my data sheet over and over again and I think standardised the data, but the issue keeps arising. I'm assuming I need to clean the data set but as a nearly complete novice in R I am not certain how to do this. Any help at all with this would be much appreciated. Thanks so much. Kind Regards, Tara Adcock. [[alternative HTML version deleted]]
Rui Barradas
2017-Jun-01 15:29 UTC
[R] Data import R: some explanatory variables not showing up correctly in summary
Hello, In order for us to help we need to know how you've imported your data. What was the file type? What instructions have you used to import it? Did you use base R or a package? Give us a minimal but complete code example that can reproduce your situation. Hope this helps, Rui Barradas Em 01-06-2017 11:02, Tara Adcock escreveu:> Hi, > > I have a question regarding data importing into R. > > When I import my data into R and review the summary, some of my explanatory variables are being reported as if instead of being one variable, they are two with the same name. See below for an example; > > Behav person Behav dog Position > **combination : 38 combination : 4** Bank :372 > **combination : 7 combination : 4** **Island :119** > fast :123 fast : 15 **Island : 11** > slow :445 slow : 95 Land : 3 > stat :111 stat : 14 Water :230 > > Also, all of the distances I have imported are showing up in the summary along with a line entitled "other". However, I haven't used any other distances? > > Distance Distance.dog > 2-10m :184 <50m : 35 > <50m :156 2-10m : 27 > 10-20m :156 20-30m : 23 > 20-30m : 91 30-40m : 16 > 40-50m : 57 10-20m : 13 > **(Other): 82 (Other): 18** > > I have checked my data sheet over and over again and I think standardised the data, but the issue keeps arising. I'm assuming I need to clean the data set but as a nearly complete novice in R I am not certain how to do this. Any help at all with this would be much appreciated. Thanks so much. > > Kind Regards, > > Tara Adcock. > > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >
Ulrik Stervbo
2017-Jun-01 15:49 UTC
[R] Data import R: some explanatory variables not showing up correctly in summary
Hi Tara, It seems that you categorise and count for each category. Could it be that the method you use puts everything that doesn't match the predefined categories in Other? I'm only guessing because without a minimal reproducible example it's difficult to do anything else. Best wishes Ulrik Rui Barradas <ruipbarradas at sapo.pt> schrieb am Do., 1. Juni 2017, 17:30:> Hello, > > In order for us to help we need to know how you've imported your data. > What was the file type? What instructions have you used to import it? > Did you use base R or a package? > Give us a minimal but complete code example that can reproduce your > situation. > > Hope this helps, > > Rui Barradas > > Em 01-06-2017 11:02, Tara Adcock escreveu: > > Hi, > > > > I have a question regarding data importing into R. > > > > When I import my data into R and review the summary, some of my > explanatory variables are being reported as if instead of being one > variable, they are two with the same name. See below for an example; > > > > Behav person Behav dog Position > > **combination : 38 combination : 4** Bank :372 > > **combination : 7 combination : 4** **Island :119** > > fast :123 fast : 15 **Island : 11** > > slow :445 slow : 95 Land : 3 > > stat :111 stat : 14 Water :230 > > > > Also, all of the distances I have imported are showing up in the summary > along with a line entitled "other". However, I haven't used any other > distances? > > > > Distance Distance.dog > > 2-10m :184 <50m : 35 > > <50m :156 2-10m : 27 > > 10-20m :156 20-30m : 23 > > 20-30m : 91 30-40m : 16 > > 40-50m : 57 10-20m : 13 > > **(Other): 82 (Other): 18** > > > > I have checked my data sheet over and over again and I think > standardised the data, but the issue keeps arising. I'm assuming I need to > clean the data set but as a nearly complete novice in R I am not certain > how to do this. Any help at all with this would be much appreciated. Thanks > so much. > > > > Kind Regards, > > > > Tara Adcock. > > > > > > [[alternative HTML version deleted]] > > > > ______________________________________________ > > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > > https://stat.ethz.ch/mailman/listinfo/r-help > > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > > and provide commented, minimal, self-contained, reproducible code. > > > > ______________________________________________ > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >[[alternative HTML version deleted]]
Charles C. Berry
2017-Jun-01 17:06 UTC
[R] Data import R: some explanatory variables not showing up correctly in summary
On Thu, 1 Jun 2017, Rui Barradas wrote:> Hello, > > In order for us to help we need to know how you've imported your data. What > was the file type? What instructions have you used to import it? Did you use > base R or a package? > Give us a minimal but complete code example that can reproduce your > situation. > > Hope this helps, > > Rui BarradasAbsolutely. It would also help to see what the unique values of each column *really* are. To that end run and report the results of this: lapply(your.data.frame, function(x) unique(as.character(x))) I'll bet you have both "combination" and "combination " as values or something similar where two different strings look to your eye to be the same when printed by summary(). HTH, Chuck> > Em 01-06-2017 11:02, Tara Adcock escreveu: >> Hi, >> >> I have a question regarding data importing into R. >> >> When I import my data into R and review the summary, some of my explanatory >> variables are being reported as if instead of being one variable, they are >> two with the same name. See below for an example; >> >> Behav person Behav dog Position >> **combination : 38 combination : 4** Bank :372 >> **combination : 7 combination : 4** **Island :119** >> fast :123 fast : 15 **Island : 11** >> slow :445 slow : 95 Land : 3 >> stat :111 stat : 14 Water :230 >> >> Also, all of the distances I have imported are showing up in the summary >> along with a line entitled "other". However, I haven't used any other >> distances? >> >> Distance Distance.dog >> 2-10m :184 <50m : 35 >> <50m :156 2-10m : 27 >> 10-20m :156 20-30m : 23 >> 20-30m : 91 30-40m : 16 >> 40-50m : 57 10-20m : 13 >> **(Other): 82 (Other): 18** >> >> I have checked my data sheet over and over again and I think standardised >> the data, but the issue keeps arising. I'm assuming I need to clean the >> data set but as a nearly complete novice in R I am not certain how to do >> this. Any help at all with this would be much appreciated. Thanks so much. >> >> Kind Regards, >> >> Tara Adcock. >> >> >> [[alternative HTML version deleted]] >> >> ______________________________________________ >> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see >> https://stat.ethz.ch/mailman/listinfo/r-help >> PLEASE do read the posting guide >> http://www.R-project.org/posting-guide.html >> and provide commented, minimal, self-contained, reproducible code. >> >Charles C. Berry Dept of Family Medicine & Public Health cberry at ucsd edu UC San Diego / La Jolla, CA 92093-0901 http://biostat.ucsd.edu/ccberry.htm
Apparently Analagous Threads
- Data import R: some explanatory variables not showing up correctly in summary
- Data import R: some explanatory variables not showing up correctly in summary
- Data import R: some explanatory variables not showing up correctly in summary
- Data import R: some explanatory variables not showing up correctly in summary
- Data import R: some explanatory variables not showing up correctly in summary