Dear list,
I have the following data set
id 1 2 3 4 5 6 7 8 9 10
disease a b c d e f g h i j
age 23 40 32 34 25 32 22 35 29 21
city NY LD NY SG NY LD VG SA LD SG
sex 1 1 2 2 2 2 1 1 1 2
treat_a y y y y
treat_b n n n n n n
ques1_1 2 4 5 6 8 3 1 2 4 5
ques1_2 6 4 5 12 10 9 8 4 5 7
ques1_3 17 23 32 25 14 24 23 22 32 29
ques2_1 4 7 9 10 6 8 5 7 8 9
ques2_2 8 9 10 12 17 19 14 21 22 19
ques2_3 23 18 19 20 23 24 26 28 29 22
ques3_1 5 7 9 1 4 7 9 8 10 5
ques3_2 34 35 32 23 31 29 27 25 32 33
ques3_3 29 33 27 25 27 23 24 29 27 24
where the first row is the header row in a dataframe. First I want to merge the
two variables
treat_a and treat_b to a new variable called "treat" which will be
given n if it's left blank
in the variable treat_a and y if it's left blank in treat_b. The new data
set will look like
id 1 2 3 4 5 6 7 8 9 10
disease a b c d e f g h i j
age 23 40 32 34 25 32 22 35 29 21
city NY LD NY SG NY LD VG SA LD SG
sex 1 1 2 2 2 2 1 1 1 2
treat n n n y y y n n y n
ques1_1 2 4 5 6 8 3 1 2 4 5
ques1_2 6 4 5 12 10 9 8 4 5 7
ques1_3 17 23 32 25 14 24 23 22 32 29
ques2_1 4 7 9 10 6 8 5 7 8 9
ques2_2 8 9 10 12 17 19 14 21 22 19
ques2_3 23 18 19 20 23 24 26 28 29 22
ques3_1 5 7 9 1 4 7 9 8 10 5
ques3_2 34 35 32 23 31 29 27 25 32 33
ques3_3 29 33 27 25 27 23 24 29 27 24
Now I want to reshape the data in a long format with target output
id disease age city sex treat ques
1 a 23 NY 1 n 1_1
1 a 23 NY 1 n 1_2
1 a 23 NY 1 n 1_3
1 a 23 NY 1 n 2_1
1 a 23 NY 1 n 2_2
1 a 23 NY 1 n 2_3
1 a 23 NY 1 n 3_1
1 a 23 NY 1 n 3_2
1 a 23 NY 1 n 3_3
2 b 40 LD 1 n 1_1
2 b 40 LD 1 n 1_2
2 b 40 LD 1 n 1_3
2 b 40 LD 1 n 2_1
2 b 40 LD 1 n 2_2
2 b 40 LD 1 n 2_3
2 b 40 LD 1 n 3_1
2 b 40 LD 1 n 3_2
2 b 40 LD 1 n 3_3
.
.
.
10 j 21 SG 2 n 3_3
How can I do this in R?
Thanks alot for any help,
Tom
---------------------------------
Jmfr pris p flygbiljetter och hotellrum:
http://shopping.yahoo.se/c-169901-resor-biljetter.html
[[alternative HTML version deleted]]