Dear R-Users, Did anyone follow more closely the SARS Cov-2 lineages? I have done a quick check of Cov-2 mutations on the list downloaded from NCBI (see GitHub page below); but it seems that the list contains the cumulative mutations only for B.1 => B.1.1, but not after the B.1.1 branch: # B.1 => B.1.1 seems cumulative diff.lineage("B.1.1", "B.1", data=z) # but B.1.1 => B.1.1.529 is NOT cumulative anymore; diff.lineage("B.1.1.529", "B.1.1", data=z) diff.lineage("B.1.1.529", "BA.2", data=z) diff.lineage("B.1.1.529", "BA.5", data=z) # Column id: B(oth) = present in both lineages: ??????? V?? Mutation??? P??? AA Pos AAi AAm Polymorphism id 899 B.1.1 nsp3:F106F nsp3 F106F 106?? F F???????? TRUE? B 900 B.1.1 RdRp:P323L RdRp P323L 323?? P L??????? FALSE? B 901 B.1.1??? S:D614G??? S D614G 614?? D G??????? FALSE? B 902 B.1.1??? N:R203K??? N R203K 203?? R K??????? FALSE? 1 903 B.1.1??? N:R203R??? N R203R 203?? R R???????? TRUE? 1 904 B.1.1??? N:G204R??? N G204R 204?? G R??????? FALSE? 1 896?? B.1 nsp3:F106F nsp3 F106F 106?? F F???????? TRUE? B 897?? B.1 RdRp:P323L RdRp P323L 323?? P L??????? FALSE? B 898?? B.1??? S:D614G??? S D614G 614?? D G??????? FALSE? B # B.1.1.529 and branches do not have any of the defining mutations of B.1.1; I have uploaded the code on GitHub: https://github.com/discoleo/R/blob/master/Stat/Infx/Cov2.Variants.R 1.) Does anyone have a better picture of what is going on? The sub-variants should have cumulative mutations. This should be the logic for the sub-lineages and I deduce it also by the data/post on the GitHub pango page: https://github.com/cov-lineages/pango-designation/issues/361 2.) Cumulative List It maybe that NCBI kept only the new mutations, as the number of mutations increased. Does anyone know if there is a full cumulative list? Alternatively, there might be a list or package with the full lineage encoding. There is a list on the Pango GitHub project, but I hope to skip at least this step; the synonyms in the NCBI file seem uglier to process. Note: This question may be more oriented towards Bioconductor; but I haven't found any real Covid packages on Bioconductor. Thank you very much for any help. Sincerely, Leonard