Displaying 3 results from an estimated 3 matches for "ackr4".
2017 Aug 25
1
splitting a dataframe in R based on multiple gene names in a specific column
...113979920 113979920 C T ncRNA_exonic LINC01191,LOC100499194
465 chr2 131279347 131279347 C G ncRNA_exonic LOC440910
525 chr2 223777758 223777758 T A exonic AP1S3
626 chr3 99794575 99794575 G A exonic COL8A1
643 chr3 132601066 132601066 A G exonic ACKR4
655 chr3 132601999 132601999 A G exonic BCDF5,CDFG6",
header=TRUE,stringsAsFactors=FALSE)
df.sample.out <- ( df.sample.gene
%>% mutate( Gene.refGene = strsplit( Gene.refGene
, ","...
2017 Aug 22
2
splitting a dataframe in R based on multiple gene names in a specific column
...NC01191,LOC100499194
465 chr2 131279347 131279347 C G ncRNA_exonic LOC440910
525 chr2 223777758 223777758 T A exonic AP1S3
626 chr3 99794575 99794575 G A exonic COL8A1
643 chr3 132601066 132601066 A G exonic ACKR4
How could I obtain a data frame in which each row that has multiple gene names
(in the field Gene.refGene) is replicated so that every copy carries only one
gene name? For example, for the second row:
448 chr2 113979920 113979920 C T ncRNA_exonic LINC01191,LOC100499194
we shall get in the final output (that contains al...
2017 Aug 23
0
splitting a dataframe in R based on multiple gene names in a specific column
...hr2 113979920 113979920 C T ncRNA_exonic LINC01191,LOC100499194
465 chr2 131279347 131279347 C G ncRNA_exonic LOC440910
525 chr2 223777758 223777758 T A exonic AP1S3
626 chr3 99794575 99794575 G A exonic COL8A1
643 chr3 132601066 132601066 A G exonic ACKR4
655 chr3 132601999 132601999 A G exonic BCDF5,CDFG6",
header=TRUE,stringsAsFactors=FALSE)
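# indices of the rows whose Gene.refGene field lists more than one gene
multgenes <- grep(",", df.sample.gene$Gene.refGene)
# split those comma-separated entries into individual gene names
rep_genes <- strsplit(df.sample.gene$Gene.refGene[multgenes], ",")
# number of gene names in each multi-gene row
ngenes <- lengths(rep_genes)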
dup_row...
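
The snippets above are cut off before either answer is complete, so here is a minimal base R sketch of the row-splitting idea the question asks for. It is not the code from either reply. The column names other than Gene.refGene, the three sample rows, and the object names `df`, `genes`, `idx` and `df.long` are all assumptions made for illustration.

```r
# Hypothetical sample data modelled on rows quoted in the thread.
# Every column name except Gene.refGene is an assumption.
df <- read.table(text = "Chr Start End Ref Alt Func.refGene Gene.refGene
chr2 113979920 113979920 C T ncRNA_exonic LINC01191,LOC100499194
chr3 132601066 132601066 A G exonic ACKR4
chr3 132601999 132601999 A G exonic BCDF5,CDFG6",
                 header = TRUE, stringsAsFactors = FALSE)

# Split each Gene.refGene entry on commas: a list with one character
# vector of gene names per row.
genes <- strsplit(df$Gene.refGene, ",", fixed = TRUE)

# Repeat each row index once for every gene name that row carries.
idx <- rep(seq_len(nrow(df)), times = lengths(genes))

# Replicate the rows, then overwrite the gene column so that each
# replicated row holds exactly one gene name.
df.long <- df[idx, ]
df.long$Gene.refGene <- unlist(genes)
rownames(df.long) <- NULL

print(df.long)
```

Rows with a single gene pass through unchanged, because `strsplit` returns a length-one vector for them. A row whose Gene.refGene entry is an empty string would be dropped by this sketch, since it splits into zero names.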