thr3ads.net - similar to: "counting identical data in a column"

Displaying 20 results from an estimated 100 matches similar to: "counting identical data in a column"

2008 Feb 18

remove column names from a data frame

I want to remove the column names from a data frame. I do it the long way, can any body show me a better way ? df= data.frame(chrN= c(“chr1”, “chr2”, “chr3”), start= c(1, 2, 3), end= c(4, 5, 6), score= c(7, 8, 9)) df #I write a txt file without row or column names write.table(df,"df1.txt",sep='\t',quote=FALSE,row.names=F,col.names=F) #then I read it with the header = F

inserting text lines in a dat frame

2008 Feb 06

inserting text lines in a dat frame

Hi Jim I am trying to prepare a bed file to load as accustom track on the UCSC genome browser. I have a data frame that looks like the one below. > x V1 V2 V3 1 chr1 11255 55 2 chr1 11320 29 3 chr1 11400 45 4 chr2 21680 35 5 chr2 21750 84 6 chr2 21820 29 7 chr2 31890 46 8 chr3 32100 29 9 chr3 52380 29 10 chr3 66450 46 I would like to insert the following 4 lines at the beginning:

splitting a dataframe in R based on multiple gene names in a specific column

2017 Aug 22

splitting a dataframe in R based on multiple gene names in a specific column

I would appreciate please a suggestion on how to do the following : i'm working with a dataframe in R that contains in a specific column multiple gene names, eg : > df.sample.gene[15:20,2:8] Chr Start End Ref Alt Func.refGene Gene.refGene284 chr2 16080996 16080996 C T ncRNA_exonic GACAT3448 chr2 113979920 113979920 C T ncRNA_exonic LINC01191,LOC100499194465

how to search to value to another table

2011 Jan 31

how to search to value to another table

Hello, I'm a new R user. I have two different dummy tables with the variable name tb1 and tb2. tb1< v1 v2 v3 v4 "chr1" 22 23 3 "chr1" 36 37 1 "chr1" 54 55 0 "chr1" 77 78 1 "chr2" 80 81 4 "chr2" 85 86 0 "chr2" 99 100 1

data frame question

2008 Feb 10

data frame question

Hello I have 2 data frames df1 and df2. I would like to create a new data frame new_df which will contain only the common rows based on the first 2 columns (chrN and start). The column score in the new data frame should be replaced with a column containing the average score (average_score) from df1 and df2. df1= data.frame(chrN= c(“chr1”, “chr1”, “chr1”, “chr1”, “chr2”, “chr2”, “chr2”),

splitting a dataframe in R based on multiple gene names in a specific column

2017 Aug 23

splitting a dataframe in R based on multiple gene names in a specific column

Hi Bogdan, Messy, and very specific to your problem: df.sample.gene<-read.table( text="Chr Start End Ref Alt Func.refGene Gene.refGene 284 chr2 16080996 16080996 C T ncRNA_exonic GACAT3 448 chr2 113979920 113979920 C T ncRNA_exonic LINC01191,LOC100499194 465 chr2 131279347 131279347 C G ncRNA_exonic LOC440910 525 chr2 223777758 223777758 T A

splitting a dataframe in R based on multiple gene names in a specific column

2017 Aug 25

splitting a dataframe in R based on multiple gene names in a specific column

If row numbers can be dispensed with, then tidyr makes this easy with the unnest function: ##### library(dplyr) #> #> Attaching package: 'dplyr' #> The following objects are masked from 'package:stats': #> #> filter, lag #> The following objects are masked from 'package:base': #> #> intersect, setdiff, setequal, union library(purrr)

Histogram for each ID value

2011 Oct 17

Histogram for each ID value

I have a dataframe in the general format: chr1 0.5 chr1 0 chr1 0.75 chr2 0 chr2 0 chr3 1 chr3 1 chr3 0.5 chr7 0.75 chr9 1 chr9 1 chr22 0.5 chr22 0.5 where the first column is the chromosome location and the second column is some value. What I'd like to do is have a histogram created for each chr location (i.e. a separate histogram for chr1, chr2, chr3, chr7, chr9, and chr22). I am just

[Help]:How to use "loop" to achieve this aim?

2010 Oct 01

[Help]:How to use "loop" to achieve this aim?

Hello, Everyone, how to use "loop" to make the process automatic and fast? When compute each sample, the script type in R almost the same, just the input and output file's name is changed(chr1 change to chr2, chr3,chr4...). The first sample's script like this: >chr1=MEDIPS.readAlignedSeqences(BSgenome="hg19", file="chr1",numrows= )

replacing percentage of values in data frame

2011 Oct 19

replacing percentage of values in data frame

I've been looking for how to change a certain percentage of values in a data frame, but I've been struggling to find information in R. For example: #################example data############## > data V1 V2 V3 V4 V5 V6 V7 1 chr1 500 500 CHH 0 0.5 + 2 chr1 550 550 CHH 0 0.0 + 3 chr2 700 700 CHH 0 0.0 + 4 chr2 1000 1000 CHH 0 0.0 + 5 chr3

qplot: plotting precipitation data

2012 Sep 15

qplot: plotting precipitation data

Dear list, I wish to plot chromatin precipitation data: I would like to have a rectangles (x:end-start, y:peak) but I do not have an idea how to define x (in terms of qplot syntax) and to choose the correct geom. mydata is a subset of a larger file. > mydata chrom start end peak 1 chr11 5291000 5291926 8 2 chr11 10988025 10988526 7 3 chr11 11767950 11768676 8 4

Simple Question?

2012 Jun 21

Simple Question?

Greetings, I am new to R, but trying to put in the time to learn. I have read the R manual and several other introductory texts; however, there is nothing like actually putting it into practice. So here is my problem, and its more of a learning exercise for myself than anything else, but I'm stuck and getting extremely frustrated that I can't figure it out. I'm trying to make a

How do I order xyplot line points?

2010 Nov 07

How do I order xyplot line points?

I have the following xyplot figure: http://img577.imageshack.us/img577/686/filesizeresults12000000.png The data are organized in a matrix file as follows: Type Elements Chromosome Time bedGz 12000000 chr1 14.240 bedGz 12000000 chr2 7.949 bedGz 12000000 chr3 5.103 bedGz 12000000 chr4 5.290 bedGz 12000000 chr5 5.161 ... The x-axis labels in the Chromosome column are ordered

convertin a data frame column from character to numeric

2008 Feb 08

convertin a data frame column from character to numeric

I have a data.frame with all character columns, I would like to convert the last two columns into numeric.> x[1:5, ] chrN start end 1 chr1 71310034 71310064 2 chr14 23354088 23354118 3 chr14 71310034 71310064 4 chr15 37759058 37759088 5 chr22 18262638 18262668 > apply(x, 2, FUN = mode) chrN start end

Warning about undocumented data sets

2008 May 15

Warning about undocumented data sets

Hi, List, Sorry in advance if this turns out to be a stupid question -- I've been trying to work it out for awhile, and I don't have any new ideas -- I'm very new to R documentation/ LATEX. I am running "R CMD check " on a package that I am trying to write; the only warning is: * checking for missing documentation entries ... WARNING Undocumented data sets: Einter KG All

R functions

2011 Sep 15

R functions

Hi group, I am trying to right a code to do the following This is how the test file looks like: Chr start end sample1 sample2 chr2 9896633 9896683 0 0 chr2 9896639 9896690 0 0 chr2 14314039 14314098 0 -0.35 chr2 14404467 14404502 0 -0.35 chr2 14421718 14421777 -0.43 -0.35 chr2 16031710 16031769 -0.43 -0.35 chr2 16036178 16036237 -0.43 -0.35 chr2 16048665 16048724 -0.43 -0.35 chr2 37491676 37491735

How to Store the executed values in a dataframe & rle function

2011 Sep 26

How to Store the executed values in a dataframe & rle function

Hi group, This is how my test file looks like: Chr start end sample1 sample2 chr2 9896633 9896683 0 0 chr2 9896639 9896690 0 0 chr2 14314039 14314098 0 -0.35 chr2 14404467 14404502 0 -0.35 chr2 14421718 14421777 -0.43 -0.35 chr2 16031710 16031769 -0.43 -0.35 chr2 16036178 16036237 -0.43 -0.35 chr2 16048665 16048724 -0.43 -0.35 chr2 37491676 37491735 0 0 chr2 37702947 37703009 0 0

Loops for repetitive task

2011 Aug 10

Loops for repetitive task

Hello, I have an R script that I use as a template to perform a task for multiple files (in this case, multiple chromosomes). What I would like to do is to utilize a simple loop to parse through each chromosome number so that I don't have to type the same code over and over again in the R console. I've tried using: for(i in 1:22){ etc.. } and replacing each chromosome number with

sort a data.frame

2010 May 20

sort a data.frame

Hello, I have a dataframe: dd <- data.frame(b = c("chr2", "chr1", "chr15", "chr13"), x = c("A", "D", "A", "C"), y = c(8, 3, 9, 9), z = c(1, 1, 1, 2)) >dd b x y z 1 chr2 A 8 1 2 chr1 D 3 1 3 chr15 A 9 1 4 chr13 C 9 2 Now I want to sort them according column "b", but only its

Writing a summary file in R

2011 Jul 27

Writing a summary file in R

Hello, I have an input file: http://r.789695.n4.nabble.com/file/n3700031/testOut.txt testOut.txt where col 1 is chromosome, column2 is start of region, column 3 is end of region, column 4 and 5 is base position, column 6 is total reads, column 7 is methylation data, and column 8 is the strand. I would like a summary output file such as:

similar to: counting identical data in a column