Help for package Repliscope

Type:

Package

Language:

en-GB

Title:

Replication Timing Profiling using DNA Copy Number

Version:

1.1.1

Author:

Dzmitry G Batrakou

Maintainer:

Dzmitry G Batrakou <d.batrakou@gmail.com>

Description:

Create, Plot and Compare Replication Timing Profiles. The method is described in Muller et al., (2014) <doi:10.1093/nar/gkt878>.

License:

GPL-3

Encoding:

UTF-8

LazyData:

true

RoxygenNote:

7.2.1

Depends:

R (≥ 3.5.0), ggplot2, shiny

Imports:

stats, utils, methods, grDevices, colourpicker

Suggests:

knitr, rmarkdown

VignetteBuilder:

knitr

NeedsCompilation:

Packaged:

2022-09-13 07:03:36 UTC; dzmit

Repository:

CRAN

Date/Publication:

2022-09-13 07:20:02 UTC

Sequence read coverage ratios for S.cerevisiae Dbf4-9myc sample.

Description

Sequence read coverage ratios for Dbf4-9myc sample (T9394 strain). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.402 to put the values onto biologically relevant relative copy number scale from 1 to 2.

Usage

data(Dbf4myc)

Format

data frame with 11356 rows and 7 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name.rep: replicating sample name
name.nonRep: non-replicating sample name
ratio: ratio value in the current bin
ratioFactor: adjustment factor used for the current ratio

Source

S phase sample: SRA; G2 sample: SRA

References

Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)

Examples

data(Dbf4myc)

Replication profile for wild type DS2 H.volcanii

Description

Replication profile for H.volcanii wild isolate DS2. Genomic DNA for deep sequencing was isolated from 100 ml culture in stationary phase (A650 > 1, DS2_stat sample) or 1 litre in exponential phase (A650 0.1, DS2_exp sample). Unique reads for the two samples were calculated in 1 kb genomic bins using ASM2568v1 genome assembly. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.12 to put the values onto biologically relevant relative copy number scale from 1 to 2.

Usage

data(MFAseq)

Format

data frame with 3887 rows and 7 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name.rep: replicating sample name
name.nonRep: non-replicating sample name
ratio: ratio value in the current bin
ratioFactor: adjustment factor used for the current ratio

Source

DS2_exp exponential phase sample: SRA; DS2_stat stationary sample: SRA

References

Hawkins et al. (2013) Nature 503(7477):544-547 (PubMed)

Examples

data(MFAseq)

Trep data calculated from syncSeq[["data"]]

Description

Trep is median replication time, expressed in minutes after release G1 arrest. It is calculated from multiple relative copy number datapoints across timeseries of a cell cycle experiment. For every genomic bin, a sigmoid function is fitted and its midpoint is reported.

Usage

data(TrepDF)

Format

data frame with 11341 rows and 5 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
Trep: calculated Trep value
TrepErr: error from sigmoid function fitting

References

Müller et al. (2014) NAR 42(1):e3 (PubMed)

Examples

data(TrepDF)

Sequence read coverage ratios for wild type S.cerevisiae W303

Description

Sequence read coverage ratios for wild type sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads.

Usage

data(W303)

Format

data frame with 11350 rows and 7 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name.rep: replicating sample name
name.nonRep: non-replicating sample name
ratio: ratio value in the current bin
ratioFactor: adjustment factor used for the current ratio

Source

S phase sample: SRA; G2 sample: SRA

References

Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)

Examples

data(W303)

Sequence read coverage for wild type S.cerevisiae W303 non-replicating sample.

Description

Sequence read coverage for wild type non-replicating sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into G2/M phase fraction. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads were calculated in 1 kb genomic bins.

Usage

data(W303_G2)

Format

data frame with 11350 rows and 5 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name: sample name
score: read number in current bin

Source

SRA

References

Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)

Examples

data(W303_G2)

Sequence read coverage for wild type S.cerevisiae W303 replicating sample

Description

Sequence read coverage for wild type replicating sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into S phase fraction. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads were calculated in 1 kb genomic bins.

Usage

data(W303_S)

Format

data frame with 11820 rows and 5 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name: sample name
score: read number in current bin

Source

SRA

References

Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)

Examples

data(W303_S)

Normalised sequence read coverage ratios for wild type S.cerevisiae W303

Description

Usage

data(W303norm)

Format

data frame with 11340 rows and 7 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name.rep: replicating sample name
name.nonRep: non-replicating sample name
ratio: ratio value in the current bin
ratioFactor: adjustment factor used for the current ratio

Source

S phase sample: SRA; G2 sample: SRA

References

Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)

Examples

data(W303norm)

A function to calculate Trep values from a sync-seq experiment calcTrep function fits a Boltzman sigmoid function into relative copy number datapoints for every genomic bin of the provided sync-seq merged dataframe. It then extracts time at which half of the cells have this genomic bin replicated (Trep). The output of the function is a dataframe containing Trep and TrepErr data for every genomic bin in a BED-like format.

Description

A function to calculate Trep values from a sync-seq experiment calcTrep function fits a Boltzman sigmoid function into relative copy number datapoints for every genomic bin of the provided sync-seq merged dataframe. It then extracts time at which half of the cells have this genomic bin replicated (Trep). The output of the function is a dataframe containing Trep and TrepErr data for every genomic bin in a BED-like format.

Usage

calcTrep(ratioDFs, times)

Arguments

ratioDFs

A merged ratios dataframe containing sync-seq samples (dataframe).

times

Time series data in the same order as in the ratioDFs (numeric vector).

Examples

TrepDF <- calcTrep(subset(syncSeq[["data"]],chrom=="chrI"),times=c(25,30,35,40,45,50,90))

A function to compare two replication profiles

Description

compareRatios takes two ratio dataframes that were binned the same way and uses z-score statistics to find p-values of their differences. The function outputs a combined dataframe containing the two input ratio dataframes in a long format with added 'p.value' column.

Usage

compareRatios(ratio1, ratio2)

Arguments

ratio1

Ratio dataframe, or a string containing name of a ratio dataframe (dataframe or string).

ratio2

Ratio dataframe, or a string containing name of a ratio dataframe (dataframe or string).

Examples

ratioDFs <- compareRatios(W303norm,Dbf4myc)

Guide dataframe for plotting smoothed sortSeq data

Description

Guide dataframe for plotting smoothed sortSeq data

Usage

data(guide)

Format

Dataframe with 2 rows and 6 variables:

order: Order to plot data in
name.rep: Name of replicating sample
name.nonRep: Name of non-replicating sample
raw: Should raw data be plotted?
smooth: Should smooth data be plotted?
color: Color to plot the profile in

Examples

data(guide)

Load a BED formatted file.

Description

The function reads BED formatted files. The BED file format defined by UCSC: http://genome.ucsc.edu/FAQ/FAQformat. First three columns ("chrom", "chromStart", "chromEnd") are mandatory. The file fields may be separated by tabs, spaces or commas. If the BED file contains a header, it will be ignored. If a genome mask dataframe is provided, only data intersected with the mask will be retained. Resulting data is ordered by "chromStart" columns.

Usage

loadBed(file, genome = NULL, name = NULL)

Arguments

file

Path to the BED file (string, mandatory)

genome

A mask dataframe to exclude data from the BED file (dataframe, optional). The genome dataframe must contain "chrom" column and may further contain "chromStart" and "chromEnd" columns in this order.

name

A string to replace the 'name' column of the loaded BED file with (string, optional).

Examples

W303_G2 <- loadBed(system.file("extdata/W303_G2.bed",package="Repliscope"), name='W303_G2')
W303_G2_chrI <- loadBed(system.file("extdata/W303_G2.bed",package="Repliscope"),
                       name='W303_G2',genome=sacCer3[["genome"]])

A helper function to create a gemome dataframe

Description

makeGenome is called by plotGenome() and plotCoverage() functions if a genome dataframe is not provided. It creates a BED-like dataframe containing unique chromosome names, their start coordinates (assumed 0), their ends (highest value in the corresponding 'chromEnd' of the BED dataframe) and 'midY' column containing half the max value of the "score" or "ratio" columns per chromosome. This later serves as y coordinate to add chromosome names. Axis name is passed via comment to the output dataframe. Extract it with 'attributes(genome)$axisName'.

Usage

makeGenome(DF, region = FALSE)

Arguments

DF

A BED or ratio dataframe containing either 'score' or 'ratio' column (dataframe).

region

String in the format 'chrI:1000-3000' (string, optional).

Examples

genomeDF <- makeGenome(W303_G2)

A helper function to create axis ticks and human readable labels.

Description

makeLabels is called by plotGenome() and plotCoverage() functions. It creates a dataframe containing two columns: 'ticks' and 'labels'. 'Ticks' contains axis ticks coordinates, 'labels' will contain human readable lables for the ticks (using prefixes and optional units).

Usage

makeLabels(theMin, theMax, unit = "")

Arguments

theMin

Minimum value for the scale (double).

theMax

Maximum value for the scale (double).

unit

Unit to use for the labels (string, optional)

Examples

labels <- makeLabels(0,1200000,"b")

A function to calculate 'score' ratio between two bed dataframes makeRatio merges two supplied bed dataframes, calculates ratio of their "score" values normalises the ratio by the 'score' sums.

Description

A function to calculate 'score' ratio between two bed dataframes makeRatio merges two supplied bed dataframes, calculates ratio of their "score" values normalises the ratio by the 'score' sums.

Usage

makeRatio(bedRep, bedNonRep)

Arguments

bedRep

Bed dataframe containing read counts from a replicating sample (dataframe). The bed dataframe must contain "chrom","chromStart", "chromEnd" and "score" columns.

bedNonRep

Bed dataframe containing read counts from a non-replicating sample (dataframe). The bed dataframe must contain "chrom","chromStart", "chromEnd" and "score" columns.

Examples

ratioDF <- makeRatio(W303_S,W303_G2)

A function to normalise ratio values from 'ratio' column of the provided dataframe to fit biologically-relevant scale. It scales values either using supplied 'rFactor' value or automatically to best fit 1 to 2 scale (the upper limit of the scale may be adjusted with the upperLimit parameter). Normalisation factor used is stored in 'ratioFactor' column and also passed as the dataframe comment. To extract it, use 'attributes(mergedBed)$comment'

Description

A function to normalise ratio values from 'ratio' column of the provided dataframe to fit biologically-relevant scale. It scales values either using supplied 'rFactor' value or automatically to best fit 1 to 2 scale (the upper limit of the scale may be adjusted with the upperLimit parameter). Normalisation factor used is stored in 'ratioFactor' column and also passed as the dataframe comment. To extract it, use 'attributes(mergedBed)$comment'

Usage

normaliseRatio(ratioDF, rFactor = NULL, upperLimit = 2, replace = TRUE)

Arguments

ratioDF

A ratio dataframe containing 'ratio' column (dataframe).

rFactor

Value to normalise by, related to replication progression (numeric, optional).

upperLimit

Top value for the scale, defaults to 2 (numeric, optional, defaults).

replace

Should the existing 'ratio' values be overwritten or stored in a new column (boolean, defaults to TRUE).

Examples

ratioDF <- normaliseRatio(W303) ## scales to 1 to 2 range, replaces original values.
ratioDF <- normaliseRatio(W303,rFactor=1.41,replace=FALSE)
# (multiplies score values by 1.41 and keeps the original values)

A function to boxplot 'score' column of a BED dataframe, per unique chromosome name in the 'chrom' column. The resulting plot also highlights outliers based on the inter quartile range (IQR). The genome wide median is plotted as a pink line through the boxplots.

Description

A function to boxplot 'score' column of a BED dataframe, per unique chromosome name in the 'chrom' column. The resulting plot also highlights outliers based on the inter quartile range (IQR). The genome wide median is plotted as a pink line through the boxplots.

Usage

plotBed(bed, plotting = TRUE)

Arguments

bed

A dataframe containing 'score' and 'chrom' columns (dataframe).

plotting

Should the plot object be sent to the default device? (boolean, defaults to TRUE).

Examples

plotBed(W303_S)
plotObject <- plotBed(W303_G2,plotting=FALSE)

A function to scatterplot 'score' column of a BED dataframe plotCoverage function plots values in the 'score' column of the supplied bed dataframe as a function of chromosome coordinates. The genome wide median is plotted as a pink line.

Description

A function to scatterplot 'score' column of a BED dataframe plotCoverage function plots values in the 'score' column of the supplied bed dataframe as a function of chromosome coordinates. The genome wide median is plotted as a pink line.

Usage

plotCoverage(bed, region = FALSE, plotting = TRUE)

Arguments

bed

A dataframe containing 'score','chrom','chromStart' and 'chromEnd' columns (dataframe).

region

Only plot for the provided region in the format 'chrI:1000-3000' (string, optional).

plotting

Should the plot object be sent to the default device? (boolean, defaults to TRUE).

Examples

plotCoverage(W303_G2)
plotObject <- plotCoverage(W303_S,plotting=FALSE)

plotGenome: plot replication profile.

Description

plotGenome plots scatterplot/barplot/polygon of 'score' and/or 'splineSmooth' columns values by genomic coordinates, either for the whole genome or a region. It also allows annotation with additional BED-formatted data. Ggplot2 package is used for plotting.

Usage

plotGenome(
  ratioDFs,
  geom = "geom_point",
  ylims = c(1, 2),
  plotting = TRUE,
  genome = NULL,
  region = FALSE,
  guide = NULL,
  lines = NULL,
  circles = NULL,
  rectangles = NULL,
  pointers = NULL,
  colourLines = "#00FF00",
  colourCircles = "#FFFFFF",
  colourRectangles = "#FF0000",
  colourPointers = "#FF7F00"
)

Arguments

ratioDFs

A ratio dataframe or combined ratios dataframe containing 'ratio' column (dataframe).

geom

ggplot2 geom to use for plotting: "geom_point","geom_ribbon" or "geom_segment" (string, defaults to "geom_point").

ylims

A vector of two values for y axis limits - first is lowest, second is highest (numeric vector, defaults to 1 and 2)

plotting

Should the plot object be sent to the default device? (boolean, defaults to TRUE).

genome

A mask dataframe to exclude data from the ratio dataframe (dataframe, optional). The genome dataframe must contain "chrom","chromStart" and "chromEnd" columns.

region

Only plot for the provided region in the format 'chrI:1000-3000' (string, optional).

guide

A dataframe guiding the plotGenome function how to plot the data (dataframe, optional). The guide dataframe must contain the following columns: 'order' (integer) - order to plot data in, 'name.rep' (character) - replicating sample name that matches the one in the ratioDFs dataframe, 'name.nonRep' (character) - non-replicating sample name that matches the one in the ratioDFs dataframe, 'raw' (logical) - should the raw raw data be plotted? 'smooth' (logical) - should the smoothed data be plotted? 'color'(character) - R color to plot the current sample with, both raw and smoothed data.

lines

Additionally plot vertical lines from a BED formatted dataframe (dataframe, optional).

circles

Additionally plot circles on the chromosome line from a BED formatted dataframe (dataframe, optional).

rectangles

Additionally plot rectangles on the chromosome line from a BED formatted dataframe (dataframe, optional).

pointers

Additionally plot downward pointing triangles from a BED formatted dataframe (dataframe, optional).

colourLines

Colour for 'lines' data (string, defaults to green).

colourCircles

Colour for 'circles' data (string, defaults to white).

colourRectangles

Colour for 'rectangles' data (string, defaults to red).

colourPointers

Colour for 'pointers' data (string, defaults to orange).

Examples

plotGenome(sortSeq,geom="geom_ribbon",guide=guide,region="chrIX:250000-439885",
    lines=sacCer3[["cen"]],circles=sacCer3[["ori"]])
# plot data as polygon for the specified region of chromosome 9

plotGenome(syncSeq[["data"]],geom="geom_segment",guide=syncSeq[["guide"]],
    region="chrVII:0-1090944",genome=sacCer3[["genome"]],lines=sacCer3[["cen"]],
    circles=sacCer3[["ori"]],colourLines="black")

plotGenome(MFAseq,region='chr1:0-2848000')
# plot marker frequency analysis for H.volcanii isolate DS2

A function to plot a histogram of supplied ratio vector plotRatio plots histogram of values in a supplied vector using ggplot2 and highlights interval between 1 and 2 in green.

Description

A function to plot a histogram of supplied ratio vector plotRatio plots histogram of values in a supplied vector using ggplot2 and highlights interval between 1 and 2 in green.

Usage

plotRatio(ratio, plotting = TRUE)

Arguments

ratio

A numeric vector containing raw or smoothed ratio values (vector).

plotting

Should the plot object be sent to the default device? (boolean, defaults to TRUE).

Examples

plotRatio(W303$ratio)
plotObject <- plotRatio(W303$ratio,plotting=FALSE)

A function to scatterplot 'Trep' column of a Trep dataframe plotTrep function plots values in the 'Trep' column of the supplied dataframe as a function of chromosome coordinates. The genome wide median is plotted as a pink line.

Description

A function to scatterplot 'Trep' column of a Trep dataframe plotTrep function plots values in the 'Trep' column of the supplied dataframe as a function of chromosome coordinates. The genome wide median is plotted as a pink line.

Usage

plotTrep(TrepDF, region = FALSE, plotting = TRUE)

Arguments

TrepDF

A dataframe containing 'chrom','chromStart', 'chromEnd' and 'Trep' columns (dataframe).

region

Only plot for the provided region in the format 'chrI:1000-3000' (string, optional).

plotting

Should the plot object be sent to the default device? (boolean, defaults to TRUE).

Examples

plotTrep(TrepDF,region="chrI")

A function to remove single chromosome data from a bed dataframe

Description

A function to remove single chromosome data from a bed dataframe

Usage

rmChr(bed, chr)

Arguments

bed

A bed dataframe containing 'chrom' column (dataframe, required).

chr

Chromosome to remove (string, required).

Examples

bedDF <- rmChr(W303_S,"chrM") ## removes mitochondria

A function to remove outliers from the "score" column of a supplied bed dataframe There are three methods: max, IQR and median. Max is used to remove 1 or more maximum values; IQR uses interquartile range to detect outliers, while median method can be used to remove data based on genome-wide median.

Description

A function to remove outliers from the "score" column of a supplied bed dataframe There are three methods: max, IQR and median. Max is used to remove 1 or more maximum values; IQR uses interquartile range to detect outliers, while median method can be used to remove data based on genome-wide median.

Usage

rmOutliers(bed, method, n = 1, range = 3, loLim = 0.25, hiLim = NULL)

Arguments

bed

A dataframe containing 'score' column (dataframe, required).

method

Method to detect outliers: "max", "IQR" or "median" (string).

n

Number of max values to remove (integer,defaults to 1). Use with "max" method.

range

Number of IQR above the 3rd or below the 1st IQR to set the threshold (double, defaults to 3). Use with "IQR" method.

loLim

Low limit for the median method (double, defaults to 0.25).

hiLim

High limit for the median method (double).

Examples

bedDF <- rmOutliers(W303_S,method="max",n=2) ## removes 2 rows of data containing 3 top values
bedDF <- rmOutliers(W303_S,method="IQR",range=3) ## removes datapoints outside 3 x IQR above the 3rd
 # and below the 1st IQR.
bedDF <- rmOutliers(W303_S,method="median",loLim=0.25,hiLim=2) # removes datapoints that are lower
 # than 0.25 x genome median or above 2 x genome median.

A function to launch Repliscope in interactive mode (Shiny app).

Description

A function to launch Repliscope in interactive mode (Shiny app).

Usage

runGUI()

S.cerevisiae genome information

Description

sacCer3 meta information: chromosome sizes, centromere and replication origin positions.

Usage

data(sacCer3)

Format

List containing three dataframes

genome: Chromosome information dataframe
cen: Centromere information dataframe
ori: Replication origin information dataframe

Source

Replication origin information: (OriDB)

References

Siow et al. (2011) NAR 40(Database issue):D682-6 (PubMed)

Examples

data(sacCer3)

A function to smooth ratio values using cubic smoothing spline smoothRatio function splits values from 'ratio' column by chromosome and based the supplied 'groupMin' and 'split' parameters and then applies smooth.spline() function from R stats package. The supplied dataframe may contain multiple ratios, i.e. ratios produced using multiple replicating samples and/or multiple non-replicating samples. This must be reflected in 'name.rep' and 'name.nonRep' columns. In other words, different ratio dataframes may be merged using rbind() function before calling smoothRatio() function.

Description

A function to smooth ratio values using cubic smoothing spline smoothRatio function splits values from 'ratio' column by chromosome and based the supplied 'groupMin' and 'split' parameters and then applies smooth.spline() function from R stats package. The supplied dataframe may contain multiple ratios, i.e. ratios produced using multiple replicating samples and/or multiple non-replicating samples. This must be reflected in 'name.rep' and 'name.nonRep' columns. In other words, different ratio dataframes may be merged using rbind() function before calling smoothRatio() function.

Usage

smoothRatio(ratioDF, groupMin = 5, splitNum = 5)

Arguments

ratioDF

A ratio dataframe or combined ratios dataframe containing 'ratio' column (dataframe).

groupMin

Minimum number of values required to make a group (integer, defaults to 5).

splitNum

Minimum number of adjacent bins with missing values to close current group (integer, defaults to 5).

Examples

ratioDF <- smoothRatio(W303norm)

Replication profiles for wild type and Dbf4-9myc S.cerevisiae samples

Description

Replication profiles for wild type and Dbf4-9myc samples (T7107 and T9394 strains). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.41 and 1.402 for wild type and Dbf4-9myc samples, respectively, to put the values onto biologically relevant relative copy number scale from 1 to 2. The relative copy number values were smoothed using cubic spline and compared using z score statistics.

Usage

data(sortSeq)

Format

data frame with 22696 rows and 10 variables:

chrom: short chromosome name
chromStart: left chromosome coordinate
chromEnd: right chromosome coordinate
name.rep: replicating sample name
name.nonRep: non-replicating sample name
ratio: ratio value in the current bin
ratioFactor: adjustment factor used for the current ratio
group: Group number of the current bin
splineSmooth: Smoothed ratio value
p.value: Significance of ratio difference between Dbf4myc and W303 samples

Source

Dbf4myc S phase sample: SRA; Dbf4myc G2 sample: SRA; W303 S sample: SRA; W303 G2 sample: SRA

References

Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)

Examples

data(sortSeq)

Replication profiles budding yeast arrest-release samples

Description

Replication profiles of wild type S. cerevisiae arrest-release samples (AUY077 strain). The cells were arrested in G1 with alpha-factor followed by release using pronase. The samples were collected before the release (aFactor) and various time intervals after the release (25min,30min,35min,40min,45min,50min and 90min) Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (post-release) and non-replicating (aFactor) samples were calculated in 1 kb genomic bins. The ratios were created by dividing 'score' values from replicating samples by non-replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted based on bulk genome replication (as determined by flow cytometry), to put the values onto biologically relevant relative copy number scale from 1 to 2. The relative copy number values were smoothed using cubic spline.

Usage

data(syncSeq)

Format

List containing two data frames

data: syncSeq replication profiles data. Columns: chrom (short chromosome name), chromStart (left chromosome coordinate), chromEnd (right chromosome coordinate), name.rep (replicating sample name), name.nonRep (non-replicating sample name), ratio (ratio value in the current bin), ratioFactor (adjustment factor used for the current ratio), group (Group number of the current bin), splineSmooth (Smoothed ratio value)
guide: Guide dataframe for plotting the syncSeq data order (Order to plot data in), name.rep (Name of replicating sample), name.nonRep (Name of non-replicating sample), raw (Should raw data be plotted?), smooth (Should smooth data be plotted?), color (Color to plot the profile in)

Source

GEO

References

Müller et al. (2014) NAR 42(1):e3 (PubMed)

Examples

data(syncSeq)

A function to remove outliers from the "ratio" column of a supplied ratio dataframe trimRatio is applied to the calculated ratio of read counts from a replicating to a non-replicating samples.

Description

A function to remove outliers from the "ratio" column of a supplied ratio dataframe trimRatio is applied to the calculated ratio of read counts from a replicating to a non-replicating samples.

Usage

trimRatio(ratioDF, loLim, hiLim)

Arguments

ratioDF

A ratio dataframe containing 'ratio' column (dataframe, required).

loLim

Low limit threshold (double, required).

hiLim

High limit threshold (double, required).

Examples

W303 <- trimRatio(W303,0.5,1.5)