Major changes:
bcbioBaseTestsURL
to bcbioBaseTestsUrl
.getGTFFileFromYAML
to getGtfFileFromYaml
.getMetricsFromYAML
to getMetricsFromYaml
.getSampleDataFromYAML
to getSampleDataFromYaml
.getUMITypeFromCommands
to getUmiTypeFromCommands
.Minor changes:
Minor changes:
.
from global variables.Minor changes:
Major changes:
Minor changes:
Major changes:
copyToDropbox
function, which is not commonly used and is
difficult to properly unit test.Minor changes:
importDataVersions
and importProgramVersions
to now always
use base R engine, as this avoids some parsing issues that can pop up when
using readr engine instead. Primarily this applies to inconsistent date
formatting in data versions return, which can cause a POSIX date error to
return when using readr engine.sampleDirs
function now excludes nested pipeline directories such as
bcbioRNASeq
, which was added in 2021.import
(from pipette), which is used in working examples.Minor changes:
as
, is
, new
.Minor changes:
str_match
from basejump rather than stringr, reducing the
number of required packages in imports.Minor changes:
Minor changes:
rbindToDataFrame
function
(see AcidPlyr and/or basejump for detail) instead of the now deprecated
unlistToDataFrame
approach. The new rbindToDataFrame
function always
returns 1:1 from nested list elements to rows, which is ideal for sample
metadata and quality control metrics. Alternatively, rbindlist
from
data.table (with fill = TRUE
) is also work a look, but it can introduce
unwanted row expansion and there is no way currently to enforce 1:1 mapping.
So we wrote our own function inside of basejump.Minor changes:
getSampleDataFromYAML
: Fix for call to AcidPlyr::unlistToDataFrame
,
which now returns "name" column for nested YAML processing. See internal
.sampleYAML
generator for details.Minor changes:
unlistToDataFrame
function defined in AcidPlyr, rather than deprecated
plyr ldply
approach.Minor changes:
projectDir
: Improved error message on match failure.Minor changes:
Minor changes:
Minor changes:
Minor changes:
Minor changes:
Minor changes:
read*
functions in favor of import*
. This naming format is
more consistent with other functions used in basejump.Major changes:
readSampleData
and readTxGene
functions to basejump. These are
shared methods applicable outside of bcbio. They are still reexported here
inside the package, maintaining full backward compatibility.Minor changes:
copyToDropbox
now looks for the optional rdrop2 package, rather than
importing as a dependency. This keeps the package lighter.Minor changes:
Minor changes:
Minor changes:
Minor changes:
Major changes:
Minor changes:
readSampleData
. Updated
internal .isSampleData
assert check function to return more informative
error messages to the user on failure.Major changes:
Minor changes:
Minor changes:
Minor changes:
rnaseq
Docker image.tests.acidgenomics.com
URL.Updated to work with basejump v0.10 release series.
Deprecations:
bcbio_geom_abline
, bcbio_geom_label
, bcbio_geom_label_average
,
bcbio_geom_label_repel
.readLog
, readTx2gene
,
readYAMLSampleData
, readYAMLSampleMetrics
.Minor changes:
tests/testthat/
are now consistently formatted using kebab case instead of snake case.Minor changes:
Minor changes:
metadataBlacklist
to basejump package, for use in
makeSummarizedExperiment
and makeSingleCellExperiment
functions. Global
variable is still reexported here.%>%
) and
import
.Minor changes:
runDate()
: Updated assert check to be compatible with R 3.4 / BioC 3.6.Minor changes:
Minor changes:
sampleDirs
: Added an informative message for user regarding sample name
sanitization of cellular barcodes.Minor changes:
runDate
to documentation.Minor changes:
Minor changes:
logical(1)
instead of string
, for example. This approach was
inspired by the conventions used in the checkmate package.Major changes:
Minor changes:
Minor changes:
Minor changes:
Minor changes:
getBarcodeCutoffFromCommands
: Simplified flow in cutoff
internal variable
assignment.deprecated.R
file.projectDir
and runDate
functions into separate R files, from
detect.R
.yaml.R
.metadataBlacklist
global and added improved comments..assertIsSampleData
and .makeSampleData
internal code.Minor changes:
Minor changes:
Working towards a release candidate for Bioconductor submission.
New functions:
projectDir
: Returns the latest dated directory path inside a bcbio
run. If bcbio has been run multiple times to the same upload directory,
the function will return the latest project directory and warn the user.Major changes:
readTx2gene
now returns a tx2gene
class object.DataFrame
instead of data.frame
.sampleDirs
is now stricter about loading sample directories with names
that are not valid in [R][]. This now checks for non-alphanumeric characters,
including spaces, dashes, and samples that begin with a number. The
bcbio pipeline will be updated to enforce these rules, to avoid unexpected
downstream behavior in [R][] due to invalid names.readYAMLSampleMetrics
should return percentGC
instead of xGC
. 5'->3'
bias should return as x5x3Bias
. The plyr::ldply
call used to coerce
from a list
to data.frame
will sanitize names, so we need to apply our
rules before this step.Minor changes:
lanePattern
variable to basejump package.Minor changes:
prepareTemplate
: Migrated function to basejump package, and simplifed
to copy all files from rmarkdown/shared
directory inside a package.
Currently in use by bcbioRNASeq and bcbioSingleCell.This is a maintenance release designed to simplify the package for long-term
stability. Here we are moving all of the current S4 method support to
basejump for consistency, since the methods apply to SummarizedExperiment
and are not bcbio-workflow specific. We are keeping the package simple by
exporting the read*
family of functions here, which are designed to integrate
with the bcbio output directories.
Major changes:
flatFiles
, metrics
,
plotCorrelationHeatmap
, plotDot
, plotGene
, plotHeatmap
,
plotQuantileHeatmap
, plotViolin
.prepareSummarizedExperiment
has been moved to basejump and renamed
to makeSummarizedExperiment
, for consistency. The underlying code remains
the same.SummarizedExperiment
to list
coercion method to basejump.Minor changes:
separatorBar
and updateMsg
to basejump.Minor changes:
readYAML
family of functions.Major changes:
SummarizedExperiment
method suport to
basejump package: gene2symbol
, interestingGroups
, sampleData
,
sampleNames
, sanitizeSampleData
, selectSamples
,
uniteInterestingGroups
.SummarizedExperiment
to basejump:
convertGenesToSymbols
, counts
, gene2symbol
, interestingGroups
,
sampleData
, etc.SummarizedExperiment
method support.
The documentation for these heatmap functions has been improved (and
simplified), showing the supported arguments more clearly.Minor changes:
transcriptID
instead of txID
in tx2gene and GRanges metadata
defined in rowRanges
slot of object.aes
instead of aes_string
for all internal ggplot2 code,
which uses tidyeval (v3.0 update).rse_bcb
and rse_bcb
example data to basejump, for unit testing.
These datasets are still available in bcbioBase.assertFormalAnnotationCol
function defunct. No longer using to check
annotationCol
argument integration in heatmap functions, since we're now
providing only SummarizedExperiment
method support, instead of matrix
method support (which is too complicated and error prone for users).assertFormalInterestingGroups
function to basejump, where the
other assert check functions are defined. This helps improve package
consistency.Minor changes:
bcbio_geom_abline
.-methods.R
as a suffix, as recommended
by Bioconductor, in preparation for package submission.New functions:
minimalSampleData
enables easy creation of a sample metadata data.frame
by simplify specifying the sample names (e.g. "description" in bcbio
YAML). This function was added for easy metadata handling for 10X Cell Ranger
in the bcbioSingleCell package, but is generally applicable to other
bcbio datasets and may be incorporated into bcbioRNASeq in a future
update.Minor changes:
interestingGroups
handling inside plotHeatmap
family
of functions.prepareSummarizedExperiment
if missing transcripts are present. Now the
function simply lists the genes that don't have metadata in rowRanges
.bcbioBase-package.R
file.Minor changes:
sampleData<-
method support for standard data.frame
.Minor changes:
overwrite
parameter to prepareTemplate
, disabled by default.sampleData
accessor.sampleNames
.Minor changes:
sanitizeSampleData
from basejump to bcbioBase.interestingGroups
column in sampleData
return when
clean = TRUE
.New methods:
SummarizedExperiment
method support for sampleNames
generic.New functions:
bcbio_geom_abline
,
bcbio_geom_label
, bcbio_geom_label_average
, and bcbio_geom_label_repel
.Minor changes:
make.unique
instead of make.names
for convertGenesToSymbols
coercion on SummarizedExperiment objects. We're
using make.names
only when coercing seurat objects
(see bcbioSingleCell code).RangedSummarizedExperiment
working example, based on bcbioRNASeq
example dataset.Minor changes:
Minor changes:
description
to metadataBlacklist
.readSampleData
: Use description
metadata column internally instead of
sampleID
.Minor changes:
make.names
instead of make.unique
in convertGenesToSymbols
method
for SummarizedExperiment
class.Major changes:
sampleData
now supports clean = TRUE
argument, which will hide columns
that contain metrics or other calculations from the user. This is useful when
preparing an R Markdown report, where we only want to show relevant metadata
(e.g. factor columns). This is enabled by default.metadataBlacklist
to hide specific columns in sample metadata.sampleMetadata
is now deprecated in favor of sampleData
. This function
still works but will now warn the user, and should be removed from R Markdown
template code.SummarizedExperiment
to list
coercion support, which uses the same
code as flatFiles
.readSampleData
no longer requires or recommended fileName
column. Only
the description
column is required for demultiplexed data. In the case of
multiplexed samples (e.g. inDrops single-cell RNA-seq), then sampleName
,
index
and sequence
names are also required. Multiplexed cell ranger data
only requires sampleName
and index
, since the index barcode isn't present
in the counts matrix.Minor changes:
sampleID
is no longer included in metadata priority columns. This is only
to be used internally and should be hidden from the user where possible.prepareSampleData
is no longer needed and is now defunct.Minor changes:
readYAML
family of functions.readSampleData
: switched to using merge
instead of left_join
internally.inferno
, and
the British variant scale_colour_viridis
.plotHeatmap
, plotQuantileHeatmap
: Always attempt to convert genes to
symbols for heatmaps.Minor changes:
convertGenesToSymbols
and gene2symbol
.Major changes:
convertGenesToSymbols
now has method support for SummarizedExperiment
.rse_bcb
and rse_dds
.prepareSummarizedExperiment
now supports transgenes with the
transgeneNames
argument (e.g. "EGFP"), and FASTA spike-ins with the
spikeNames
argument (e.g. "ERCCs"). Additionally, rowRanges
and colData
are no longer required and can be left NULL
, although this isn't generally
recommended.Minor changes:
sampleYAML
function code.bcbioBase-package.R
file.globals.R
file.Major changes:
sampleData
generic. Providing method support for
SummarizedExperiment
class. This generic provides consistent sample
metadata support in both the bcbioRNASeq and bcbioSingleCell
packages. For bcbioSingleCell, colData
returns information about the
cells, not the samples. For bcbioRNASeq, colData
and sampleData
provide similar information on the samples.uniteInterestingGroups
as a generic. Provides support for adding an
interestingGroups
column to a data frame.SummarizedExperiment
method support for gene2symbol
,
interestingGroups
generics.prepareSummarizedExperiment
from a generic to a standard function.prepareSummarizedExperiment
now requires rowRanges and supports FASTA
spike-ins with isSpike
argument. This helps set up a SingleCellExperiment
object in the bcbioSingleCell package.prepareTemplate
has been converted from a generic to a standard function.readDataVersions
and readProgramVersions
now return an empty tibble
if the file is missing. bcbio doesn't always output these files, so we have
changed the behavior from stopping on a missing file to simply warning and
returning empty.sampleDirs
function, that informs the user about the names of the
sample directories in the bcbio run.Minor changes:
readSampleMetadataFile
and sampleYAMLMetadata
.projectDirPattern
to match bcbio project directories
(e.g. 2018-01-01_rnaseq
) across packages.rdsToken
in copyToDropbox
to be NULL
instead of NA
.sessioninfo::session_info
instead of
devtools::session_info
.BiocCheck
.Deprecations:
sampleMetadata
in favor of sampleData
.flatFiles
in favor of using as(object, "list")
coercion method
instead.prepareSampleMetadata
in favor of prepareSampleData
.annotable
method on SummarizedExperiment
objects defunct.summaryYAMLMetrics
return.assertFormalInterestingGroups
in camel case.checkInterestingGroups
since this code is still present on
bcbioRNASeq master branch.sampleMetadata<-
assignment generic.assert_formal_interesting_groups
assert checks. Deprecated
checkInterestingGroups
.copyToDropbox
function, which enables input of a list of local files
and returns Dropbox paths using rdrop2.abort
,
inform
, and warn
.prepareSampleMetadata
.assignAndSaveData
, loadData
, loadRemoteData
, saveData
from
basejump package.prepareTemplate
function to use internally stored data.Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.