The goal of BRETIGEA (BRain cEll Type specIfic Gene Expression Analysis) is to estimate and/or deconvolute relative cell type proportions from bulk gene expression data.
BRETIGEA simplifies the process of defining your own set of brain cell type marker genes by using a well-validated set of cell type-specific marker genes derived from multiple types of experiments, as described in our manuscript, McKenzie and Wang et al 2018. For brain tissue data sets, there are marker genes available for astrocytes, endothelial cells, microglia, neurons, oligodendrocytes, and oligodendrocyte precursor cells, derived from each of human, mice, and combination human/mouse data sets. We also provide markers from an alternative source that leveraged variation among intact tissue samples, Kelley et al 2018. However, if you have access to your own marker genes, the functions can be applied to bulk gene expression data from any tissue.
BRETIGEA also implements multiple options for relative cell type proportion estimation using these marker genes, adapting and expanding on approaches from the ‘CellCODE’ R package described in Chikina et al 2015. The number of cell type marker genes used in a given analysis can be increased or decreased based on your preferences and the data set. Finally, BRETIGEA provides functions to use the estimates to adjust for variability in the relative proportion of cell types across samples (i.e., deconvolute) prior to downstream analyses.
You can install BRETIGEA from CRAN with:
install.packages("BRETIGEA")
You can install the development version of BRETIGEA from Github, which is recommended to use the most updated version, with:
# install.packages("devtools")
::install_github("andymckenzie/BRETIGEA") devtools
Using example data from the Allen Brain Atlas, a subset of which is available in the package.
library(BRETIGEA)
library(knitr) #only for visualization
str(aba_marker_expression, list.len = 10) #input data format
str(aba_pheno_data) #input data format
= brainCells(aba_marker_expression, nMarker = 50)
ct_res kable(head(ct_res)) #output data format
= cor.test(ct_res[, "mic"], as.numeric(aba_pheno_data$ihc_iba1_ffpe), method = "spearman")
cor_mic print(cor_mic)
= cor.test(ct_res[, "ast"], as.numeric(aba_pheno_data$ihc_gfap_ffpe), method = "spearman")
cor_ast print(cor_ast)
See the basic vignette for help with getting started here: https://github.com/andymckenzie/BRETIGEA/blob/master/inst/doc/BRETIGEA_basic.pdf
You can view the manuscript describing BRETIGEA in detail as well as several applications here:
https://www.nature.com/articles/s41598-018-27293-5
McKenzie AT, Wang M, Hauberg ME, et al. Brain Cell Type Specific Gene Expression and Co-expression Network Architectures. Sci Rep. 2018;8(1):8868. See also: http://celltypes.org/
Chikina M, Zaslavsky E, Sealfon SC. CellCODE: a robust latent variable approach to differential expression analysis for heterogeneous cell populations. Bioinformatics. 2015;31(10):1584-91.
Miller JA, Guillozet-bongaarts A, Gibbons LE, et al. Neuropathological and transcriptomic characteristics of the aged brain. Elife. 2017;6. Available from: http://aging.brain-map.org/
Kelley KW, Nakao-inoue H, Molofsky AV, Oldham MC. Variation among intact tissue samples reveals the core transcriptional features of human CNS cell classes. Nat Neurosci. 2018;21(9):1171-1184. Alternative marker data from: http://oldhamlab.ctec.ucsf.edu/