The function BICAM (Bayesian Immune Cell Abundance Model) uses Bayesian hierarchical models to model abundance data for cell populations with a predictor variable. This models each cell type simultaneously while incorporating potential relationships between the cell populations. The relationship utilized in this model is user dependent based on the function inputs for setting up the covariance structure. This package utilizes rjags and runjags packages to run Gibbs sampling for posterior estimation.

Installation

Before running this package, the JAGS (Just Another Gibbs Sampler) program needs to be installed. You can download from https://sourceforge.net/projects/mcmc-jags/files/.

library(BTIME)

Dataset


# dfmif <- read.csv('~/mIF_Test_Data.csv', check.names = FALSE) # Load in data
# dat <- dfmif |> dplyr::select(suid,total,stage,M1,M2,M3) # Construct data to match setup of image below

For setting up your data, you will want to set your data into a data frame. For one predictor/covariate, the setup should be as follows:

1st column: Subject IDs
2nd column: Total Cell Counts
3rd column: Predictor Variable
Last M columns: Parameters of Interest Counts

For two or more predictors/covariates, the setup should be as follows:

1st column: Subject IDs
2nd column: Total Cell Counts
Next ncov columns: Predictor Variables
Last M columns: Parameters of Interest Counts

Example:
Below shows the set up of the data frame for one predictor (A) and two or more predictors (B). For (A), we have “Sub. IDs” in the first column, “Total Cell Count” in the second column, the stage of cancer (0 = “Low” (1 or 2), & 1 = “High” (3 or 4)) as the predictor variable in the third column, and markers M1, M2, and M3 in the last 3 columns (3 markers/parameters of interest). For (B), we have “Sub. IDs” in the first column, “Total Cell Count” in the second column, the stage of cancer and age at diagnosis as the predictor variables in the third & fourth columns, respectively, and markers M1, M2, and M3 in the last 3 columns (3 markers/parameters of interest).

Data setup

# Using BICAM

Required inputs:
dat - Data frame containing data set constructed as shown above
M - Number of parameters/markers of interest
adapt - Number of adaptation iterations for initializing JAGS model (iterations removed from analysis)
burn - Number of burn-in iterations (iterations removed from analysis)
it - Number of sampling iterations used for posterior analysis

Optional inputs:
thin - Thinning Interval (default = 1)
ran_eff - 0 for NO repeated measurements, 1 FOR repeated measurements (default = 1)
ncov - Number of predictors/covariates input in the model (default = 1)
chains - Number of chains to run in the JAGS model (default = 4)
cores - Number of cores to use to run the JAGS model (default = 4) (must be \(\geq\) chains for maximum efficiency)
v0_mu_logit - Predicted proportion of the parameters of interest (default = 0.01 —> 1%)
model - Covariance structure used in Bayesian model (default = “Unstr”) (See “BICAM: Covariance Structures” article for more information about the different covariance structures)
dis - Distance matrix used for exponential decay model (required if using exp. decay model) (default = NULL)
tree - Tree-structured covariance matrix used for tree model (required if using tree or scaled tree model) (default = NULL)
treelevels - List of matrices for the multi-level, tree-structured covariance matrix for ‘TreeLevels’ model (required if using multi-level tree model) (default = NULL)


##################### Unstructured Model #####################
# posterior_unstr <- BICAM(dat = dat,
#                          M = 3,
#                          adapt = 2000,
#                          burn = 100,
#                          it = 100)

##################### Exponential Decay Model #####################

# WILL GET ERROR IF 'dis' MATRIX IS NOT SUPPLIED. 'dis' MUST ALSO BE INVERTIBLE.

# dis <- matrix(c(0,1,1,
#                  1,0,2,
#                  1,2,0),nrow = 3)

# posterior_expdecay <- BICAM(dat = dat,
#                          M = 3,
#                          adapt = 2000,
#                          burn = 100,
#                          it = 100,
#                          model = "ExpDecay",
#                          dis = dis)

##################### Tree Covariance Structure Model #####################

# WILL GET ERROR IF 'tree' MATRIX IS NOT SUPPLIED. 'tree' MUST ALSO BE INVERTIBLE.

# tree <- matrix(c(1,1,1,
#                 1,2,1,
#                 1,1,2),nrow = 3)

# posterior_tree <- BICAM(dat = dat,
#                          M = 3,
#                          adapt = 2000,
#                          burn = 100,
#                          it = 100,
#                          model = "Tree",
#                          tree = tree)

########### Scaled Tree

# posterior_treescaled <- BICAM(dat = dat,
#                          M = 3,
#                          adapt = 2000,
#                          burn = 100,
#                          it = 100,
#                          model = "TreeScaled",
#                          tree = tree)


##################### Multi-Level Tree Covariance Structure Model #####################

# WILL GET ERROR IF 'treelevels' MATRIX IS NOT SUPPLIED.

# treelevels <- list(matrix(c(1,1,1,
#                             1,1,1,
#                             1,1,1),nrow = 3),
#                    matrix(c(0,0,0,
#                             0,1,0,
#                             0,0,1),nrow = 3))

# posterior_tree <- BICAM(dat = dat,
#                          M = 3,
#                          adapt = 2000,
#                          burn = 100,
#                          it = 100,
#                          model = "TreeLevels",
#                          treelevels = treelevels)

BICAM Output

The BICAM function returns a list of 6 items:

Output Item 1: Posterior Samples

Posterior samples for each monitored variable. Below shows an example of a list of first 9 iterations of 6 monitored variables after the burn-in phase.

# posterior_unstr[[1]]

Posterior samples

Output Item 2: Computation Time

Computation time of the entire function (not just sampling).

# posterior_unstr[[2]]

Output Item 3: Model String

String of the JAGS model based on model choice and function inputs. The string uses HTML syntax.

# posterior_unstr[[3]]

Model string

Output Item 4: Initial Values

Lists of initial values for each chain for each variable in the model that follows a distribution. Below is an example of the initial values for the variable ‘p_temp’ for the first chain.

# posterior_unstr[[4]]

Initial values

Output Item 5: Data

Data that the user had input into the JAGS model.

# posterior_unstr[[5]]

Data input

Output Item 6: Monitored Variables

List of variables monitored by the JAGS model.

# posterior_unstr[[6]]

Monitored variables