Posterior Variance for totalVI Estimates

amesch · April 29, 2021, 12:36am

Hello, I have been using totalVI for protein imputation for CITE-seq data. I am interested in possibly constructing confidence intervals for the protein estimates in my data and was wondering if the user is able to output the estimated posterior variance using totalVI?

Thank you!

adamgayoso · April 29, 2021, 3:53am

You can construct credible intervals by setting return_mean=False and n_samples>1 in the normalized expression function. Note that this will return a tensor of samples by cells by proteins.

Using the snippet from the tutorial

_, protein_means_samples = model.get_normalized_expression(
    n_samples=25,
    transform_batch="PBMC10k",
    include_protein_background=True,
    sample_protein_mixing=False,
    return_mean=False,
)

Also note that this credible interval would be constructed over posterior samples of the latent variables. In other words, the variation comes from sampling the latent variables, as the imputed values are a deterministic function of these. This can be seen around Equation 16 in the Nature Methods version of the manuscript, where the credible interval would be over the mean of this zero-inflated gamma distribution, as opposed to the random variable itself. There’s also a way you could sample from the likelihood p(y | ...) and get counts, such that it would be more like a counterfactual posterior predictive sample.

Topic		Replies	Views
Running TOTALVI data in which subset of cells do not have citeseq data scvi-tools integration , totalvi	8	504	March 25, 2021
TOTALVI RNA/protein analysis for R users scvi-tools	5	477	April 9, 2021
TotalVI log normalization and non-negativity scvi-tools totalvi	4	350	September 11, 2022
Comparing steps of Scanpy for scRNQ-seq and totalvi for CITE-seq scvi-tools totalvi	6	580	October 8, 2021
scVI imputation confusion scvi-tools scvi , imputation	1	575	June 16, 2021

Posterior Variance for totalVI Estimates

Related Topics