Scanvi best practices

crg · July 1, 2021, 1:53pm

Hi,

I’ve been reviewing the latest changes in SCANVI. I noted the removal of pretraining scvi and that the option to do pretraining was moved to the from_scvi_model() method.

Could you elaborate on whether it is still best practice to pretrain with scvi, or whether just using scanvi on its own is better?

Thanks
Charlotte

adamgayoso · July 1, 2021, 2:58pm

It is still best practice to pretrain a SCVI model and then instantiate SCANVI with the from_scvi_model class method. We moved this around for API reasons.

crg · July 1, 2021, 3:37pm

Thanks for the quick response!

grst · September 15, 2021, 7:02am

Does it make a difference if I already add the labels_key when training the SCVI model, i.e.

scvi.data.setup_anndata(adata, batch_key='batch', labels_key="seed_labels")
scvi_model = scvi.model.SCVI(adata)
scvi_model.train()
scanvi_model = scvi.model.SCANVI.from_scvi_model(scvi_model, 'Unknown')
scanvi_model.train()

or if I add the labels only when running SCANVI?

scvi.data.setup_anndata(adata, batch_key="batch")
scvi_model = scvi.model.SCVI(adata)
scvi_model.train()
scvi.data.setup_anndata(adata, batch_key="batch", labels_key="seed_labels")
scanvi_model = scvi.model.SCANVI.from_scvi_model(scvi_model, "Unknown", adata=adata)
scanvi_model.train()

The former method is used in the “seed labelling” tutorial, the latter in the “atlas-level integration” tutorial.

adamgayoso · September 15, 2021, 3:14pm

@grst either way will work, setup anndata just creates a dictionary in adata.uns["_scvi"] and the labels won’t do anything to SCVI. This will be more explicit in a future release.

kanefos · November 28, 2022, 8:30pm

Hi,

I have a similar question. If I have a scvi model trained with one labels_key, but then later want to use this scvi model to create a scanvi model but predicting a different set of labels (a new labels_key) - is this possible? Can I replace the original labels_key?

adamgayoso · November 28, 2022, 10:17pm

In this case, it’s better not to provide the labels key to scvi, as the only thing this enables is gene-label specific dispersion parameters.

If you do provide the labels_key, what you want will not be possible; however, if you only provide the labels at the time of scanvi initialization it is possible

Topic		Replies	Views
Posterior probability of being assigned to a specific label scvi-tools scanvi	4	412	July 28, 2021
Label transfer with SCVI-SCANVI pipeline changes (predicts wrong) labels in ref data scvi-tools scanvi , scvi	8	574	July 31, 2023
Label Transfer Discrepancy in scANVI Model Training scvi-tools	2	133	January 22, 2024
Mapping old scANVI code to 0.16.2 scvi-tools	8	596	June 1, 2022
Consistent results when using scvi.model.SCVI scvi-tools scvi , developer	3	584	May 11, 2022

Scanvi best practices

Related Topics