Add doublet detection to results #364
Replies: 3 comments 3 replies
-
Just popping in to say that I'm going to start reading up on this analysis, including linked papers and more that may be useful. I'll post back here with my thoughts based on the literature and where we might begin. |
Beta Was this translation helpful? Give feedback.
-
Jotting down a quick thought that is not yet fully fleshed out, but don't want to forget it! Potentially as part of benchmarking (or perhaps a separate analysis in and of itself?), we can also compare doublets to cell type annotations. I would hypothesize an association between cells with unknown labels (which |
Beta Was this translation helpful? Give feedback.
-
Noting this additional python-based method I hadn't come across yet as something we could try, but generally speaking I'm not sure we have enough cells in most of the processed data to get quite as robust inferences - https://github.com/kostkalab/vaeda |
Beta Was this translation helpful? Give feedback.
-
Proposed analysis
In the ScPCA workflow, we do not currently do any analysis or flagging of doublet cells. Adding flagging of potential doublets could improve data quality for cell typing and many other analyses.
Scientific goals
The presence of doublet cells disrupt many types of analyses, including cell typing and analysis of differential expression among groups of cells. TO that end it would be good to flag and ultimately remove those cells before downstream analysis.
We should start by exploring doublet identification methods that may be available and applicable to the ScPCA project, and implement one across the atlas.
Methods or approach
I expect that we will want to use the
scDblFinder
package. This method does not require genetic data or cellhashing, which we do not have in most cases, but it should be more robust than applying a maximum count cutoff, as has sometimes been proposed.There is a good overview and tutorial for the
scDblFinder
package as part of the OSCA book: https://bioconductor.org/books/release/OSCA.advanced/doublet-detection.html`I would expect we will begin with a bit of testing and benchmarking of this method and potentially others. After initial analysis, we can implement a robust workflow for flagging and/or removing likely doublet cells.
Existing modules
None
Input data
All! If we do use
scDblFinder
, this will be the SCE objects, but other methods may use other systems.Scientific literature
Germain et al. (2021) Doublet Identification in Single-Cell Sequencing Data Using scDblFinder. https://doi.org/10.12688/f1000research.73600.1.
Dahlin et al. (2018) A single-cell hematopoietic landscape resolves 8 lineage trajectories and defects in Kit mutant mice. https://doi.org/10.1182/blood-2017-12-821413
Other details
No response
Beta Was this translation helpful? Give feedback.
All reactions