Seurat: Downsampling

Created on 5 Apr 2019 · 2 Comments · Source: satijalab/seurat

Hi,
If different conditions (or technologies) have different numbers of cells, does that bias the integration workflow for clustering? I would imagine that if condition A has many more cells than condition B, the clustering would be biased towards the clusters/cell types in condition A. If that is the case, are there strategies to deal with it, such as downsampling? Are there any examples of this among the workflow examples?
Thanks.

  • Pankaj

All 2 comments

Larger datasets also tend to contain more information, so we are not inherently concerned about imbalance. However, you can certainly downsample objects if you wish (e.g., to sample 1k cells):

object.downsample = subset(object, cells = sample(Cells(object), 1000))
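The one-liner above draws 1,000 cells from the whole object, so the larger condition will still dominate the sample. If the goal is an equal number of cells per condition, one possible approach is to sample within each group first and pass the result to subset(). This is only a minimal sketch: `downsample_cells` is a hypothetical helper (not part of Seurat), and the metadata column name `condition` in the commented lines is an assumption about how your object is annotated.

```r
# Hypothetical helper (not a Seurat function): keep at most n_per_group
# cells from each group, sampling without replacement.
downsample_cells <- function(cell_names, groups, n_per_group, seed = 42) {
  stopifnot(length(cell_names) == length(groups))
  set.seed(seed)  # note: this resets the global RNG state, for reproducibility
  by_group <- split(cell_names, groups)
  sampled <- lapply(by_group, function(cells) {
    # Groups that are already small enough are kept whole
    if (length(cells) <= n_per_group) cells else sample(cells, n_per_group)
  })
  unlist(sampled, use.names = FALSE)
}

# Toy data standing in for real barcodes: 1200 cells in A, 300 in B
cell_names <- paste0("cell", 1:1500)
conditions <- rep(c("A", "B"), times = c(1200, 300))

keep <- downsample_cells(cell_names, conditions, n_per_group = 300)
print(table(conditions[match(keep, cell_names)]))

# With a Seurat object, assuming a metadata column called "condition":
# keep <- downsample_cells(Cells(object), object$condition, n_per_group = 300)
# object.downsample <- subset(object, cells = keep)
```

Fixing the seed makes the subsample reproducible. Whether balancing is worth the lost cells depends on the data, given the point above that larger datasets carry more information.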

@satijalab Would you recommend running the integration on all the data and then subsampling the unbalanced datasets for clustering?
