Seurat: Downsampling

Created on 5 Apr 2019  路  2Comments  路  Source: satijalab/seurat

Hi,
If there are different number of cells in different conditions (or technology), are there any issues with bias in the integration workflow for clustering? I would imagine if condition A has many more cells than condition B, then the clustering would be biased towards the cluster/cell types in condition A. If this is the case, are there strategies to deal with it such as downsampling. Are there any examples in of the workflow examples?
Thanks.

  • Pankaj

Most helpful comment

Larger datasets also tend to contain more information, so we are not inherently concerned about imbalance. However, you can certainly downsample objects if you wish (i.e. to sample 1k cells)

object.downsample = subset(object, cells = sample(Cells(object), 1000))

All 2 comments

Larger datasets also tend to contain more information, so we are not inherently concerned about imbalance. However, you can certainly downsample objects if you wish (i.e. to sample 1k cells)

object.downsample = subset(object, cells = sample(Cells(object), 1000))

@satijalab Would you recommend doing the integration with all the data and maybe subsample unbalanced dataset for clustering?

Was this page helpful?
0 / 5 - 0 ratings

Related issues

fly4all picture fly4all  路  3Comments

kathirij picture kathirij  路  3Comments

kysbbubbu picture kysbbubbu  路  3Comments

farhanma picture farhanma  路  3Comments

GHAStVHenry picture GHAStVHenry  路  3Comments