Seurat: Downsampling

Created on 5 Apr 2019 · 2Comments · Source: satijalab/seurat

Hi,
If there are different number of cells in different conditions (or technology), are there any issues with bias in the integration workflow for clustering? I would imagine if condition A has many more cells than condition B, then the clustering would be biased towards the cluster/cell types in condition A. If this is the case, are there strategies to deal with it such as downsampling. Are there any examples in of the workflow examples?
Thanks.

Pankaj

Source

pagarwal14

👍1

Most helpful comment

Larger datasets also tend to contain more information, so we are not inherently concerned about imbalance. However, you can certainly downsample objects if you wish (i.e. to sample 1k cells)

object.downsample = subset(object, cells = sample(Cells(object), 1000))

satijalab on 5 Apr 2019

👍5

All 2 comments

Larger datasets also tend to contain more information, so we are not inherently concerned about imbalance. However, you can certainly downsample objects if you wish (i.e. to sample 1k cells)

object.downsample = subset(object, cells = sample(Cells(object), 1000))

satijalab on 5 Apr 2019

👍5

@satijalab Would you recommend doing the integration with all the data and maybe subsample unbalanced dataset for clustering?

yueqiw on 24 Apr 2019

👍1

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Issue with "hdf5r" for Seurat installation

fly4all · 3Comments

Merge clusters

kathirij · 3Comments

How can i visualize TSNE by cell.id. instead of cluster?

kysbbubbu · 3Comments

Seurat Error: Duplicate cell names

farhanma · 3Comments

Workflow clarification

GHAStVHenry · 3Comments