Seurat: Do I have to run first RunUMAP or FindClusters?

Created on 1 Oct 2019  路  3Comments  路  Source: satijalab/seurat

Dear all, dear Seurat team,

I have two general questions.

In your vignettes "Integrating stimulated vs. control PBMC" your are running first RunUMAP and then FindNeighbors (setting the reduction to "pca" so that it will not take the UMAP reduction).
1) For running "FindClusters" you do not specify "reduction = pca" because the clustering is already done with "pca"??? Do I have to add reduction = pca here as well?

immune.combined <- RunUMAP(immune.combined, reduction = "pca", dims = 1:20)
immune.combined <- FindNeighbors(immune.combined, reduction = "pca", dims = 1:20)
immune.combined <- FindClusters(immune.combined, resolution = 0.5)

In the Vignette "Guided Clustering Tutorial" you are running RunUMAP after FindingClusters:
pbmc <- FindNeighbors(pbmc, dims = 1:10)
pbmc <- FindClusters(pbmc, resolution = 0.5)
pbmc <- RunUMAP(pbmc, dims = 1:10)

2) Is that because you are using UMAP here only for visualization? Why is the order different here compared to the other Vignitte?
Just want to understand it =)

Best regards,
Michael

Analysis Question

Most helpful comment

Hi Michael,

  1. FindClusters performs graph-based clustering on the neighbor graph that is constructed with the FindNeighbors function call. This neighbor graph is constructed using PCA space when you specifiy reduction = "pca". You shouldn't add reduction = "pca" to FindClusters.

  2. Yes, UMAP is used here only for visualization so the order of RunUMAP vs FindClusters doesn't really matter (you just want to have run both before plotting with DimPlot in order to visualize the clusters in UMAP space).

All 3 comments

Hi Michael,

  1. FindClusters performs graph-based clustering on the neighbor graph that is constructed with the FindNeighbors function call. This neighbor graph is constructed using PCA space when you specifiy reduction = "pca". You shouldn't add reduction = "pca" to FindClusters.

  2. Yes, UMAP is used here only for visualization so the order of RunUMAP vs FindClusters doesn't really matter (you just want to have run both before plotting with DimPlot in order to visualize the clusters in UMAP space).

If I try different resolutions with FindClusters(), should I run FindNeighbors() multiple times or I only need to run FindNeighbors() only once?

If you are only changing the resolution parameter, you only need to rerun FindClusters.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

farhanma picture farhanma  路  3Comments

mvalenzuelav picture mvalenzuelav  路  3Comments

Biogitte picture Biogitte  路  3Comments

bio-la picture bio-la  路  3Comments

camilliano picture camilliano  路  3Comments