How to determine the value of the n_gems parameter #23

wolfQK · 2024-12-04T09:56:23Z

Hi axelalmet ~
I noticed that your analysis code only uses two values for n_gems, 10 and 20. Could you please explain what criteria you used to determine the appropriate value for the n_gems parameter? Thanks! @axelalmet

fs.pp.construct_gems_using_pyliger(adata,
                                n_gems = 10,
                                layer_key = 'counts',
                                condition_key = condition_key)

The text was updated successfully, but these errors were encountered:

axelalmet · 2024-12-05T14:35:40Z

Hi wolfQK,

This is a good question. For the applications considered in the paper, setting n_gems to be 10 or 20 worked pretty well in terms of capturing meaningful differences with respect to 1) cell state heterogeneity or 2) capturing spatially meaningful modules (that lined up with cell region annotation) 3) different biological conditions, e.g., healthy vs moderate vs severe COVID-19. This was evaluated by me looking at mean cellwise membership for each of the modules with respect to meaningful cell labels like condition or cell type annotation. When I originally was analysing the datasets, I considered the case where I had set 5, 10, 15, ... etc GEMs and found that, often, 10 or 20 worked best.

But in general, there's no reason you have to pick 10 or 20 GEMs. I think picking the right number of GEMs is an incredibly non-trivial exercise, and, to the best of my knowledge, there's no single good method for choosing the number of factors for a matrix factorisation-based method. I know cNMF uses the silhouette score, but the literature has shown that this can have its drawbacks.

Best wishes,
Axel.

majeex233 · 2025-02-24T01:53:16Z

Hi axelalmet,
Thanks for your inspired answer! I am very interested to know which aspects of the results will be impacted after altering the parameters of nGEM？
Hoping for your earliest reply!

Best wishes,
Bella.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to determine the value of the n_gems parameter #23

How to determine the value of the n_gems parameter #23

wolfQK commented Dec 4, 2024

axelalmet commented Dec 5, 2024

majeex233 commented Feb 24, 2025

How to determine the value of the n_gems parameter #23

How to determine the value of the n_gems parameter #23

Comments

wolfQK commented Dec 4, 2024

axelalmet commented Dec 5, 2024

majeex233 commented Feb 24, 2025