Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding memory required for the first tutorial of cell2location #360

Open
2 of 4 tasks
sudeepthi5 opened this issue May 3, 2024 · 2 comments
Open
2 of 4 tasks
Labels
question Further information is requested

Comments

@sudeepthi5
Copy link

sudeepthi5 commented May 3, 2024

Please use the template below to post a question to https://discourse.scverse.org/c/ecosytem/cell2location/.

Problem

I am trying to run the example in the following page:
https://cell2location.readthedocs.io/en/latest/notebooks/cell2location_tutorial.html#Loading-packages

I used the following to request my node:
srun --ntasks-per-node=4 --cpus-per-task=4 --mem=32000 --time=12:00:00 --partition=amperenodes --job-name=cell2location --gres=gpu:2

So in the training step I am getting 221/250, instead of training complete, and always I am getting error saying 'use_gpu' command is unrecognized. I would like to request you to provide memory requirement details for tutorial. I will provide any other information if required, can you please help me with this, Thank you.

@sudeepthi5 sudeepthi5 added the question Further information is requested label May 3, 2024
@vitkl
Copy link
Contributor

vitkl commented May 29, 2024

Please make sure that you have installed the latest and compatible versions of packages and used the corresponding options (latest version doesn't have 'use_gpu' argument).

The tutorial can run on Google Colab with 12GB RAM given the same data. If you have larger data - you need more RAM.

Cell2location also doesn't use multiple GPUs (--gres=gpu:2) so unless you are starting two processes manually, you can request only one GPU.

There is a bug in the latest GitHub version that results in data being recorded during posterior sampling - resulting in huge RAM use. I am working on fixing this issue - however for now, you can use use_quantiles option of the posterior sampling and exclude_vars to exclude 'data_target' (if it exists).

@sudeepthi5
Copy link
Author

I will work on the recommended options. Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants