After starting the vLLM backend from the Docker image, GPU memory usage stays at around 20 GB (whether or not it is in use) #530

Open
2 tasks
WQL782795 opened this issue Sep 2, 2024 · 2 comments

@WQL782795

System Info

[root@localhost ~]# docker exec cb001b8e5deb nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Mon_Apr__3_17:16:06_PDT_2023
Cuda compilation tools, release 12.1, V12.1.105
Build cuda_12.1.r12.1/compiler.32688072_0
Uploading smi.jpg…

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Reproduction

Start the vLLM backend using the Docker image.

Expected behavior

GPU memory should be occupied only as much as is actually being used.

@WQL782795
Author

Here is the nvidia-smi screenshot:
smi

@zhipuch zhipuch self-assigned this Sep 2, 2024
@zhipuch
Collaborator

zhipuch commented Sep 3, 2024

What value is your gpu_memory_utilization set to? vLLM preallocates GPU memory based on it.
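
To make the point about preallocation concrete, here is a rough sketch, not taken from this repository. vLLM treats gpu_memory_utilization as the fraction of a GPU's memory it may claim at startup (model weights plus a preallocated KV cache; the documented default is 0.9), and it holds that memory whether or not requests are being served. The helper names, the 24 GiB card size, and the model path below are hypothetical placeholders. `--gpu-memory-utilization` is the corresponding option of vLLM's OpenAI-compatible server.

```python
def vllm_memory_budget_gib(total_gib: float, gpu_memory_utilization: float = 0.9) -> float:
    """Approximate upper bound on the GPU memory vLLM claims at startup.

    This is only the budget arithmetic (fraction x total memory); the figure
    shown by nvidia-smi can differ somewhat (CUDA context, other processes).
    """
    if not 0.0 < gpu_memory_utilization <= 1.0:
        raise ValueError("gpu_memory_utilization must be in (0, 1]")
    return total_gib * gpu_memory_utilization


def api_server_command(model: str, gpu_memory_utilization: float) -> list[str]:
    """Build a launch command for vLLM's OpenAI-compatible server.

    `model` is a placeholder; use whatever path or name your container mounts.
    """
    return [
        "python", "-m", "vllm.entrypoints.openai.api_server",
        "--model", model,
        "--gpu-memory-utilization", str(gpu_memory_utilization),
    ]


if __name__ == "__main__":
    # Hypothetical 24 GiB card: halving the fraction halves the reserved budget.
    print(vllm_memory_budget_gib(24.0, 0.5))
    print(" ".join(api_server_command("/path/to/model", 0.5)))
```

Lowering the value shrinks the reserved pool, but it also shrinks the KV cache and so the number of tokens that can be served concurrently. If it is set too low for the model weights to fit, the server will fail to start.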
