[Doc] Add vllm-ascend usage doc & fix doc format #53

shen-shanshan · 2025-02-12T09:03:52Z

What this PR does / why we need it?

Add vllm-ascend tutorial doc for Qwen/Qwen2.5-7B-Instruct model serving doc
fix format of files in docs dir, e.g. format tables, add underline for links, add line feed...

Does this PR introduce any user-facing change?

no.

How was this patch tested?

no.

shen-shanshan · 2025-02-12T09:06:35Z

cc:

@Yikun @wangxiyuan @MengqingCao

wangxiyuan · 2025-02-12T09:15:32Z

No need to update installation and quick start doc. They will be updated in new PR.

shen-shanshan · 2025-02-12T09:18:51Z

No need to update installation and quick start doc. They will be updated in new PR.

ok.

docs/source/index.md

docs/source/installation.md

docs/source/quick_start.md

docs/source/running_vllm_with_ascend.md

MengqingCao · 2025-02-14T03:28:11Z

docs/source/running_vllm_with_ascend.md

+
+```bash
+cd /usr/local/Ascend/ascend-toolkit/latest/<arch>-linux  # <arch>: aarch64 or x86_64
+cat ascend_toolkit_install.info


We can just use one instruction

cat ~/Ascend/ascend-toolkit/latest/"$(uname -i)"-linux/ascend_toolkit_install.info

This has been removed now.

docs/source/running_vllm_with_ascend.md

Signed-off-by: Shanshan Shen <[email protected]>

Yikun

Overall, it has been greatly improved compared to the previous version, thank you!

Yikun · 2025-02-14T12:45:36Z

docs/source/tutorials.md

+```bash
+# Use Modelscope mirror to speed up model download
+export VLLM_USE_MODELSCOPE=True
+export MODELSCOPE_CACHE=/root/models/


Suggested change

export MODELSCOPE_CACHE=/root/models/

you can use default cache -v /root/.cache:/root/.cache

Yikun · 2025-02-14T12:52:23Z

docs/source/tutorials.md

+-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
+-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
+-v /etc/ascend_install.info:/etc/ascend_install.info \
+-v /root/models:/root/models \


Suggested change

-v /root/models:/root/models \

-v /root/.cache:/root/.cache \

Yikun · 2025-02-14T12:52:46Z

docs/source/tutorials.md

+-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
+-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
+-v /etc/ascend_install.info:/etc/ascend_install.info \
+-v /root/models:/root/models \


Suggested change

-v /root/models:/root/models \

-v /root/.cache:/root/.cache \

Yikun · 2025-02-14T12:53:05Z

docs/source/tutorials.md

+-v /root/models:/root/models \
+-p 8000:8000 \
+-e VLLM_USE_MODELSCOPE=True \
+-e MODELSCOPE_CACHE=/root/models/ \


Suggested change

-e MODELSCOPE_CACHE=/root/models/ \

Yikun · 2025-02-14T12:53:35Z

docs/source/tutorials.md

+-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
+-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
+-v /etc/ascend_install.info:/etc/ascend_install.info \
+-v /root/models:/root/models \


Suggested change

-v /root/models:/root/models \

-v /root/.cache:/root/.cache \

Yikun · 2025-02-14T12:53:43Z

docs/source/tutorials.md

+```bash
+# Use Modelscope mirror to speed up model download
+export VLLM_USE_MODELSCOPE=True
+export MODELSCOPE_CACHE=/root/models/


Suggested change

export MODELSCOPE_CACHE=/root/models/

Yikun · 2025-02-14T12:54:20Z

docs/source/tutorials.md

+def clean_up():
+    destroy_model_parallel()
+    destroy_distributed_environment()
+    gc.collect()
+    torch.npu.empty_cache()


Looks like a little bit wired, would you mind taking a look? @wangxiyuan

since this is only a simple example, no need to do

del llm clean_up()

shen-shanshan marked this pull request as draft February 12, 2025 09:04

shen-shanshan force-pushed the doc branch from c71743c to ebc859c Compare February 13, 2025 12:04

shen-shanshan marked this pull request as ready for review February 13, 2025 12:04

shen-shanshan force-pushed the doc branch from ebc859c to f0816bf Compare February 14, 2025 02:53

Yikun reviewed Feb 14, 2025

View reviewed changes

MengqingCao reviewed Feb 14, 2025

View reviewed changes

Yikun reviewed Feb 14, 2025

View reviewed changes

docs/source/running_vllm_with_ascend.md Outdated Show resolved Hide resolved

add vllm-ascend tutorials

76fcb75

Signed-off-by: Shanshan Shen <[email protected]>

shen-shanshan force-pushed the doc branch from cbebd7b to 76fcb75 Compare February 14, 2025 10:40

Yikun approved these changes Feb 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Doc] Add vllm-ascend usage doc & fix doc format #53

[Doc] Add vllm-ascend usage doc & fix doc format #53

shen-shanshan commented Feb 12, 2025 •

edited by Yikun

Loading

shen-shanshan commented Feb 12, 2025

wangxiyuan commented Feb 12, 2025

shen-shanshan commented Feb 12, 2025

MengqingCao Feb 14, 2025

shen-shanshan Feb 14, 2025

Yikun left a comment

Yikun Feb 14, 2025

Yikun Feb 14, 2025

Yikun Feb 14, 2025

Yikun Feb 14, 2025

Yikun Feb 14, 2025

Yikun Feb 14, 2025

Yikun Feb 14, 2025

wangxiyuan Feb 16, 2025

	-v /root/models:/root/models \
	-v /root/.cache:/root/.cache \

[Doc] Add vllm-ascend usage doc & fix doc format #53

Are you sure you want to change the base?

[Doc] Add vllm-ascend usage doc & fix doc format #53

Conversation

shen-shanshan commented Feb 12, 2025 • edited by Yikun Loading

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

shen-shanshan commented Feb 12, 2025

wangxiyuan commented Feb 12, 2025

shen-shanshan commented Feb 12, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Yikun left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

shen-shanshan commented Feb 12, 2025 •

edited by Yikun

Loading