Add a CPU check and CI (#15) #25

Draft
wants to merge 3 commits into
base: main

qihqi (Collaborator) commented Jan 10, 2025

Similar to #13, we also add a CPU GitHub Action. This action will run `pytest` on the repo.

Currently there is only one test, which is the Llama test in torch_xla_models.

In order to run the test today, we need an HF_TOKEN. I created a personal read-only token, and #14 tracks avoiding the need for HF_TOKEN, after which I'll need to remember to invalidate the token.
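
For illustration only, here is a minimal sketch of how a test could be gated on HF_TOKEN so that a run without the secret skips rather than fails. It is not taken from this PR: `get_hf_token` and `LlamaSmokeTest` are hypothetical names, and the real Llama test in torch_xla_models may handle the token differently. The sketch uses the standard library's `unittest`, which pytest can also collect.

```python
import os
import unittest


def get_hf_token(env=None):
    """Return the Hugging Face token from the environment, or None if unset or blank."""
    env = os.environ if env is None else env
    token = env.get("HF_TOKEN", "").strip()
    return token or None


# Hypothetical test class: skipped entirely when HF_TOKEN is missing.
@unittest.skipUnless(get_hf_token(), "HF_TOKEN is not set")
class LlamaSmokeTest(unittest.TestCase):
    def test_token_is_visible(self):
        # A real test would load the model here; this placeholder only
        # checks that the token reached the test process.
        self.assertTrue(get_hf_token())
```

In CI, the token would be exposed to the test process as the `HF_TOKEN` environment variable; once #14 lands, a gate like this would no longer be needed.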

@@ -0,0 +1,29 @@
name: CPU tests
Collaborator

Looks alright, but needs a rebase.


@functools.partial(
    jax.jit,
    out_shardings=name_to_sharding,
Collaborator

Did you test this on 405B? When I did something similar locally, the jit worked for 8B but the graph used >1TiB of HBM on 405B on two slices. The problem I encountered was that each device was still trying to generate the entire weight.
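
To make the question concrete, here is a small sketch of the pattern under discussion: initializing weights inside `jax.jit` with `out_shardings`, then inspecting the per-device shard shapes. The mesh axis name `fsdp`, the parameter names, and the shapes are assumptions made up for this example and are not from the PR. It runs on a single CPU device, so it only shows how the outputs end up laid out, not how much HBM the compiled graph uses.

```python
import functools

import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# 1-D mesh over whatever devices are available (a single CPU device also works).
devices = jax.devices()
mesh = Mesh(np.array(devices), axis_names=("fsdp",))

# Hypothetical parameter names and shapes, sized so the sharded
# dimension divides evenly across the devices.
n = len(devices)
shapes = {"wq": (8 * n, 16), "wo": (16, 8 * n)}
name_to_sharding = {
    "wq": NamedSharding(mesh, P("fsdp", None)),
    "wo": NamedSharding(mesh, P(None, "fsdp")),
}


@functools.partial(jax.jit, out_shardings=name_to_sharding)
def init_weights(key):
    # One subkey per parameter; the outputs are laid out per out_shardings.
    keys = jax.random.split(key, len(shapes))
    return {
        name: jax.random.normal(k, shape, dtype=jnp.float32)
        for k, (name, shape) in zip(keys, shapes.items())
    }


weights = init_weights(jax.random.PRNGKey(0))
for name, w in weights.items():
    # Each addressable shard holds only a slice of the full array.
    print(name, w.shape, w.addressable_shards[0].data.shape)
```

The shard shapes confirm only the output layout. Whether each device still materializes the full weight before slicing depends on how the compiler partitions the random-number generation; JAX's `jax_threefry_partitionable` config option affects that and may be worth checking at 405B scale.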

tengyifei and others added 3 commits January 13, 2025 17:50
2 participants