make all 3 gemms in Float8Linear support configurability, not user facing #315
Conversation
inpt_tensor: torch.Tensor,
linear_mm_config: LinearMMConfig,
reduce_amax: bool = False,
gemm_input_role: GemmInputRole = GemmInputRole.X,
maybe no default value for gemm role?
we can clean this up in a separate PR; there is extra complexity because we'd need to change the argument order
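(For reference, a rough sketch of why the argument order is the sticking point: a required parameter can't simply be appended after the existing defaulted ones, so making `gemm_input_role` required means either reordering the positional arguments or making it keyword-only. The function names and trimmed signatures below are illustrative, not the actual API.)

```python
import torch

# Option A (hypothetical): make the role required by moving it ahead of the
# defaulted arguments. This changes the positional order, so any existing
# positional callers would break.
def to_float8_reordered(
    inpt_tensor: torch.Tensor,
    linear_mm_config,      # LinearMMConfig
    gemm_input_role,       # GemmInputRole, now required
    reduce_amax: bool = False,
):
    ...

# Option B (hypothetical): keep the current order and make the role
# keyword-only, which avoids reordering but forces callers to pass it by name.
def to_float8_kwonly(
    inpt_tensor: torch.Tensor,
    linear_mm_config,      # LinearMMConfig
    reduce_amax: bool = False,
    *,
    gemm_input_role,       # required, keyword-only
):
    ...
```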
self.backward_config = ScaledMMConfig(
    emulate, False, False, config.pad_inner_dim
# TODO(future): user level configuration of gemms
self.linear_mm_config = LinearMMConfig(
[I think this might be another stylistic thing, so no need to change]:
I would actually make this a func and document it thoroughly. It's not very clear from reading this what everything does, so I would explain in that func the exact recipe we choose by default.
this isn't user facing, so we can clean it up at any time
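As a side note, here is a minimal sketch of the kind of documented helper the comment above is asking for. It assumes `ScaledMMConfig` fields are `(emulate, use_fast_accum, fp8_output, pad_inner_dim)` (inferred from the diff above), that `LinearMMConfig` groups one `ScaledMMConfig` per gemm in the order output, grad_input, grad_weight, and that both are importable from `float8_experimental.float8_tensor`; the helper name and the exact default recipe are illustrative, not what the PR ships.

```python
from float8_experimental.float8_tensor import LinearMMConfig, ScaledMMConfig


def make_default_linear_mm_config(emulate: bool, pad_inner_dim: bool) -> LinearMMConfig:
    """Build the default per-gemm configs for Float8Linear (illustrative recipe).

    * output      (y = x @ w_t):         fast accum on in the forward gemm
    * grad_input  (dL_dX = dL_dY @ w):   fast accum off for backward numerics
    * grad_weight (dL_dW = x_t @ dL_dY): fast accum off for backward numerics
    """
    # ScaledMMConfig fields assumed: (emulate, use_fast_accum, fp8_output, pad_inner_dim)
    return LinearMMConfig(
        ScaledMMConfig(emulate, True, False, pad_inner_dim),   # output
        ScaledMMConfig(emulate, False, False, pad_inner_dim),  # grad_input
        ScaledMMConfig(emulate, False, False, pad_inner_dim),  # grad_weight
    )
```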
@@ -76,6 +84,7 @@ def float8_cat(aten_op, args, kwargs=None):
scale = chunked_tensors[0]._scale
Future PR:
We should share code between unflatten/flatten and here, to just splat out the extra metadata that lives on the tensor.
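A rough sketch of what that shared helper could look like, assuming the extra metadata on a `Float8Tensor` is `_scale`, `_orig_dtype`, `_linear_mm_config`, and `_gemm_input_role` (attribute names inferred from the surrounding diffs; the helper itself is hypothetical):

```python
def get_float8_metadata(t) -> dict:
    # Everything a Float8Tensor carries besides the raw fp8 payload (_data).
    # A single helper like this could back __tensor_flatten__/__tensor_unflatten__
    # and ops such as float8_cat, instead of each call site listing the fields by hand.
    return {
        "scale": t._scale,
        "orig_dtype": t._orig_dtype,
        "linear_mm_config": t._linear_mm_config,
        "gemm_input_role": t._gemm_input_role,
    }
```

A call site like float8_cat could then rebuild the result from the concatenated data plus this dict, provided the constructor's keyword names line up with the keys above.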
…not user facing"

Summary: This PR adds some plumbing for how to eventually make all 3 gemms in a linear fwd/bwd configurable:

1. add `LinearMMConfig` to `Float8Tensor` to tie together the three `ScaledMMConfig` objects, one per gemm
2. add `GemmInputRole` to `Float8Tensor` to specify how to pick the right config
3. plumb all of these throughout the codebase

Note that none of this is user facing, and there is no logic change. Planned follow-ups:

* a future PR will make the per-gemm behavior configurable in a user facing way, which will hook up to the objects introduced in this PR
* a future PR will update the naming from x/w/dL_dY to input/weight/grad_output throughout the codebase

Test Plan:
```
./test/test_everything.sh
```

Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
This pull request has been merged in c58fb5d.
Stack from ghstack (oldest at bottom):
Summary:

This PR adds some plumbing for how to eventually make all 3 gemms in a linear fwd/bwd configurable:

1. add `LinearMMConfig` to `Float8Tensor` to tie together the three `ScaledMMConfig` objects, one per gemm
2. add `GemmInputRole` to `Float8Tensor` to specify how to pick the right config
3. plumb all of these throughout the codebase

Note that none of this is user facing, and there is no logic change. Planned follow-ups:

* a future PR will make the per-gemm behavior configurable in a user facing way, which will hook up to the objects introduced in this PR
* a future PR will update the naming from x/w/dL_dY to input/weight/grad_output throughout the codebase

Test Plan: `./test/test_everything.sh`

Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D59973551
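For readers new to the design, here is a rough sketch (not the code in this PR) of how `GemmInputRole` and `LinearMMConfig` are meant to interact: the roles of the two operands of a scaled mm determine which of the three per-gemm configs applies. The `LinearMMConfig` field names and the standalone definitions below are assumptions for illustration; the role names x/w/dL_dY follow this PR's naming.

```python
from collections import namedtuple
from enum import Enum, auto


class GemmInputRole(Enum):
    X = auto()      # activation
    W = auto()      # weight
    DL_DY = auto()  # incoming gradient


# one per-gemm config entry for each of the three gemms in a linear fwd/bwd
# (field names assumed for illustration)
LinearMMConfig = namedtuple("LinearMMConfig", ["output", "grad_input", "grad_weight"])


def choose_scaled_mm_config(a_role, a_mm_config, b_role, b_mm_config):
    """Pick the per-gemm config based on which gemm the two operands form.

    y     = x @ w_t      -> output config
    dL_dX = dL_dY @ w    -> grad_input config
    dL_dW = x_t @ dL_dY  -> grad_weight config
    """
    roles = {a_role, b_role}
    if roles == {GemmInputRole.X, GemmInputRole.W}:
        assert a_mm_config.output == b_mm_config.output
        return a_mm_config.output
    elif roles == {GemmInputRole.DL_DY, GemmInputRole.W}:
        assert a_mm_config.grad_input == b_mm_config.grad_input
        return a_mm_config.grad_input
    elif roles == {GemmInputRole.X, GemmInputRole.DL_DY}:
        assert a_mm_config.grad_weight == b_mm_config.grad_weight
        return a_mm_config.grad_weight
    raise AssertionError(f"unexpected roles: {a_role}, {b_role}")
```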