forked from state-spaces/mamba
Commit
Adding ROCm support [AMD official] (state-spaces#359)
* Current progress
* Current progress
* FWD kernel compiles
* Current progress: compiles and imports, not tested
* Adapting tests
* working benchmark for torch hipify
* Removing hard-coded device typos
* all dtypes support
* fixed parallel compile issue
* use optimized causal_conv1d if available
* april18 perf bench script
* april18 perf bench script
* Delete csrc/selective_scan/selective_scan_fwd_kernel_minimal.cuh
* reverted benchmark
* reverted changes to base iteration 1
* removed files not in base
* Ported bwd changes (partial)
* Backward working fp32
* all dtypes with bwd
* gitignore hipfied files
* rocm cond and move max min to common
* triton autotune conditional
* Unifying setup.py (in progress)
* triton conditional autotune configs
* some more conditional compiles
* Setup.py functional
* Functional
* Minmax changes
* reduce repeatibility
* Removed extra comments
* fix template error
* Update csrc/selective_scan/reverse_scan.cuh
  Co-authored-by: Jeff Daily <[email protected]>
* restore permissions to base
* permission for gitignore and readme
* warp size based on code review
* Adding ifndef + warnings for dynamic memory size adjustment
* minor chnages to setup
* fall back for warp size conditional
* patch method updated
* Minor stylistic changes + an extra warning about patching
* 4096 knloads patch
* Cleanup, conditional kernel launch parameters
* Flexible warp size
* Fix warp size to 32 for CUDA

Co-authored-by: Arseny Moskvichev <[email protected]>
Co-authored-by: ajassani <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: Adeem Jassani <[email protected]>
Co-authored-by: Gabe Weisz <[email protected]>
Co-authored-by: ajassani <[email protected]>
Co-authored-by: Jeff Daily <[email protected]>
Co-authored-by: root <[email protected]>
1 parent c2568f5, commit 3c77dcf
Showing 11 changed files with 479 additions and 142 deletions.
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | `@@ -2,3 +2,5 @@` |
| 2 | 2 | `*.egg-info/` |
| 3 | 3 | `build/` |
| 4 | 4 | `**.so` |
| | 5 | `+*.hip` |
| | 6 | `+*_hip.*` |
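To illustrate what the two added patterns (`*.hip` and `*_hip.*`) would exclude, here is a minimal Python sketch. It approximates ignore-file matching for slash-free patterns by testing only the file's basename with `fnmatch`; this is a simplification, not git's actual matcher. The file names in the loop are hypothetical examples of the "hipfied files" the commit message mentions, not paths taken from the commit.

```python
from fnmatch import fnmatchcase

# The two patterns added by the hunk above.
NEW_PATTERNS = ["*.hip", "*_hip.*"]


def is_ignored(path: str) -> bool:
    """Approximate ignore matching for slash-free patterns.

    Assumption: a pattern without a slash is compared against the
    basename of the path, at any directory depth.
    """
    basename = path.rsplit("/", 1)[-1]
    return any(fnmatchcase(basename, pattern) for pattern in NEW_PATTERNS)


# Hypothetical file names, used only for illustration.
for candidate in [
    "csrc/selective_scan/selective_scan.hip",
    "csrc/selective_scan/selective_scan_hip.cpp",
    "csrc/selective_scan/selective_scan.cu",
]:
    print(candidate, "->", "ignored" if is_ignored(candidate) else "tracked")
```

Under these assumptions, generated files carrying a `.hip` extension or a `_hip` suffix before the extension stay out of version control, while the original CUDA sources remain tracked.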