NVIDIA / cutlass Public

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 6.1k

Code
Issues 206
Pull requests 30
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Issues: NVIDIA/cutlass

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

206 Open 1,027 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[BUG] EVT seems to be producing wrong results ? - Needs Triage bug

Something isn't working

#2068 opened Jan 28, 2025 by alexsamardzic

[QST] How to use an Iterator over parameters ? - Needs Triage question

Question

#2067 opened Jan 28, 2025 by IzanCatalan

[BUG] CUTLASS 3.8 does not compile for 90a with CUDA 12.6.85 ? - Needs Triage bug

Something isn't working

#2064 opened Jan 26, 2025 by manishucsd

[QST] Cute docs need a concrete example using tensor cores ? - Needs Triage question

Question

#2063 opened Jan 25, 2025 by capybara-club

[QST] Fused mma and outer product ? - Needs Triage question

Question

#2062 opened Jan 25, 2025 by capybara-club

[QST] how to use example 67? ? - Needs Triage question

Question

#2060 opened Jan 25, 2025 by ginowu

[QST] How to implement a fused mixed precision matrix multiplication such as w4a4 + w16a16? ? - Needs Triage question

Question

#2058 opened Jan 24, 2025 by hyx1999

[QST] Cutlass Python not showing custom mofifications ? - Needs Triage question

Question

#2057 opened Jan 24, 2025 by IzanCatalan

[QST] In what scenarios or applications are gemm_with_reduction and gemm_with_k_reduction applied? ? - Needs Triage question

Question

#2056 opened Jan 23, 2025 by danielhua23

[QST]Why Does CUTLASS Handle the First K Dimension Separately in Matrix Multiplication? ? - Needs Triage question

Question

#2055 opened Jan 23, 2025 by ziyuhuang123

[QST] in implicit gemm conv, why does not support split-k when group !=1 ? ? - Needs Triage question

Question

#2049 opened Jan 21, 2025 by preFiredman

[QST] Terminology question on GMMA::ScaleOut::One ? - Needs Triage question

Question

#2046 opened Jan 17, 2025 by haeunlee99

[FEA] Does it supports quantization-matrix-mul? ? - Needs Triage feature request

New feature or request

#2044 opened Jan 17, 2025 by bianxuxuxu

[BUG][QST] Hopper Grouped GEMM Fails When Workspace not aligned at 64, but MinWorkspaceAlignment =16 ? - Needs Triage bug

Something isn't working

#2042 opened Jan 16, 2025 by ankutalev

[BUG] Modifying the block/warptile shapes and the output datatype in the unit test causes the tests to fail. ? - Needs Triage bug

Something isn't working

#2041 opened Jan 16, 2025 by xiaonans

[QST] link invalid in efficient_gemm.md ? - Needs Triage question

Question

#2038 opened Jan 13, 2025 by unship

[QST]Question about the picture in documentation Efficient GEMM in CUDA ? - Needs Triage question

Question

#2034 opened Jan 9, 2025 by sleepwalker2017

[BUG] Logic issue in nondeterministic reduction mode of Stream-K tile scheduler. ? - Needs Triage bug

Something isn't working

#2027 opened Jan 7, 2025 by allispaul

[QST] What is API version compatibility? ? - Needs Triage question

Question

#2025 opened Jan 6, 2025 by ZzEeKkAa

[QST] why have Int<2>{} in coalesce_x function when last shape value equal to constant one. ? - Needs Triage question

Question

#2023 opened Jan 5, 2025 by Shan19900305

[QST] why the implementation of f16xs8 mixed gemm is different between TRT-LLM and native cutlass mixed gemm example? ? - Needs Triage question

Question

#2022 opened Jan 5, 2025 by danielhua23

[BUG] Memory corruption/undefined behavior on GemmUniversal in 3.4.0 - 3.6.0 🐛 ? - Needs Triage bug

Something isn't working

#2017 opened Dec 28, 2024 by warpuv

[QST]Why Does CUTLASS Use 3-4-3 Swizzle? ? - Needs Triage question

Question

#2015 opened Dec 27, 2024 by ziyuhuang123

[QST]How Does TMA Work in CUTLASS for Writing from Shared Memory to Global Memory? ? - Needs Triage inactive-30d question

Question

#2008 opened Dec 23, 2024 by ziyuhuang123

[BUG] wmma should be enabled w/ clang. ? - Needs Triage bug

Something isn't working

#2006 opened Dec 20, 2024 by Artem-B

Previous 1 2 3 4 5 … 8 9 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly