Issues: NVIDIA/cutlass
Issues list
[BUG] EVT seems to be producing wrong results
? - Needs Triage
bug
Something isn't working
#2068
opened Jan 28, 2025 by
alexsamardzic
[QST] How to use an Iterator over parameters
? - Needs Triage
question
Question
#2067
opened Jan 28, 2025 by
IzanCatalan
[BUG] CUTLASS 3.8 does not compile for 90a with CUDA 12.6.85
? - Needs Triage
bug
Something isn't working
#2064
opened Jan 26, 2025 by
manishucsd
[QST] Cute docs need a concrete example using tensor cores
? - Needs Triage
question
Question
#2063
opened Jan 25, 2025 by
capybara-club
[QST] Fused mma and outer product
? - Needs Triage
question
Question
#2062
opened Jan 25, 2025 by
capybara-club
[QST] How to implement a fused mixed precision matrix multiplication such as w4a4 + w16a16?
? - Needs Triage
question
Question
#2058
opened Jan 24, 2025 by
hyx1999
[QST] Cutlass Python not showing custom modifications
? - Needs Triage
question
Question
#2057
opened Jan 24, 2025 by
IzanCatalan
[QST] In what scenarios or applications are gemm_with_reduction and gemm_with_k_reduction applied?
? - Needs Triage
question
Question
#2056
opened Jan 23, 2025 by
danielhua23
[QST]Why Does CUTLASS Handle the First K Dimension Separately in Matrix Multiplication?
? - Needs Triage
question
Question
#2055
opened Jan 23, 2025 by
ziyuhuang123
[QST] in implicit gemm conv, why does not support split-k when group !=1 ?
? - Needs Triage
question
Question
#2049
opened Jan 21, 2025 by
preFiredman
[QST] Terminology question on GMMA::ScaleOut::One
? - Needs Triage
question
Question
#2046
opened Jan 17, 2025 by
haeunlee99
[FEA] Does it support quantization-matrix-mul?
? - Needs Triage
feature request
New feature or request
#2044
opened Jan 17, 2025 by
bianxuxuxu
[BUG][QST] Hopper Grouped GEMM Fails When Workspace not aligned at 64, but MinWorkspaceAlignment =16
? - Needs Triage
bug
Something isn't working
#2042
opened Jan 16, 2025 by
ankutalev
[BUG] Modifying the block/warptile shapes and the output datatype in the unit test causes the tests to fail.
? - Needs Triage
bug
Something isn't working
#2041
opened Jan 16, 2025 by
xiaonans
[QST] link invalid in efficient_gemm.md
? - Needs Triage
question
Question
#2038
opened Jan 13, 2025 by
unship
[QST] Question about the picture in documentation Efficient GEMM in CUDA
? - Needs Triage
question
Question
#2034
opened Jan 9, 2025 by
sleepwalker2017
[BUG] Logic issue in nondeterministic reduction mode of Stream-K tile scheduler.
? - Needs Triage
bug
Something isn't working
#2027
opened Jan 7, 2025 by
allispaul
[QST] What is API version compatibility?
? - Needs Triage
question
Question
#2025
opened Jan 6, 2025 by
ZzEeKkAa
[QST] why have Int<2>{} in coalesce_x function when last shape value equal to constant one.
? - Needs Triage
question
Question
#2023
opened Jan 5, 2025 by
Shan19900305
[QST] why the implementation of f16xs8 mixed gemm is different between TRT-LLM and native cutlass mixed gemm example?
? - Needs Triage
question
Question
#2022
opened Jan 5, 2025 by
danielhua23
[BUG] Memory corruption/undefined behavior on GemmUniversal in 3.4.0 - 3.6.0 🐛
? - Needs Triage
bug
Something isn't working
#2017
opened Dec 28, 2024 by
warpuv
[QST]Why Does CUTLASS Use 3-4-3 Swizzle?
? - Needs Triage
question
Question
#2015
opened Dec 27, 2024 by
ziyuhuang123
[QST]How Does TMA Work in CUTLASS for Writing from Shared Memory to Global Memory?
? - Needs Triage
inactive-30d
question
Question
#2008
opened Dec 23, 2024 by
ziyuhuang123
[BUG] wmma should be enabled w/ clang.
? - Needs Triage
bug
Something isn't working
#2006
opened Dec 20, 2024 by
Artem-B