Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Draft: split wgrad poc
#1510 opened Feb 26, 2025 by lhb8125 Draft
13 tasks
Support tensors with only column-wise data enhancement New feature or request performance
#1505 opened Feb 25, 2025 by timmoon10 Draft
7 of 13 tasks
[Pytorch] Dynamo ONNX export support
#1497 opened Feb 19, 2025 by pggPL Draft
8 of 13 tasks
[PyTorch] Enable MXFP8 LayerNorm and RMSNorm
#1487 opened Feb 15, 2025 by timmoon10 Loading…
5 of 13 tasks
Explicitly use python3 and pip3 executables testing Improvements to tests or testing infrastructure
#1486 opened Feb 15, 2025 by timmoon10 Loading…
6 of 13 tasks
RoPE enhancements
#1478 opened Feb 11, 2025 by sudhakarsingh27 Loading…
3 of 6 tasks
[PyTorch] Enabling Per-Tensor Current Scaling Recipe
#1471 opened Feb 11, 2025 by zhongbozhu Loading…
8 of 21 tasks
Add support for UB MNNVL 2.1.0
#1470 opened Feb 10, 2025 by nvcastet Loading…
1 of 6 tasks
support adam bf16 state
#1465 opened Feb 8, 2025 by XiaobingSuper Loading…
6 of 13 tasks
[PyTorch] Enable quantized activation backward kernels in operation-based API tests testing Improvements to tests or testing infrastructure
#1463 opened Feb 7, 2025 by timmoon10 Loading…
6 of 14 tasks
[JAX] THD ring attention
#1454 opened Feb 4, 2025 by zlsh80826 Loading…
8 of 13 tasks
[Pytorch] Nvidia-DLFramework-Inspect support
#1441 opened Jan 30, 2025 by pggPL Loading…
8 of 14 tasks
Add test for Lightning Thunder integration testing Improvements to tests or testing infrastructure
#1433 opened Jan 28, 2025 by timmoon10 Draft
6 of 14 tasks
Introduce NVSHMEM based communication API for pytorch
#1430 opened Jan 28, 2025 by gdengk Loading…
13 tasks
[PyTorch] cuBLAS workspace size fix for TP overlap unit test bug Something isn't working
#1415 opened Jan 17, 2025 by denera Loading…
8 of 13 tasks
Fix Linear Weight Initialization in the PaddlePaddle Implementation
#1413 opened Jan 17, 2025 by GuoxiaWang Loading…
4 of 13 tasks
Better cuBLAS handle management
#1389 opened Jan 2, 2025 by ptrendx Loading…
8 of 13 tasks
Update README.rst
#1385 opened Dec 23, 2024 by sbhavani Loading…
1 of 6 tasks
Don't touch nor send messages to the root logger.
#1380 opened Dec 19, 2024 by sagostinho-nvidia Loading…
4 of 13 tasks
Add paged attention support
#1355 opened Dec 4, 2024 by cyanguwa Loading…
8 of 13 tasks
[PyTorch] Bugfix for wgrad bulk overlap conflict when dgrad overlap is reduce-scatter bug Something isn't working
#1341 opened Nov 18, 2024 by denera Loading…
6 of 13 tasks
ProTip! Add no:assignee to see everything that’s not assigned.