Skip to content

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Enforce PyTorch version 2.1 and run attention tests with torch.compile
#1516 opened Feb 27, 2025 by ksivaman Loading…
6 of 13 tasks
Fix shape of new quantized tensor in make_like bug Something isn't working
#1515 opened Feb 27, 2025 by ksivaman Loading…
5 of 13 tasks
Verified TE2.0 with offloading
#1514 opened Feb 27, 2025 by sanandaraj5597 Loading…
Blockwise float8 quantizer and quantized tensor class
#1513 opened Feb 27, 2025 by kwyss-nvidia Loading…
12 of 34 tasks
Draft: split wgrad poc
#1510 opened Feb 26, 2025 by lhb8125 Draft
13 tasks
Support tensors with only column-wise data enhancement New feature or request performance
#1505 opened Feb 25, 2025 by timmoon10 Draft
7 of 13 tasks
[Pytorch] Dynamo ONNX export support
#1497 opened Feb 19, 2025 by pggPL Draft
8 of 13 tasks
[PyTorch] Enable MXFP8 LayerNorm and RMSNorm
#1487 opened Feb 15, 2025 by timmoon10 Loading…
5 of 13 tasks
Explicitly use python3 and pip3 executables testing Improvements to tests or testing infrastructure
#1486 opened Feb 15, 2025 by timmoon10 Loading…
6 of 13 tasks
RoPE enhancements
#1478 opened Feb 11, 2025 by sudhakarsingh27 Loading…
3 of 6 tasks
[PyTorch] Enabling Per-Tensor Current Scaling Recipe
#1471 opened Feb 11, 2025 by zhongbozhu Loading…
8 of 21 tasks
Add support for UB MNNVL 2.1.0
#1470 opened Feb 10, 2025 by nvcastet Loading…
1 of 6 tasks
support adam bf16 state
#1465 opened Feb 8, 2025 by XiaobingSuper Loading…
6 of 13 tasks
[PyTorch] Enable quantized activation backward kernels in operation-based API tests testing Improvements to tests or testing infrastructure
#1463 opened Feb 7, 2025 by timmoon10 Loading…
6 of 14 tasks
[JAX] THD ring attention
#1454 opened Feb 4, 2025 by zlsh80826 Loading…
8 of 13 tasks
[Pytorch] Nvidia-DLFramework-Inspect support
#1441 opened Jan 30, 2025 by pggPL Loading…
8 of 14 tasks
Add test for Lightning Thunder integration testing Improvements to tests or testing infrastructure
#1433 opened Jan 28, 2025 by timmoon10 Draft
6 of 14 tasks
Introduce NVSHMEM based communication API for pytorch
#1430 opened Jan 28, 2025 by gdengk Loading…
13 tasks
[PyTorch] cuBLAS workspace size fix for TP overlap unit test bug Something isn't working
#1415 opened Jan 17, 2025 by denera Loading…
8 of 13 tasks
Fix Linear Weight Initialization in the PaddlePaddle Implementation
#1413 opened Jan 17, 2025 by GuoxiaWang Loading…
4 of 13 tasks
Better cuBLAS handle management
#1389 opened Jan 2, 2025 by ptrendx Loading…
8 of 13 tasks
Update README.rst
#1385 opened Dec 23, 2024 by sbhavani Loading…
1 of 6 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.