-
Notifications
You must be signed in to change notification settings - Fork 366
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Delete extra tensor objects after restoring float8 tensors
2.1.0
#1500
opened Feb 21, 2025 by
sudhakarsingh27
Loading…
1 of 13 tasks
[PyTorch] Enable MXFP8 LayerNorm and RMSNorm
#1487
opened Feb 15, 2025 by
timmoon10
Loading…
5 of 13 tasks
Explicitly use Improvements to tests or testing infrastructure
python3
and pip3
executables
testing
#1486
opened Feb 15, 2025 by
timmoon10
Loading…
6 of 13 tasks
[PyTorch] Enabling Per-Tensor Current Scaling Recipe
#1471
opened Feb 11, 2025 by
zhongbozhu
Loading…
8 of 21 tasks
[PyTorch] Enable quantized activation backward kernels in operation-based API tests
testing
Improvements to tests or testing infrastructure
#1463
opened Feb 7, 2025 by
timmoon10
Loading…
6 of 14 tasks
Support vectorized local reduction for p2p-based ReduceScatter overlap
#1452
opened Feb 4, 2025 by
erhoo82
Loading…
13 tasks
[Pytorch] Nvidia-DLFramework-Inspect support
#1441
opened Jan 30, 2025 by
pggPL
Loading…
8 of 14 tasks
Introduce NVSHMEM based communication API for pytorch
#1430
opened Jan 28, 2025 by
gdengk
Loading…
13 tasks
[PyTorch] cuBLAS workspace size fix for TP overlap unit test
bug
Something isn't working
#1415
opened Jan 17, 2025 by
denera
Loading…
8 of 13 tasks
Fix Linear Weight Initialization in the PaddlePaddle Implementation
#1413
opened Jan 17, 2025 by
GuoxiaWang
Loading…
4 of 13 tasks
Don't touch nor send messages to the root logger.
#1380
opened Dec 19, 2024 by
sagostinho-nvidia
Loading…
4 of 13 tasks
[PyTorch] Bugfix for wgrad bulk overlap conflict when dgrad overlap is reduce-scatter
bug
Something isn't working
#1341
opened Nov 18, 2024 by
denera
Loading…
6 of 13 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.