Pull requests: NVIDIA/TransformerEngine
Choose correct fuser for PyTorch versions < 2.0
#1512
opened Feb 26, 2025 by
ksivaman
5 of 13 tasks
Export only necessary symbols from libtransformer_engine.so
#1511
opened Feb 26, 2025 by
KshitijLakhani
Draft
13 tasks
Delete extra tensor objects after restoring float8 tensors
2.1.0
#1500
opened Feb 21, 2025 by
sudhakarsingh27
1 of 13 tasks
[PyTorch] Enable MXFP8 LayerNorm and RMSNorm
#1487
opened Feb 15, 2025 by
timmoon10
5 of 13 tasks
Explicitly use python3 and pip3 executables
testing
Improvements to tests or testing infrastructure
#1486
opened Feb 15, 2025 by
timmoon10
6 of 13 tasks
[PyTorch] Enabling Per-Tensor Current Scaling Recipe
#1471
opened Feb 11, 2025 by
zhongbozhu
8 of 21 tasks
[PyTorch] Enable quantized activation backward kernels in operation-based API tests
testing
Improvements to tests or testing infrastructure
#1463
opened Feb 7, 2025 by
timmoon10
6 of 14 tasks
Support vectorized local reduction for p2p-based ReduceScatter overlap
#1452
opened Feb 4, 2025 by
erhoo82
13 tasks
[Pytorch] Nvidia-DLFramework-Inspect support
#1441
opened Jan 30, 2025 by
pggPL
8 of 14 tasks
Introduce NVSHMEM based communication API for pytorch
#1430
opened Jan 28, 2025 by
gdengk
13 tasks
[PyTorch] cuBLAS workspace size fix for TP overlap unit test
bug
Something isn't working
#1415
opened Jan 17, 2025 by
denera
8 of 13 tasks
Fix Linear Weight Initialization in the PaddlePaddle Implementation
#1413
opened Jan 17, 2025 by
GuoxiaWang
4 of 13 tasks
Don't touch nor send messages to the root logger.
#1380
opened Dec 19, 2024 by
sagostinho-nvidia
4 of 13 tasks