pytorch / FBGEMM Public

Pull requests: pytorch/FBGEMM

395 Open 3,114 Closed

Add cublas FP8 tensorwise GEMM in fbgemm quantize bench cla signed fb-exported

#3693 opened Feb 14, 2025 by jiawenliu64

[fbgemm_gpu] Test genai op registration cla signed

#3692 opened Feb 14, 2025 by q10

avoid extra copy in PackedGemmMatrixB constructor cla signed fb-exported

#3691 opened Feb 13, 2025 by helloguo

[fbgemm_gpu] Increase timeout for ARM nova jobs cla signed

#3690 opened Feb 13, 2025 by q10

Numerical Fix. cla signed fb-exported

#3688 opened Feb 13, 2025 by levendlee

custom reduce scatter cla signed fb-exported

#3686 opened Feb 13, 2025 by xw285cornell

adding an option to skip zeroing output tensor for f8f8bf16_rowwise_grouped_dynamic cla signed fb-exported

#3685 opened Feb 13, 2025 by mxz297

Add small M support cla signed fb-exported

#3682 opened Feb 12, 2025 by YUNQIUGUO

[wip] commits scraper cla signed

#3676 opened Feb 11, 2025 by q10

Add D_folded support for jagged_to_padded_dense_backward meta function cla signed fb-exported

#3670 opened Feb 8, 2025 by brad-mengchi

changing config for fp8 gemm cla signed fb-exported

#3668 opened Feb 7, 2025 by adamomainz

fixing fp8 gemm on amd cla signed fb-exported

#3662 opened Feb 5, 2025 by adamomainz

Pull ARM's matrix transpose PR cla signed fb-exported

#3660 opened Feb 4, 2025 by Nicoshev

Adding Missing includes and explicitly declaring Tensor in aten namespace. cla signed fb-exported

#3638 opened Jan 30, 2025 by pradeepfn

Partial revert of D66986498 (Optimized backward pass for ROCm devices, pt 1), 2nd attempt ciflow/rocm cla signed fb-exported module: rocm

#3637 opened Jan 29, 2025 by q10

avoid using warning tensor in cpu tbe op cla signed fb-exported

#3631 opened Jan 29, 2025 by 842974287

Update bf16i4 gemm with new cutlass version cla signed fb-exported

#3630 opened Jan 29, 2025 by jwfromm

finish #1808 cherry-pick, adjust interface cla signed fb-exported

#3627 opened Jan 28, 2025 by coconutruben

Partial revert of D66986498 cla signed fb-exported

#3620 opened Jan 27, 2025 by q10

Re-land D67407935 (Optimized backward pass for ROCm devices, pt 2) ciflow/rocm cla signed fb-exported module: rocm

#3619 opened Jan 27, 2025 by q10

Performance Optimization: Optimized TileShape Configuration for f8 cla signed

#3617 opened Jan 27, 2025 by MatrixAssembler

Replace runners prefix amz2023. (#2895) cla signed fb-exported module: rocm

#3612 opened Jan 24, 2025 by q10

Test out D68193920 cla signed fb-exported

#3606 opened Jan 23, 2025 by q10

AdagradW cla signed fb-exported

#3605 opened Jan 23, 2025 by spcyppt

[fbgemm_gpu] Try nested ops namespaces cla signed

#3603 opened Jan 22, 2025 by q10
