    Repositories list

    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      13k130198 Updated Feb 4, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      99741070 Updated Feb 4, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.5k58526 Updated Feb 4, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1322921521 Updated Feb 4, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1433382251 Updated Feb 4, 2025
    • aiter

      Public
      AI Tensor Engine for ROCm
      Cuda
      MIT License
      11322 Updated Feb 4, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      61509 Updated Feb 4, 2025
    • AMD's graph optimization engine.
      C++
      MIT License
      9119635251 Updated Feb 4, 2025
    • Device Metrics Exporter exports metrics from AMD devices (GPUs) to collectors like Prometheus.
      Shell
      Apache License 2.0
      10703 Updated Feb 4, 2025
    • Shell
      Apache License 2.0
      92783 Updated Feb 4, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1750111 Updated Feb 4, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k152236 Updated Feb 4, 2025
    • rocHPL

      Public
      High Performance Linpack for Next-Generation AMD HPC Accelerators
      C++
      Other
      204553 Updated Feb 4, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      17336042 Updated Feb 4, 2025
    • Advanced Profiling and Analytics for AMD Hardware
      Python
      MIT License
      511394911 Updated Feb 4, 2025
    • ROCm Platform Runtime: ROCr, an HPC-market-enhanced, HSA-based runtime
      C++
      Other
      1132342120 Updated Feb 4, 2025
    • rocminfo

      Public
      ROCm Application for Reporting System Info
      C++
      Other
      3235010 Updated Feb 4, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2391.1k24753 Updated Feb 4, 2025
    • Jupyter Notebook
      104711 Updated Feb 3, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2207740 Updated Feb 3, 2025
    • Python
      Other
      81688 Updated Feb 3, 2025
    • aomp

      Public
      AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
      Fortran
      Apache License 2.0
      48211241 Updated Feb 3, 2025
    • hipBLAS

      Public
      ROCm BLAS marshalling library
      C++
      Other
      8112815 Updated Feb 3, 2025
    • ONNX Runtime: cross-platform, high performance scoring engine for ML models
      C++
      MIT License
      3k608 Updated Feb 3, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.8k1051050 Updated Feb 3, 2025
    • Tensile

      Public
      Stretching GPU performance for GEMMs and tensor contractions.
      Python
      MIT License
      15423134 Updated Feb 3, 2025
    • TensorFlow ROCm port
      C++
      Apache License 2.0
      74k6907167 Updated Feb 3, 2025
    • rocSHMEM

      Public
      rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
      C++
      MIT License
      124883 Updated Feb 3, 2025
    • rocSOLVER

      Public
      Next generation LAPACK implementation for ROCm platform
      C++
      Other
      5398018 Updated Feb 3, 2025
    • C
      MIT License
      121412 Updated Feb 3, 2025
    301 repositories found. List is sorted by Last pushed in descending order.