CentML

Popular repositories

  1. DeepView.Profile Public

    🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.

    Python · 58 stars · 7 forks

  2. DeepView.Explore Public

    🛠 VSCode plugin that provides a visual interface for CentML tools

    TypeScript · 15 stars · 2 forks

  3. DeepView.Predict Public

    🔮 Execution time predictions for deep neural network training iterations across different GPUs.

    Python · 14 stars · 3 forks

  4. VectorWorkshop Public

    Jupyter Notebook · 8 stars · 2 forks

  5. flexible-inference-bench Public

    A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.

    Python · 8 stars

  6. gpu-usage-estimator Public

    Python script to estimate GPU utilization using NVIDIA Nsight Systems

    Python · 4 stars

Repositories

Showing 10 of 37 repositories
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 0 stars · Apache-2.0 · 6,040 forks · 0 open issues · 1 open pull request · Updated Feb 27, 2025
  • paper-of-the Public

    An Agent that reviews the papers published on a given day and picks the one most aligned with our mission.

    TypeScript · 0 stars · Apache-2.0 · 0 forks · 0 open issues · 0 open pull requests · Updated Feb 27, 2025
  • centml_platform_docs
    MDX · 0 stars · 0 forks · 0 open issues · 2 open pull requests · Updated Feb 26, 2025
  • flexible-inference-bench Public

    A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.

    Python · 8 stars · Apache-2.0 · 0 forks · 9 open issues · 2 open pull requests · Updated Feb 25, 2025
  • centml-python-client
    Python · 1 star · Apache-2.0 · 0 forks · 12 open issues (4 need help) · 1 open pull request · Updated Feb 25, 2025
  • spiffe-jwt Public
    Go · 0 stars · 0 forks · 0 open issues · 1 open pull request · Updated Feb 24, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python · 0 stars · BSD-3-Clause · 1,513 forks · 0 open issues · 0 open pull requests · Updated Feb 19, 2025
  • ai-benchmarks Public Forked from fixie-ai/ai-benchmarks

    Benchmarking suite for popular AI APIs

    Python · 0 stars · MIT · 15 forks · 0 open issues · 3 open pull requests · Updated Feb 12, 2025
  • aisuite Public Forked from andrewyng/aisuite

    Simple, unified interface to multiple Generative AI providers

    Python · 0 stars · MIT · 1,121 forks · 0 open issues · 7 open pull requests · Updated Feb 11, 2025
  • Sylva Public

    Boost fine-tuning performance with sparse embedded adapters and hierarchical approximate second-order information.

    Python · 2 stars · Apache-2.0 · 0 forks · 0 open issues · 1 open pull request · Updated Feb 11, 2025