Popular repositories Loading
-
DeepView.Profile
DeepView.Profile Public🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
-
DeepView.Explore
DeepView.Explore Public🛠 VSCode plugin that provides visual interface for CentML Tools
-
DeepView.Predict
DeepView.Predict Public🔮 Execution time predictions for deep neural network training iterations across different GPUs.
-
-
flexible-inference-bench
flexible-inference-bench PublicA modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
Python 8
-
gpu-usage-estimator
gpu-usage-estimator PublicPython script to estimate GPU utilization using NVIDIA Nsight Systems
Python 4
Repositories
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
CentML/vllm’s past year of commit activity - paper-of-the Public
An Agent that reviews the papers published on a given day and picks the one most aligned with our mission.
CentML/paper-of-the’s past year of commit activity - centml_platform_docs Public
CentML/centml_platform_docs’s past year of commit activity - flexible-inference-bench Public
A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
CentML/flexible-inference-bench’s past year of commit activity - centml-python-client Public
CentML/centml-python-client’s past year of commit activity - spiffe-jwt Public
CentML/spiffe-jwt’s past year of commit activity - flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
CentML/flash-attention’s past year of commit activity - aisuite Public Forked from andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
CentML/aisuite’s past year of commit activity