Skip to content

Add support for nvidia modelopt fp8 kv cache #5128

Add support for nvidia modelopt fp8 kv cache

Add support for nvidia modelopt fp8 kv cache #5128

performance-test-1-gpu-part-1

succeeded Feb 1, 2025 in 11m 53s
Set up job
12s
Checkout code
9s
Install dependencies
1m 7s
Benchmark single latency
27s
Benchmark online latency
2m 54s
Benchmark offline throughput
3m 25s
Benchmark offline throughput (Non-streaming, small batch size)
1m 24s
Benchmark online latency (EAGLE)
1m 50s
Post Checkout code
1s
Complete job
0s