Implement OpenAI Whisper Model #125

MahmoudAshraf97 · 2025-02-04T11:28:15Z

This PR enables exporting the HF Whisper implementation to a TRT-LLM checkpoint.
Although TRT-LLM already has a conversion script for both the HF and original implementations, implementing it here lets users apply modelopt optimization and quantization techniques that aren't yet available in TRT-LLM.

There are some sharp edges in this implementation, mostly because Whisper is the second Enc-Dec model to be implemented after T5, so its differences are handled as edge cases. A generalized Enc-Dec path might be better. There may also be some code duplication between Whisper and T5 in the TRT-LLM config creation that could be unified.

yuekaizhang · 2025-02-06T09:40:51Z

@MahmoudAshraf97 Hi, thanks for your effort. I will take this PR into our internal GitLab (we don't have CI running on GitHub). We will also add your name to the co-author list and credit your work in the release notes for Whisper FP8 quantization support.

MahmoudAshraf97 added 2 commits February 17, 2025 19:42

Initial Whisper Implementation

cee2021

simplify feature extractor config

cf63773

MahmoudAshraf97 force-pushed the main branch from 7867993 to cf63773 Compare February 17, 2025 17:42
