Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Browse files
Browse the repository at this point in the history
* Add flash_attn support (#306) * add dockerfile for flash_attn setup * remove test.py * parametrize model name and engine * Update Dockerfile --------- Co-authored-by: Michael Feil <[email protected]> * Delete libs/infinity_emb/Dockerfile.flash --------- Co-authored-by: Göktürk <[email protected]>
- Loading branch information