What's Changed?

🎈1. Support Save & Load “Pre-quantized Model” with Extension .perfab #83

You can use /api/models/save (post model_path to this api after loading the model with quantization and LoRA merged to save the prefab), and you will get a smaller model file.
And then you can load this file using out favourite /api/models/load. Note that when loading a prefab, parameters like lora and quant are ignored.

👑2. Support BNF #92

Now you could specify bnf_schema in /api/oai/chat/completions.
This will force the model to generate contents that fit the schema (e.g., in JSON format).

🧨 Fixed Many Minor Issues

🥱🥱🥱

Changed list

docs: add longzou as a contributor for code, and security by @allcontributors in #77
Auth by @cryscan in #78
All contributors/add cryscan by @cryscan in #79
Small fix. by @cryscan in #80
Bump web-rwkv to v0.6.33 by @cryscan in #81
Remove axum api. by @cryscan in #82
Allow loading prefab. by @cryscan in #83
Do not check for existence when building path. by @cryscan in #85
Fix typo in main.rs by @cryscan in #86
Update index.zip by @cuijinsen in #87
Update index.zip by @cuijinsen in #88
Fix the null finish reason representation. by @cryscan in #91
Bnf constraint by @cryscan in #92

New Contributors

@cuijinsen made their first contribution in #87

Full Changelog: v0.3.23...v0.3.24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

Contributors

What's Changed

Contributors

What's Changed?

🎈1. Support Save & Load “Pre-quantized Model” with Extension .perfab #83

👑2. Support BNF #92

🧨 Fixed Many Minor Issues

Changed list

New Contributors

Contributors

What's Changed?

🎈1. Support TLS and http2/http3

👑2. Support openAPI and Swagger-UI

🔔3. Fix something for Loading Lora

ps. 📢📢📢 Something BIG on the way! (⓿_⓿)

Releases: Ai00-X/ai00_server

v0.4.2

v0.4.1

v0.4.0

What's Changed

Contributors

v0.3.25

What's Changed

Contributors

v0.3.24

What's Changed?

🎈1. Support Save & Load “Pre-quantized Model” with Extension .perfab #83

👑2. Support BNF #92

🧨 Fixed Many Minor Issues

Changed list

New Contributors

Contributors

v0.3.23

What's Changed?

🎈1. Support TLS and http2/http3

👑2. Support openAPI and Swagger-UI

🔔3. Fix something for Loading Lora

ps. 📢📢📢 Something BIG on the way! (⓿_⓿)

v0.3.22

v0.3.21

v0.3.20

v0.3.19