Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Support for chain-of-thought models #485

Open
acolombier opened this issue Jan 21, 2025 · 6 comments
Open

[Feature Request] Support for chain-of-thought models #485

acolombier opened this issue Jan 21, 2025 · 6 comments
Labels
enhancement New feature or request

Comments

@acolombier
Copy link

Is your feature request related to a problem? Please describe.

With the recent release of deepseek-r1, it would be great to add proper support to chain-of-thoughts model

Describe the solution you'd like

A valid solution would include

  • A correct title, potentially generated by another model (optional): Alpaca uses the same model to generate title for the prompt, but chain of thought model aren't a great usecase for it. (See context)
  • A better UI rendering allowing collapsing of the thoughts: the chain of thoughts can be very verbose, and while it is great to be able to see it, most of the time it is artifacts you don't need to as part of your prompt answer. This makes the "chat" hard to interact with (See context)
  • A better responsiveness: currently, Alpaca is not responding well during the generation, beside using a dedicated server and thus not having the current machine under load

Describe alternatives you've considered

N/A

Additional context

Image

Image

@acolombier acolombier added the enhancement New feature or request label Jan 21, 2025
@Jeffser
Copy link
Owner

Jeffser commented Jan 21, 2025

Hi, thanks for the suggestion. I was actually looking into this to get Deepseek R1 working better for the next Alpaca release.

I didn't noticed the title issue. For now I'll make it so that Alpaca uses the default model for that, in the future I'll give it it's own option, something like title generation model.

I'm also working on a new widget for handling the <think> blocks.

@Jeffser
Copy link
Owner

Jeffser commented Jan 21, 2025

First attempt:

Image

@Jeffser
Copy link
Owner

Jeffser commented Jan 21, 2025

I'll set this as finished

2a60dff
a360058

@Jeffser
Copy link
Owner

Jeffser commented Jan 22, 2025

Reopening so people know this is done, I also changed the appearance to an attachment button, it opens a file previewer with the text in it.

Image

Image

@Jeffser Jeffser marked this as a duplicate of #492 Jan 22, 2025
@olumolu
Copy link
Contributor

olumolu commented Jan 26, 2025

Make the thought part integrated in the chat it is not good looking and unintiative how deepseek do it in there app... Which is actually better.

@acolombier
Copy link
Author

Looking very promising @Jeffser , thanks for that quick reaction. I'll give it a spin during the next few days.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants