-
-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] Support for chain-of-thought models #485
Comments
Hi, thanks for the suggestion. I was actually looking into this to get Deepseek R1 working better for the next Alpaca release. I didn't noticed the title issue. For now I'll make it so that Alpaca uses the default model for that, in the future I'll give it it's own option, something like I'm also working on a new widget for handling the <think> blocks. |
Make the thought part integrated in the chat it is not good looking and unintiative how deepseek do it in there app... Which is actually better. |
Looking very promising @Jeffser , thanks for that quick reaction. I'll give it a spin during the next few days. |
Two problems:
|
The whole markdown processing thing is attached to the message widget (not attachments), I will make it more generic later so that it works in attachments too. The default model set in preferences is now in charge of generating the titles, you should probably select one that isn't chain of thought, that's the solution for now. I plan on rewriting all the instance manager code, and when I do, I'll separate the option to default model and title generating model. |
please add an option to hide the thoughts like chatgpt does (just show the thoughs bubble icon in ui then and user can click to expand it and see the generation ) deepseek has sometimes annoying emulated vocal expressions like hmmmmm let me see ... off topic: for example i provided the model with a large script and asked to fix one part and when i said combine the updated code regarding the fix with the original script, ai be like: what script? generating random wanna be placeholders around the fixed previous requested code |
Finally got the update over flatpak - sorry I didn't have time to build the test flatpak before! It's looking great, thanks for that! :) One "bug" I found is, if you ask the model (R1 in my case) "forbidden" question (e.g historical facts), it will return an empty It would be great if the rendering of the thought section could be dynamic, rather than waiting for the whole response to generate, but this is probably a separated feature request. Is that something you are already working on? Anyway, happy to close this issue if you would prefer to track follow up work in other ones |
Is your feature request related to a problem? Please describe.
With the recent release of deepseek-r1, it would be great to add proper support to chain-of-thoughts model
Describe the solution you'd like
A valid solution would include
Describe alternatives you've considered
N/A
Additional context
The text was updated successfully, but these errors were encountered: