
I keep being Rick Rolled #311

Open
ergosumdre opened this issue Feb 7, 2025 · 7 comments

Comments

@ergosumdre

So, every time I ask my AI agent to do literally anything—analyze data, fetch a file, make me coffee (okay, maybe not that last one)—it sneakily starts downloading a YouTube video.

Curious, I check the URL, and what do I find? You guessed it: Rick Astley’s "Never Gonna Give You Up."

@FatherfoxStrongpaw

that's interesting. i got the same thing from another github agent called ailice. what llm were you using?

@ergosumdre
Author

It was deepseek chat.

@YourTendency

YourTendency commented Feb 8, 2025 via email

@romano

romano commented Feb 8, 2025

Possibly the same here (I haven't confirmed the actual rick roll part). Fresh install, just configured deepseek-r1:14b for chat, util, and web, and nomic-embed-text for embed. The first and only thing I've said to it is "hello", and it started thinking about how to download a YouTube video.

pastebin

@frdel
Owner

frdel commented Feb 8, 2025

Reasoning models are not great for agentic use: they focus too much on memories and the system prompt and not enough on the user message. This is especially true of smaller and quantized reasoning models.

@FatherfoxStrongpaw

frdel: that's actually good to know. you have a fallback to a smaller model for simpler tasks, but i've been looking for an arXiv paper i lost track of about setting up a fallback module for running frontier/api LLMs, ranging from smaller/cheaper to medium to larger/full frontier models, escalating as solutions or information are found to be lacking. what do you think about me looking into a module that can shift between general and reasoning models depending on the task?
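
For illustration only, the escalation idea described above could be sketched roughly like this. It is a minimal sketch under stated assumptions, not code from this project or from the paper mentioned: `Tier`, `cascade`, and `good_enough` are hypothetical names, and each tier's `ask` is any function that sends a prompt to a model and returns its text.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class Tier:
    # Hypothetical label for the tier, e.g. "small", "medium", "frontier".
    name: str
    # Any callable that takes a prompt and returns the model's text reply.
    ask: Callable[[str], str]


def cascade(
    prompt: str,
    tiers: Sequence[Tier],
    good_enough: Callable[[str], bool],
) -> Tuple[str, str]:
    """Try tiers in order (cheapest first) and stop at the first acceptable answer.

    Returns (tier name, answer). If no answer passes the check,
    the last tier's answer is returned anyway.
    """
    if not tiers:
        raise ValueError("at least one tier is required")
    answer = ""
    for tier in tiers:
        answer = tier.ask(prompt)
        if good_enough(answer):
            return tier.name, answer
    return tiers[-1].name, answer
```

The `good_enough` check is where the real difficulty sits: deciding that an answer is "lacking" could be a length check, a validator, or another model call, and the sketch leaves that choice open.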

@romano

romano commented Feb 8, 2025

@frdel True, I've seen you mention that a couple of times. I've been having trouble finding a suitable local LLM that fits on my machine; most were barely working, loopy, or completely broken, like the distilled Deepseek-r1 ones. I've seen you mention that Llama, Phi or Gemma could work, but I'm guessing you meant larger variants of those, over 30b, because the smaller models were plain broken for this purpose. Either way, for now Mistral Small 24b seems to work fairly ok. It does what I tell it to do and doesn't fall into loops or attempt to download YouTube videos for no apparent reason.
