Hi, thanks for sharing the code!
I ran prompt minimization on Llama-7b-chat with the target string "Imperfection is beauty, madness is genius and it's better to be absolutely ridiculous than absolutely boring", but it failed to find an adversarial prompt like the one shown in Figure 2.
I noticed that the example in Figure 2 begins with the affirmative response "Sure! Here's a famous quote:\n\n". Should I insert this response between the prompt and the target string when running GCG?
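To make the question concrete, here is a minimal sketch of the two target variants I am asking about. `build_target` is a hypothetical helper written only for illustration; it is not a function from this repository, and it only shows where the affirmative prefix would be placed relative to the target string.

```python
# Illustrative only: these names are assumptions, not part of the repository's API.
QUOTE = (
    "Imperfection is beauty, madness is genius and it's better to be "
    "absolutely ridiculous than absolutely boring"
)
# Affirmative prefix as it appears in Figure 2.
PREFIX = "Sure! Here's a famous quote:\n\n"


def build_target(quote: str, prefix: str = "") -> str:
    """Return the string the optimizer would be asked to elicit.

    With an empty prefix, the target is the quote alone; otherwise the
    prefix is placed directly before the quote.
    """
    return prefix + quote


# Variant A: what I ran (target string only).
target_plain = build_target(QUOTE)

# Variant B: what Figure 2 seems to show (prefix followed by the target string).
target_prefixed = build_target(QUOTE, PREFIX)
```

Variant A is what I used; my question is whether Variant B is the intended setup.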