About general prompts #3

Open
Lincoln20030413 opened this issue Oct 31, 2024 · 4 comments

Comments

@Lincoln20030413

Thanks for your former replies! I'm sorry, but I'm a little confused about this part of your paper:

[Screenshot of the relevant passage from the paper]
What is the query in the cross-attention when generating the general prompts? The phrase "the general prompts form the queries" really confuses me. Aren't we trying to generate the prompts?

@Ephemeral182
Owner

I apologize for the confusion. What I mean is that the normal (general) prompt on its own cannot be constrained to control the background. So we use the normal prompt as the query of the cross-attention and constrain it in combination with the Depth Anything features. The normal prompt here is first initialized from learnable parameters.

@Lincoln20030413
Author

Thanks. Does this mean that you initialize the general prompts and then generate the updated general prompts using the initialized ones as the query? At first I thought this part didn't need to be trained, but now it seems it needs training as well.

@Ephemeral182
Owner

Yes, your understanding is correct. This part requires training the cross-attention.
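
For anyone reading along, the flow discussed above can be sketched roughly as follows. This is only a minimal single-head illustration, not the repository's actual module: the sizes, the random stand-in for the Depth Anything features, and the names `general_prompts`, `depth_features`, `W_q`, `W_k`, `W_v`, and `cross_attention` are all assumptions made for the example. The learnable prompts act as the queries, the depth features supply the keys and values, and the output is the updated set of prompts. In a real model both the prompts and the projection weights would be trained.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration (not taken from the paper).
num_prompts, dim = 4, 16       # number of general prompt tokens, embedding width
num_depth_tokens = 32          # number of flattened depth feature tokens

# The general prompts start out as learnable parameters (random init here).
general_prompts = rng.normal(scale=0.02, size=(num_prompts, dim))

# Random stand-in for features produced by a depth model such as Depth Anything.
depth_features = rng.normal(size=(num_depth_tokens, dim))

# Projection weights of the cross-attention; these would be trained as well.
W_q = rng.normal(scale=dim ** -0.5, size=(dim, dim))
W_k = rng.normal(scale=dim ** -0.5, size=(dim, dim))
W_v = rng.normal(scale=dim ** -0.5, size=(dim, dim))


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(prompts, features):
    """Prompts form the queries; depth features form the keys and values."""
    q = prompts @ W_q                          # (num_prompts, dim)
    k = features @ W_k                         # (num_depth_tokens, dim)
    v = features @ W_v                         # (num_depth_tokens, dim)
    attn = softmax(q @ k.T / np.sqrt(dim))     # (num_prompts, num_depth_tokens)
    return attn @ v, attn                      # updated prompts, attention map


updated_prompts, attn = cross_attention(general_prompts, depth_features)
print(updated_prompts.shape)  # same shape as the initial prompts
```

So the prompts are not generated from nothing: each initialized prompt token attends over the depth features, and the result is the updated prompt. Whether the actual implementation adds a residual connection, normalization, or multiple heads is not stated in this thread, so none of that is shown here.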

@Lincoln20030413
Author

Thanks for your patient replies!
