-
Notifications
You must be signed in to change notification settings - Fork 143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
worktree: detect from secondary worktree if main worktree is bare #1829
base: maint
Are you sure you want to change the base?
worktree: detect from secondary worktree if main worktree is bare #1829
Conversation
Welcome to GitGitGadgetHi @olga-mcbfe, and welcome to GitGitGadget, the GitHub App to send patch series to the Git mailing list from GitHub Pull Requests. Please make sure that either:
You can CC potential reviewers by adding a footer to the PR description with the following syntax:
NOTE: DO NOT copy/paste your CC list from a previous GGG PR's description, Also, it is a good idea to review the commit messages one last time, as the Git project expects them in a quite specific form:
It is in general a good idea to await the automated test ("Checks") in this Pull Request before contributing the patches, e.g. to avoid trivial issues such as unportable code. Contributing the patchesBefore you can contribute the patches, your GitHub username needs to be added to the list of permitted users. Any already-permitted user can do that, by adding a comment to your PR of the form Both the person who commented An alternative is the channel
Once on the list of permitted usernames, you can contribute the patches to the Git mailing list by adding a PR comment If you want to see what email(s) would be sent for a After you submit, GitGitGadget will respond with another comment that contains the link to the cover letter mail in the Git mailing list archive. Please make sure to monitor the discussion in that thread and to address comments and suggestions (while the comments and suggestions will be mirrored into the PR by GitGitGadget, you will still want to reply via mail). If you do not want to subscribe to the Git mailing list just to be able to respond to a mail, you can download the mbox from the Git mailing list archive (click the curl -g --user "<EMailAddress>:<Password>" \
--url "imaps://imap.gmail.com/INBOX" -T /path/to/raw.txt To iterate on your change, i.e. send a revised patch or patch series, you will first want to (force-)push to the same branch. You probably also want to modify your Pull Request description (or title). It is a good idea to summarize the revision by adding something like this to the cover letter (read: by editing the first comment on the PR, i.e. the PR description):
To send a new iteration, just add another PR comment with the contents: Need help?New contributors who want advice are encouraged to join [email protected], where volunteers who regularly contribute to Git are willing to answer newbie questions, give advice, or otherwise provide mentoring to interested contributors. You must join in order to post or view messages, but anyone can join. You may also be able to find help in real time in the developer IRC channel, |
47a2eea
to
9e7170f
Compare
/allow |
User olga-mcbfe is now allowed to use GitGitGadget. WARNING: olga-mcbfe has no public email address set on GitHub; |
/preview |
Preview email sent as [email protected] |
/submit |
Submitted as [email protected] To fetch this version into
To fetch this version to local tag
|
9e7170f
to
17f4b24
Compare
/preview |
Preview email sent as [email protected] |
/submit |
Submitted as [email protected] To fetch this version into
To fetch this version to local tag
|
On the Git mailing list, Eric Sunshine wrote (reply to this): On Thu, Jan 16, 2025 at 4:35 PM Olga Pilipenco via GitGitGadget
<[email protected]> wrote:
> Setup:
> 1. Have a bare repo with core.bare = true in config.worktree
> 2. Create a new worktree
>
> Behavior:
> From the secondary worktree the main worktree appears as non-bare.
>
> Expected:
> From the secondary worktree the main worktree should appear as bare.
>
> Why current behavior is not good?
> If the main worktree is detected as not bare it doesn't allow
> checking out the branch of the main worktree. There are possibly
> other problems associated with that behavior.
>
> Why is it happening?
> While we're inside the secondary worktree we don't initialize the main
> worktree's repository with its configuration.
Okay, this is clearly a very real problem and explains this comment
added by f3534c98e4 (worktree: update is_bare heuristics, 2019-04-19):
NEEDSWORK: If this function is called from a secondary worktree and
config.worktree is present, is_bare_repository_cfg will reflect the
contents of config.worktree, not the contents of the main worktree.
This means that worktree->is_bare may be set to 0 even if the main
worktree is configured to be bare.
(Aside: I recall reading this comment when Jonathan added it but
wasn't able to dig into it at the time to really understand it, and
never got back around to it. Now, after studying your patch, I
understand what it was about.
> How is it fixed?
> Load actual configs of the main worktree. Also, skip the config loading
> step if we're already inside the current worktree because in that case we
> rely on is_bare_repository() to return the correct result.
I found that I had to dig around a bit to fully understand the problem
expressed by this commit message. Perhaps adding a bit more detail
would help? Here's my attempt at rewriting the above (also in a way
which is more idiomatic to this project):
When extensions.worktreeConfig is true and the main worktree is
bare -- that is, its config.worktree file contains core.bare=true
-- commands run from secondary worktrees incorrectly see the main
worktree as not bare. As such, those commands incorrectly think
that the repository's default branch (typically "main" or
"master") is checked out in the bare repository even though it's
not. This makes it impossible, for instance, to checkout or delete
the default branch from a secondary worktree, among other
shortcomings.
This problem occurs because, when extensions.worktreeConfig is
true, commands run in secondary worktrees only consult
$commondir/config and $commondir/worktrees/<id>/config.worktree,
thus they never see the main worktree's core.bare=true setting in
$commondir/config.worktree.
Fix this problem by consulting the main worktree's config.worktree
file when checking whether it is bare. (This extra work is
performed only when running from a secondary worktree.)
> Other solutions considered:
> Alternatively, instead of incorrectly always using
> `the_repository` as the main worktree's repository, we can detect
> and load the actual repository of the main worktree and then use
> that repository's `is_bare` value extracted from correct configs.
> However, this approach is a bit riskier and could also affect
> performance. Since we had the assignment `worktree->repo =
> the_repository` for a long time already, I decided it's safe to
> keep it as it is for now; it can be still fixed separately from
> this change.
I found this paragraph somewhat confusing because it seems to conflate
a repository (i.e. the shared object database) with the `struct
repository` type, and the configuration which happens to get loaded
and stored (as one of *many* members) of the repository structure. I
had to read it several times to understand that this was talking about
instantiating a separate `struct repository` initialized from the main
worktree configuration. I agree that doing so would likely be overkill
and could impact performance negatively. I understand that you added
this paragraph because SubmittingPatches suggests to do so, but I
think it can probably be omitted in this case unless it can be
rewritten to be more clear (but even then I doubt it is necessary to
keep it).
> Real life use case:
> 1. Have a bare repo
> 2. Create a worktree from the bare repo
> 3. In the secondary worktree enable sparse-checkout - this enables
> extensions.worktreeConfig and keeps core.bare=true setting in
> config.worktree of the bare worktree
> 4. The secondary worktree or any other non-bare worktree created
> won't be able to use branch main (not even once), but it should be
> able to.
This is mostly repeating what was said earlier, thus probably isn't
adding any value to the commit message. I'd probably drop it.
> Signed-off-by: Olga Pilipenco <[email protected]>
> ---
> Changes since v1:
>
> * no code changes
> * rebased with maint
> * CC added
Sorry. I've had your v1 sitting in my ever-increasingly-large backlog
of patches to look at, but have been extra busy the last many months
and never managed to get to it.
> Existing broken functionality forces our project to use hacks on bare
> repo that we'd like to avoid. I would really appreciate reviews of this
> patch to move closer towards fixing the issue. This is my first
> contribution to git/git, I apologize if I got lost in the instructions,
> but I tried my best to follow the rules.
Your submission is fine. Unfortunately, the project has a lack of
reviewers but no lack of submitters, so sometimes patches get
overlooked or simply buried.
> diff --git a/t/t3200-branch.sh b/t/t3200-branch.sh
> @@ -410,6 +410,20 @@ test_expect_success 'bare main worktree has HEAD at branch deleted by secondary
> +test_expect_success 'secondary worktree can switch to main if common dir is bare worktree' '
The use of "common dir" is a bit confusing. Also, this patch is fixing
the more general problem that secondary worktrees think that the bare
main worktree has a branch checked out. So, perhaps a better title
would be:
secondary worktrees recognize core.bare=true in main config.worktree
or something?
> + test_when_finished "rm -rf bare_repo non_bare_repo secondary_worktree" &&
> + git init -b main non_bare_repo &&
> + test_commit -C non_bare_repo x &&
> +
> + git clone --bare non_bare_repo bare_repo &&
> + git -C bare_repo config extensions.worktreeConfig true &&
> + git -C bare_repo config unset core.bare &&
> + git -C bare_repo config --worktree core.bare true &&
> +
> + git -C bare_repo worktree add ../secondary_worktree &&
> + git -C secondary_worktree checkout main
> +'
Very straightforward and exactly what I expected to see once I
understood the problem.
> diff --git a/worktree.c b/worktree.c
> @@ -65,6 +65,28 @@ static int is_current_worktree(struct worktree *wt)
> +static int is_bare_git_dir(const char *git_dir)
Nit: I wonder if a name such as is_main_worktree_bare() would clue
readers in a bit more?
> +{
> + int bare = 0;
> + struct config_set cs = { { 0 } };
This is not your fault since this construct is used elsewhere in this
file (from which I presume you copied it), but project consensus is
that using the notation `{{0}}` to work around a complaint from the
Apple compiler (and only the Apple compiler) should be avoided, and
that `{0}` is preferred. So, if you reroll, changing this to `{0}` may
make other reviewers happy (or you can leave it as is to be consistent
with existing precedence in this file; I don't feel strongly about
it).
> + char *config_file;
> + char *worktree_config_file;
> +
> + config_file = xstrfmt("%s/config", git_dir);
> + worktree_config_file = xstrfmt("%s/config.worktree", git_dir);
> +
> + git_configset_init(&cs);
> + git_configset_add_file(&cs, config_file);
> + git_configset_add_file(&cs, worktree_config_file);
Genuine question: I haven't thought too deeply about it, but do we
gain anything by loading $commondir/config here -- which is shared by
the main worktree and all secondary worktrees -- considering that it
was already loaded and consulted by the earlier is-bare check before
this function was even called?
> + git_configset_get_bool(&cs, "core.bare", &bare);
> +
> + git_configset_clear(&cs);
> + free(config_file);
> + free(worktree_config_file);
> + return bare;
Everything gets cleaned up correctly. Good.
> @@ -77,18 +99,16 @@ static struct worktree *get_main_worktree(int skip_reading_head)
> + /*
> + * NEEDSWORK: the_repository is not always main worktree's repository
> + */
> worktree->repo = the_repository;
> worktree->path = strbuf_detach(&worktree_path, NULL);
I found this new NEEDSWORK comment rather confusing the first several
times I read the patch. It wasn't until I finally realized that the
reference to `the_repository` here is the same reference to
`the_repository` in the commit message -- which confused me, as well
-- that I understood what this was trying to say. The actual problem,
of course, is that the _configuration_ stored in `the_repository` is
the secondary worktree's configuration, not the main worktree's
configuration. Considering that this patch addresses that problem, I'd
probably just drop this new comment altogether (unless, perhaps, you
rewrite it to talk about the _configuration_ stored in
`the_repository`).
> - /*
> - * NEEDSWORK: If this function is called from a secondary worktree and
> - * config.worktree is present, is_bare_repository_cfg will reflect the
> - * contents of config.worktree, not the contents of the main worktree.
> - * This means that worktree->is_bare may be set to 0 even if the main
> - * worktree is configured to be bare.
> - */
> - worktree->is_bare = (is_bare_repository_cfg == 1) ||
> - is_bare_repository();
> worktree->is_current = is_current_worktree(worktree);
> + worktree->is_bare = (is_bare_repository_cfg == 1) ||
> + is_bare_repository() ||
> + (!worktree->is_current && is_bare_git_dir(repo_get_common_dir(the_repository)));
This is performing the expensive check only if the earlier checks left
the question unanswered. Good. |
On the Git mailing list, Olga Pilipenco wrote (reply to this): Thank you for the review, I totally understand the delay in the review process and appreciate your time spent on this.
> On Jan 19, 2025, at 3:30 PM, Eric Sunshine <[email protected]> wrote:
>
> On Thu, Jan 16, 2025 at 4:35 PM Olga Pilipenco via GitGitGadget
> <[email protected]> wrote:
>> Setup:
>> 1. Have a bare repo with core.bare = true in config.worktree
>> 2. Create a new worktree
>>
>> Behavior:
>> From the secondary worktree the main worktree appears as non-bare.
>>
>> Expected:
>> From the secondary worktree the main worktree should appear as bare.
>>
>> Why current behavior is not good?
>> If the main worktree is detected as not bare it doesn't allow
>> checking out the branch of the main worktree. There are possibly
>> other problems associated with that behavior.
>>
>> Why is it happening?
>> While we're inside the secondary worktree we don't initialize the main
>> worktree's repository with its configuration.
>
> Okay, this is clearly a very real problem and explains this comment
> added by f3534c98e4 (worktree: update is_bare heuristics, 2019-04-19):
>
> NEEDSWORK: If this function is called from a secondary worktree and
> config.worktree is present, is_bare_repository_cfg will reflect the
> contents of config.worktree, not the contents of the main worktree.
> This means that worktree->is_bare may be set to 0 even if the main
> worktree is configured to be bare.
>
> (Aside: I recall reading this comment when Jonathan added it but
> wasn't able to dig into it at the time to really understand it, and
> never got back around to it. Now, after studying your patch, I
> understand what it was about.
>
>> How is it fixed?
>> Load actual configs of the main worktree. Also, skip the config loading
>> step if we're already inside the current worktree because in that case we
>> rely on is_bare_repository() to return the correct result.
>
> I found that I had to dig around a bit to fully understand the problem
> expressed by this commit message. Perhaps adding a bit more detail
> would help? Here's my attempt at rewriting the above (also in a way
> which is more idiomatic to this project):
>
> When extensions.worktreeConfig is true and the main worktree is
> bare -- that is, its config.worktree file contains core.bare=true
> -- commands run from secondary worktrees incorrectly see the main
> worktree as not bare. As such, those commands incorrectly think
> that the repository's default branch (typically "main" or
> "master") is checked out in the bare repository even though it's
> not. This makes it impossible, for instance, to checkout or delete
> the default branch from a secondary worktree, among other
> shortcomings.
>
> This problem occurs because, when extensions.worktreeConfig is
> true, commands run in secondary worktrees only consult
> $commondir/config and $commondir/worktrees/<id>/config.worktree,
> thus they never see the main worktree's core.bare=true setting in
> $commondir/config.worktree.
>
> Fix this problem by consulting the main worktree's config.worktree
> file when checking whether it is bare. (This extra work is
> performed only when running from a secondary worktree.)
Wow, your explanation is so much better than mine.Thank you for “translating" it for the world :) I’m still trying to get used to the terminology used in this codebase.
I’ll steal your description for sure (if you don’t mind).
>
>> Other solutions considered:
>> Alternatively, instead of incorrectly always using
>> `the_repository` as the main worktree's repository, we can detect
>> and load the actual repository of the main worktree and then use
>> that repository's `is_bare` value extracted from correct configs.
>> However, this approach is a bit riskier and could also affect
>> performance. Since we had the assignment `worktree->repo =
>> the_repository` for a long time already, I decided it's safe to
>> keep it as it is for now; it can be still fixed separately from
>> this change.
>
> I found this paragraph somewhat confusing because it seems to conflate
> a repository (i.e. the shared object database) with the `struct
> repository` type, and the configuration which happens to get loaded
> and stored (as one of *many* members) of the repository structure. I
> had to read it several times to understand that this was talking about
> instantiating a separate `struct repository` initialized from the main
> worktree configuration. I agree that doing so would likely be overkill
> and could impact performance negatively. I understand that you added
> this paragraph because SubmittingPatches suggests to do so, but I
> think it can probably be omitted in this case unless it can be
> rewritten to be more clear (but even then I doubt it is necessary to
> keep it).
Trust me, it took me a while to wrap my head around `struct repository` as well.
I agree if the explanation is too confusing and doesn’t bring any value, it can be omitted.
>
>> Real life use case:
>> 1. Have a bare repo
>> 2. Create a worktree from the bare repo
>> 3. In the secondary worktree enable sparse-checkout - this enables
>> extensions.worktreeConfig and keeps core.bare=true setting in
>> config.worktree of the bare worktree
>> 4. The secondary worktree or any other non-bare worktree created
>> won't be able to use branch main (not even once), but it should be
>> able to.
>
> This is mostly repeating what was said earlier, thus probably isn't
> adding any value to the commit message. I'd probably drop it.
I agree, your improved description captures this scenario perfectly.
>
>> Signed-off-by: Olga Pilipenco <[email protected]>
>> ---
>> Changes since v1:
>>
>> * no code changes
>> * rebased with maint
>> * CC added
>
> Sorry. I've had your v1 sitting in my ever-increasingly-large backlog
> of patches to look at, but have been extra busy the last many months
> and never managed to get to it.
Totally understand. Thanks again for getting to it eventually.
>
>> Existing broken functionality forces our project to use hacks on bare
>> repo that we'd like to avoid. I would really appreciate reviews of this
>> patch to move closer towards fixing the issue. This is my first
>> contribution to git/git, I apologize if I got lost in the instructions,
>> but I tried my best to follow the rules.
>
> Your submission is fine. Unfortunately, the project has a lack of
> reviewers but no lack of submitters, so sometimes patches get
> overlooked or simply buried.
>
>> diff --git a/t/t3200-branch.sh b/t/t3200-branch.sh
>> @@ -410,6 +410,20 @@ test_expect_success 'bare main worktree has HEAD at branch deleted by secondary
>> +test_expect_success 'secondary worktree can switch to main if common dir is bare worktree' '
>
> The use of "common dir" is a bit confusing. Also, this patch is fixing
> the more general problem that secondary worktrees think that the bare
> main worktree has a branch checked out. So, perhaps a better title
> would be:
>
> secondary worktrees recognize core.bare=true in main config.worktree
>
> or something?
Sounds good, will update.
>
>> + test_when_finished "rm -rf bare_repo non_bare_repo secondary_worktree" &&
>> + git init -b main non_bare_repo &&
>> + test_commit -C non_bare_repo x &&
>> +
>> + git clone --bare non_bare_repo bare_repo &&
>> + git -C bare_repo config extensions.worktreeConfig true &&
>> + git -C bare_repo config unset core.bare &&
>> + git -C bare_repo config --worktree core.bare true &&
>> +
>> + git -C bare_repo worktree add ../secondary_worktree &&
>> + git -C secondary_worktree checkout main
>> +'
>
> Very straightforward and exactly what I expected to see once I
> understood the problem.
>
>> diff --git a/worktree.c b/worktree.c
>> @@ -65,6 +65,28 @@ static int is_current_worktree(struct worktree *wt)
>> +static int is_bare_git_dir(const char *git_dir)
>
> Nit: I wonder if a name such as is_main_worktree_bare() would clue
> readers in a bit more?
I was about to explain how I wanted this function to be more generic and handle all sorts of bare and non-bare cases - whether it’s the main worktree or not. However, after seeing your comments and after revisiting the code, I realized that generalization doesn’t really provide much benefit here. It is much clearer if we're explicit that the bare check in this case is only performed on the main worktree. I’ll update it in the next version.
>
>> +{
>> + int bare = 0;
>> + struct config_set cs = { { 0 } };
>
> This is not your fault since this construct is used elsewhere in this
> file (from which I presume you copied it), but project consensus is
> that using the notation `{{0}}` to work around a complaint from the
> Apple compiler (and only the Apple compiler) should be avoided, and
> that `{0}` is preferred. So, if you reroll, changing this to `{0}` may
> make other reviewers happy (or you can leave it as is to be consistent
> with existing precedence in this file; I don't feel strongly about
> it).
I’ll fix it, sounds like a good reason.
>
>> + char *config_file;
>> + char *worktree_config_file;
>> +
>> + config_file = xstrfmt("%s/config", git_dir);
>> + worktree_config_file = xstrfmt("%s/config.worktree", git_dir);
>> +
>> + git_configset_init(&cs);
>> + git_configset_add_file(&cs, config_file);
>> + git_configset_add_file(&cs, worktree_config_file);
>
> Genuine question: I haven't thought too deeply about it, but do we
> gain anything by loading $commondir/config here -- which is shared by
> the main worktree and all secondary worktrees -- considering that it
> was already loaded and consulted by the earlier is-bare check before
> this function was even called?
This function determines if a worktree is bare or not. I want this logic to work even when it’s called from a different context and not rely on other is-bare checks (that are a bit confusing tbh).
>
>> + git_configset_get_bool(&cs, "core.bare", &bare);
>> +
>> + git_configset_clear(&cs);
>> + free(config_file);
>> + free(worktree_config_file);
>> + return bare;
>
> Everything gets cleaned up correctly. Good.
>
>> @@ -77,18 +99,16 @@ static struct worktree *get_main_worktree(int skip_reading_head)
>> + /*
>> + * NEEDSWORK: the_repository is not always main worktree's repository
>> + */
>> worktree->repo = the_repository;
>> worktree->path = strbuf_detach(&worktree_path, NULL);
>
> I found this new NEEDSWORK comment rather confusing the first several
> times I read the patch. It wasn't until I finally realized that the
> reference to `the_repository` here is the same reference to
> `the_repository` in the commit message -- which confused me, as well
> -- that I understood what this was trying to say. The actual problem,
> of course, is that the _configuration_ stored in `the_repository` is
> the secondary worktree's configuration, not the main worktree's
> configuration. Considering that this patch addresses that problem, I'd
> probably just drop this new comment altogether (unless, perhaps, you
> rewrite it to talk about the _configuration_ stored in
> `the_repository`).
This `the_repository` structure is soooo confusing, took me a while to figure out what it is! I would feel guilty not mentioning that under some circumstances `the_repository` assigned here could be not actual configuration of the worktree object. I don’t know if that will ever matter or not, but I find this assignment kinda “stinky” and want everyone to know about it. I don’t want to change this assignment in this patch because it didn’t bring any harm so far. I’ll try again to rephrase this comment, just to give a heads up in case someone experiences “weird” behaviour in this area (same way the previous NEEDSWORK comment gave me ideas why my workflow didn’t work and inspired me to try to fix it).
>
>> - /*
>> - * NEEDSWORK: If this function is called from a secondary worktree and
>> - * config.worktree is present, is_bare_repository_cfg will reflect the
>> - * contents of config.worktree, not the contents of the main worktree.
>> - * This means that worktree->is_bare may be set to 0 even if the main
>> - * worktree is configured to be bare.
>> - */
>> - worktree->is_bare = (is_bare_repository_cfg == 1) ||
>> - is_bare_repository();
>> worktree->is_current = is_current_worktree(worktree);
>> + worktree->is_bare = (is_bare_repository_cfg == 1) ||
>> + is_bare_repository() ||
>> + (!worktree->is_current && is_bare_git_dir(repo_get_common_dir(the_repository)));
>
> This is performing the expensive check only if the earlier checks left
> the question unanswered. Good.
Thanks for the review. I’ll incorporate the changes in my next version and hopefully it will be good to go :tada:
I hope I responded to all the comments, it’s a bit nerve-wrecking to contribute for the first time (so many rules and instructions!) :)
|
33bf8b7
to
6287d45
Compare
On the Git mailing list, Eric Sunshine wrote (reply to this): On Tue, Jan 28, 2025 at 4:45 PM Olga Pilipenco
<[email protected]> wrote:
> > On Jan 19, 2025, at 3:30 PM, Eric Sunshine <[email protected]> wrote:
> > On Thu, Jan 16, 2025 at 4:35 PM Olga Pilipenco via GitGitGadget
> > <[email protected]> wrote:
> > I found that I had to dig around a bit to fully understand the problem
> > expressed by this commit message. Perhaps adding a bit more detail
> > would help? Here's my attempt at rewriting the above (also in a way
> > which is more idiomatic to this project):
> >
> > When extensions.worktreeConfig is true and the main worktree is
> > bare -- that is, its config.worktree file contains core.bare=true
> > -- commands run from secondary worktrees incorrectly see the main
> > worktree as not bare. As such, those commands incorrectly think
> > that the repository's default branch (typically "main" or
> > "master") is checked out in the bare repository even though it's
> > not. This makes it impossible, for instance, to checkout or delete
> > the default branch from a secondary worktree, among other
> > shortcomings.
> >
> > This problem occurs because, when extensions.worktreeConfig is
> > true, commands run in secondary worktrees only consult
> > $commondir/config and $commondir/worktrees/<id>/config.worktree,
> > thus they never see the main worktree's core.bare=true setting in
> > $commondir/config.worktree.
> >
> > Fix this problem by consulting the main worktree's config.worktree
> > file when checking whether it is bare. (This extra work is
> > performed only when running from a secondary worktree.)
>
> Wow, your explanation is so much better than mine.Thank you for
> “translating" it for the world :) I’m still trying to get used to
> the terminology used in this codebase. I’ll steal your description
> for sure (if you don’t mind).
You are more than welcome to use the proposed commit message rewrite.
(If you want to acknowledge assistance rendered, a Helped-by: trailer,
preceding your Signed-off-by:, is the way to do so. Or not. It's up to
you.)
> >> diff --git a/worktree.c b/worktree.c
> >> @@ -65,6 +65,28 @@ static int is_current_worktree(struct worktree *wt)
> >> +static int is_bare_git_dir(const char *git_dir)
> >
> > Nit: I wonder if a name such as is_main_worktree_bare() would clue
> > readers in a bit more?
>
> I was about to explain how I wanted this function to be more generic
> and handle all sorts of bare and non-bare cases - whether it’s the
> main worktree or not. However, after seeing your comments and after
> revisiting the code, I realized that generalization doesn’t really
> provide much benefit here. It is much clearer if we're explicit that
> the bare check in this case is only performed on the main
> worktree. I’ll update it in the next version.
I see. When reviewing, I was wondering why the git-dir was being
passed into the function. Your explanation above answers that
question. On that note, in addition to renaming the function as
suggested, for clarity, I would probably go a bit further and pass in
a `struct repository *` rather than passing in the git-dir itself,
just to make it clear that the function is checking main-worktree
bareness of the repository in question, as opposed to merely checking
bareness of any arbitrary directory. (At least, I would find the
intention more clear at-a-glance with that additional change applied.)
> >> + config_file = xstrfmt("%s/config", git_dir);
> >> + worktree_config_file = xstrfmt("%s/config.worktree", git_dir);
> >> +
> >> + git_configset_init(&cs);
> >> + git_configset_add_file(&cs, config_file);
> >> + git_configset_add_file(&cs, worktree_config_file);
> >
> > Genuine question: I haven't thought too deeply about it, but do we
> > gain anything by loading $commondir/config here -- which is shared by
> > the main worktree and all secondary worktrees -- considering that it
> > was already loaded and consulted by the earlier is-bare check before
> > this function was even called?
>
> This function determines if a worktree is bare or not. I want this
> logic to work even when it’s called from a different context and not
> rely on other is-bare checks (that are a bit confusing tbh).
Agreed about the is-bare checks -- and indeed the entire Git startup
sequence -- being difficult to digest, however...
One reason I asked the question was due to concern that future readers
of this code may very well wonder (as I did) why $commondir/config is
being loaded when doing so is (apparently) unnecessary in this
particular context. The question is especially pertinent given that
this is a private helper function with a single caller. A second
reason was that, over the years, a good deal of effort has been put
into optimizing Git's startup to avoid doing unnecessary work, and
this appears to be unnecessary since $commondir/config would already
have been consulted by earlier checks before this function gets called
(assuming I'm correctly understanding the code-flow).
Anyhow, we can probably punt on the question for the moment and leave
the code as you wrote it if you feel strongly about it or if you think
it is clearer this way for future readers.
> >> + /*
> >> + * NEEDSWORK: the_repository is not always main worktree's repository
> >> + */
> >> worktree->repo = the_repository;
> >> worktree->path = strbuf_detach(&worktree_path, NULL);
> >
> > I found this new NEEDSWORK comment rather confusing the first several
> > times I read the patch. It wasn't until I finally realized that the
> > reference to `the_repository` here is the same reference to
> > `the_repository` in the commit message -- which confused me, as well
> > -- that I understood what this was trying to say. The actual problem,
> > of course, is that the _configuration_ stored in `the_repository` is
> > the secondary worktree's configuration, not the main worktree's
> > configuration. Considering that this patch addresses that problem, I'd
> > probably just drop this new comment altogether (unless, perhaps, you
> > rewrite it to talk about the _configuration_ stored in
> > `the_repository`).
>
> This `the_repository` structure is soooo confusing, took me a while
> to figure out what it is! I would feel guilty not mentioning that
> under some circumstances `the_repository` assigned here could be not
> actual configuration of the worktree object. I don’t know if that
> will ever matter or not, but I find this assignment kinda “stinky”
> and want everyone to know about it. I don’t want to change this
> assignment in this patch because it didn’t bring any harm so far.
> I’ll try again to rephrase this comment, just to give a heads up in
> case someone experiences “weird” behaviour in this area (same way
> the previous NEEDSWORK comment gave me ideas why my workflow didn’t
> work and inspired me to try to fix it).
Likely, the confusion is an outcome of the natural evolution
(mutation) software undergoes. The development of linked worktrees and
the concept of a `repository` structure did not necessarily occur
concurrently. I suppose one could develop one of two views (if not
more) of the `repository`: (1) an in-memory representation of the
".git" directory or bare-repository "object database", including all
worktrees hanging off it, or (2) a single worktree's
view/representation of the repository, meaning paths, configuration,
"index" specific to that worktree.
In the present state of the code, the second view is the more accurate
one, so the existing `worktree->repo = the_repository` assignment does
make sense without any further commentary. My main concern with the
NEEDSWORK comment is that it implies that there is a problem with the
assignment, even though there isn't. While it may be true that the
entire `repository` idea needs to be rethought or clarified or
expanded, that's a global issue permeating the entire code-base, not
specific to this one spot, which is why it feels inappropriate to have
a NEEDSWORK comment here. So, I'm not, in general, opposed to a
comment explaining the the `worktree->repo` assignment for future
readers if you think that would be valuable, but I am concerned about
giving it a "NEEDSWORK" prefix, which feels misleading for this
particular piece of code.
> Thanks for the review. I’ll incorporate the changes in my next
> version and hopefully it will be good to go :tada: I hope I
> responded to all the comments, it’s a bit nerve-wrecking to
> contribute for the first time (so many rules and instructions!) :)
Understood, and it didn't help the nerve situation when your v1 was
apparently ignored. Rest assured, though, that your submission was
nicely done and fixes a real problem which ought to be addressed. (In
fact, I'm surprised it took this long for someone to tackle it. So,
thanks.) |
When extensions.worktreeConfig is true and the main worktree is bare -- that is, its config.worktree file contains core.bare=true -- commands run from secondary worktrees incorrectly see the main worktree as not bare. As such, those commands incorrectly think that the repository's default branch (typically "main" or "master") is checked out in the bare repository even though it's not. This makes it impossible, for instance, to checkout or delete the default branch from a secondary worktree, among other shortcomings. This problem occurs because, when extensions.worktreeConfig is true, commands run in secondary worktrees only consult $commondir/config and $commondir/worktrees/<id>/config.worktree, thus they never see the main worktree's core.bare=true setting in $commondir/config.worktree. Fix this problem by consulting the main worktree's config.worktree file when checking whether it is bare. (This extra work is performed only when running from a secondary worktree.) Signed-off-by: Olga Pilipenco <[email protected]>
6287d45
to
fda52a1
Compare
On the Git mailing list, Junio C Hamano wrote (reply to this): Eric Sunshine <[email protected]> writes:
>> Wow, your explanation is so much better than mine.Thank you for
>> “translating" it for the world :) I’m still trying to get used to
>> the terminology used in this codebase. I’ll steal your description
>> for sure (if you don’t mind).
>
> You are more than welcome to use the proposed commit message rewrite.
>
> (If you want to acknowledge assistance rendered, a Helped-by: trailer,
> preceding your Signed-off-by:, is the way to do so. Or not. It's up to
> you.)
> ...
> Agreed about the is-bare checks -- and indeed the entire Git startup
> sequence -- being difficult to digest, however...
> ...
Thanks for an easy-to-read review (and thanks OP for working on it,
of course).
|
On the Git mailing list, Eric Sunshine wrote (reply to this): On Thu, Jan 30, 2025 at 2:09 AM Olga Pilipenco <[email protected]> wrot
> On Jan 29, 2025, at 6:41 AM, Eric Sunshine <[email protected]> wrote:
> > I see. When reviewing, I was wondering why the git-dir was being
> > passed into the function. Your explanation above answers that
> > question. On that note, in addition to renaming the function as
> > suggested, for clarity, I would probably go a bit further and pass in
> > a `struct repository *` rather than passing in the git-dir itself,
> > just to make it clear that the function is checking main-worktree
> > bareness of the repository in question, as opposed to merely checking
> > bareness of any arbitrary directory. (At least, I would find the
> > intention more clear at-a-glance with that additional change applied.)
>
> Indeed, no need to pass git-dir anymore. There is actually no need
> to pass `the_repository` because it’s global. I like how this
> simplified things and made code clearer.
The reason I suggested passing in a `struct repository *` is that the
project is slowly moving away from the `the_repository` global, so
making this new function accept a `struct repository *` as its sole
argument means less work later on.
> > One reason I asked the question was due to concern that future readers
> > of this code may very well wonder (as I did) why $commondir/config is
> > being loaded when doing so is (apparently) unnecessary in this
> > particular context. The question is especially pertinent given that
> > this is a private helper function with a single caller. A second
> > reason was that, over the years, a good deal of effort has been put
> > into optimizing Git's startup to avoid doing unnecessary work, and
> > this appears to be unnecessary since $commondir/config would already
> > have been consulted by earlier checks before this function gets called
> > (assuming I'm correctly understanding the code-flow).
>
> I trust your judgement and knowledge of the code and really like the
> reasons presented. I’ll change this function to only check for
> worktree config. However I’d like to give it a good name where it’s
> clear we only check worktree config. It’s a bit challenging to make
> it short-ish and not to include multiple “worktree” words in the
> name. Before I submit a new release, maybe I have time to quickly
> align on the name. What do you think about this one:
>
> is_main_worktree_bare_in_worktree_config
>
> (It will check if bare=true in the main worktree’s worktree.config)
>
> Naming is harder than the code itself :)
It's a historic "accident" that when worktree support was designed,
the idea of linking worktrees to a bare repository was not considered.
Support for using worktrees with a bare repository was added later.
However, by that time, the term "main worktree" was already well
established, with the very unfortunate result that even when there is
no actual "main worktree" but only a bare repository with "linked
worktrees" hanging off it, the repository itself is usually referred
to as the "bare main worktree", which is an obvious misnomer; the
repository is just a repository (i.e. the object database and other
meta-information) and there is no actual main worktree.
Given the very real potential for confusion when employing the "bare
main worktree" misnomer, I suspect that we won't be able to come up
with a good name which easily conveys the function's purpose. Since
this is an internal helper (hence, there is slightly less pressure to
come up with a perfect name) rather than public API, this might be one
of those cases in which it makes more sense to choose a concise name
and then explain the function's purpose with a short comment block.
Perhaps something like this would be most helpful to future readers of
the code:
/*
* When in a secondary worktree, and when extensions.worktreeConfig
* is true, only $commondir/config and $commondir/worktrees/<id>/
* config.worktree are consulted, hence any core.bare=true setting in
* $commondir/config.worktree gets overlooked. Thus, check it manually
* to determine if the repository is bare.
*/
static int is_repo_bare(struct repository *r) {...}
> Thank you for this thorough explanation. I’ll drop the comment
> completely. Less code to read. (To be honest I’m not a big fan of
> comments and definitely don’t want to introduce confusing comments
> :)
Understood. Self-explanatory code is preferred. That said, a comment
such as the one proposed above can really help readers not intimately
familiar with this otherwise nonobvious behavior, thus may be
justified.
> Fun fact: my email app kept changing “worktree” to “corktree”. Maybe
> another git feature?
Your mailer obviously suffers from Gitophobia or is perhaps a nemophilist.
(By the way, when replying, please use the normal ">", "> >", "> > >"
markers to signify quoted portions of earlier messages in the thread
rather than using only indentation. The reason I make this request is
that when I replied to your message, my mailer stripped away all
indentation from your message, leaving all earlier quoted portions
flush with the left margin, which made it very difficult to figure out
which quotes came from which authors from which earlier messages, and
I ended up having to reinsert the "> >" markers manually to restore
structure to my reply.) |
On the Git mailing list, Eric Sunshine wrote (reply to this): On Thu, Jan 30, 2025 at 9:32 AM Eric Sunshine <[email protected]> wrote:
> (By the way, when replying, please use the normal ">", "> >", "> > >"
> markers to signify quoted portions of earlier messages in the thread
> rather than using only indentation. The reason I make this request is
> that when I replied to your message, my mailer stripped away all
> indentation from your message, leaving all earlier quoted portions
> flush with the left margin, which made it very difficult to figure out
> which quotes came from which authors from which earlier messages, and
> I ended up having to reinsert the "> >" markers manually to restore
> structure to my reply.)
Examining more closely, I see that your message was in fact multipart
MIME with an HTML portion which was using <blockquote> and whatnot,
which my (plain text) mailer stripped out, hence lost all formatting.
If you can configure your mailer to send plain text and use the normal
">" markers, that would be generally helpful on this list. |
Changes since v2:
{ { 0 } }
is replaced with{0}
CC: Patrick Steinhardt [email protected], Eric Sunshine [email protected], Johannes Schindelin [email protected], René Scharfe [email protected], Junio C Hamano [email protected]