More efficient `breakString` #177

michael-schwarz · 2025-01-14T16:37:18Z

Closes #169.

See also goblint/analyzer#1513 that also has some runtime comparisons.

michael-schwarz · 2025-01-14T20:38:40Z

This was a bit of a pain because I did not consider that building the result from right to left means that the accumulator cannot be used to pass in a prefix. I'll squash-merge then.

I am confident this is correct now: it passes all printing tests in this repo as well as all cram tests in the goblint repo, which also exercise this function.
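
To illustrate the accumulator pitfall mentioned above, here is a minimal sketch. It assumes a simplified `doc` type standing in for the one in `pretty.ml`; the names `seeded`, `prepended`, and `pieces` are hypothetical and are not taken from the actual patch. When the result is built from right to left, whatever is passed in as the seed ends up at the end of the document, so a prefix has to be attached separately.

```ocaml
(* Simplified stand-in for the document type; the real one has more constructors. *)
type doc = Nil | Text of string | Line | Concat of doc * doc

(* Naive renderer, only used to inspect the result. *)
let rec render = function
  | Nil -> ""
  | Text s -> s
  | Line -> "\n"
  | Concat (a, b) -> render a ^ render b

(* Hypothetical input: a string already split into lines. *)
let pieces = [ "a"; "b" ]

(* Right-to-left build seeded with the prefix: the seed is the innermost,
   rightmost element, so the "prefix" is printed last. *)
let seeded (prefix : doc) : doc =
  List.fold_right
    (fun l acc -> Concat (Text l, Concat (Line, acc)))
    pieces prefix

(* Build the body with a neutral seed and prepend the prefix afterwards. *)
let prepended (prefix : doc) : doc =
  let body =
    List.fold_right
      (fun l acc -> Concat (Text l, Concat (Line, acc)))
      pieces Nil
  in
  Concat (prefix, body)
```

With `Text "P:"` as the prefix, `seeded` renders the prefix after the lines, whereas `prepended` renders it first.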

sim642 · 2025-01-16T09:17:08Z

This now traverses the string right-to-left (the Stdlib splitting does so because that is what constructing a list requires), but the old version went left-to-right.
Also, it looks like the document tree from this will be completely different, no? It leans in a different direction, has Lines in different places, and contains no CText (which I assume is an optimization to keep trees smaller).

It looks like Pretty significantly prefers some tree shapes over others and has flatten to improve them. So we need to be quite careful with these details: otherwise we might make breakString faster but make printing its result later inefficient.
Even if the default flattenBeforePrint would take care of it, constructing the correct tree directly in the first place would be good.
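
The difference in tree shape can be sketched as follows. This is only an illustration under assumptions: the `doc` type is a simplified stand-in for the one in `pretty.ml` (no CText, among others), and `break_left` / `break_right` are hypothetical names, not the old or new breakString.

```ocaml
(* Simplified stand-in for the document type. *)
type doc = Nil | Text of string | Line | Concat of doc * doc

(* Left-to-right construction: every new piece is appended on the right
   of everything built so far, so the tree leans to the left. *)
let break_left (s : string) : doc =
  match String.split_on_char '\n' s with
  | [] -> Nil
  | first :: rest ->
    List.fold_left
      (fun acc l -> Concat (Concat (acc, Line), Text l))
      (Text first) rest

(* Right-to-left construction: the rest of the string is nested in the
   right child, so the tree leans to the right. *)
let break_right (s : string) : doc =
  let rec go = function
    | [] -> Nil
    | [ last ] -> Text last
    | l :: rest -> Concat (Text l, Concat (Line, go rest))
  in
  go (String.split_on_char '\n' s)

(* Naive renderer, only used to check both trees print the same text. *)
let rec render = function
  | Nil -> ""
  | Text s -> s
  | Line -> "\n"
  | Concat (a, b) -> render a ^ render b

(* Length of the leftmost / rightmost spine of Concat nodes. *)
let rec left_depth = function Concat (a, _) -> 1 + left_depth a | _ -> 0
let rec right_depth = function Concat (_, b) -> 1 + right_depth b | _ -> 0
```

Both functions render to the same text, but for `"a\nb\nc"` the first has a left spine of length 4 and the second a right spine of length 4.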

michael-schwarz · 2025-01-16T15:06:55Z

Judging by the comments a few lines below, this right-heavy tree actually seems preferable over the one they had previously:

cil/src/ocamlutil/pretty.ml

Lines 116 to 119 in f5ee39b

    (* Note that the ++ operator in Ocaml are left-associative. This means
       that if you have a long list of ++ then the whole thing is very unbalanced
       towards the left side. This is the worst possible case since scanning the
       left side of a Concat is the non-tail recursive case. *)

michael-schwarz · 2025-01-16T15:10:44Z

cil/src/ocamlutil/pretty.ml

Lines 227 to 231 in f5ee39b

    (* When we construct documents, most of the time they are heavily unbalanced
       towards the left. This is due to the left-associativity of ++ and also to
       the fact that constructors such as docList construct from the let of a
       sequence. We would prefer to shift the imbalance to the right to avoid
       consuming a lot of stack when we traverse the document *)

It seems like the new function I give constructs more or less the tree you would get after calling flatten?

sim642 · 2025-01-17T10:48:43Z

> It seems like the new function I give constructs more or less the tree you would get after calling flatten?

That's true, which maybe is a good thing.
Although it does also have an interesting effect: since flatten is written to do tail-recursion on the left, it does non-tail-recursion on the right. So when flattening a right-leaning tree, it actually goes into the very same deep recursion that the whole flattening process is trying to avoid later when doing the printing.
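
A minimal sketch of that recursion pattern, assuming a simplified `doc` type and a `flatten` modeled on the description above rather than copied from `pretty.ml`: the call on the left child is a tail call, while the call on the right child is nested inside it and therefore consumes stack proportional to the depth of the right spine.

```ocaml
(* Simplified stand-in for the document type. *)
type doc = Nil | Text of string | Line | Concat of doc * doc

(* Rebuilds [d] followed by [acc] as a right-leaning tree.
   The outer call (on the left child d1) is a tail call;
   the inner call (on the right child d2) is not. *)
let rec flatten (acc : doc) (d : doc) : doc =
  match d with
  | Concat (d1, d2) -> flatten (flatten acc d2) d1
  | Nil -> acc (* drop empty documents *)
  | leaf -> Concat (leaf, acc)

(* The same content, once leaning left and once leaning right. *)
let left_leaning = Concat (Concat (Concat (Text "a", Line), Text "b"), Nil)
let right_leaning = Concat (Text "a", Concat (Line, Concat (Text "b", Nil)))

(* Both inputs flatten to this right-leaning tree. *)
let expected = Concat (Text "a", Concat (Line, Concat (Text "b", Nil)))
```

Both inputs flatten to the same tree. The difference lies only in which call does the deep descent: for `left_leaning` it is the tail call, for `right_leaning` it is the nested one.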

So it's not clear to me which parts of the old behavior should be retained (for the sake of optimization) and which are actually inconsequential (given that breakString wasn't really thought through for efficiency). Maybe it's not worth preserving all the old behavior after all.

michael-schwarz · 2025-01-17T13:06:49Z

Given that we did not encounter a stack overflow in goblint/analyzer#1513, where I also used a non-tail-recursive helper, I would assume that this is not problematic here either.

If we run into any trouble, we can always go back to a more faithful replacement.

michael-schwarz added 2 commits January 14, 2025 17:32

More efficient breakString (#169)

6287828

Pull out variable

2faf732

michael-schwarz added the enhancement label Jan 14, 2025

michael-schwarz requested a review from sim642 January 14, 2025 16:37

michael-schwarz changed the title ~~More efficient breakString (#169)~~ More efficient breakString Jan 14, 2025

michael-schwarz marked this pull request as draft January 14, 2025 16:48

michael-schwarz force-pushed the issue_169 branch from cac00a5 to 195de1b Compare January 14, 2025 19:51

Fix duplicate lines

f99b977

michael-schwarz force-pushed the issue_169 branch from 195de1b to f99b977 Compare January 14, 2025 19:54

michael-schwarz added 2 commits January 14, 2025 21:27

Fix acc position

98fb22b

Indent

bd042da

michael-schwarz marked this pull request as ready for review January 14, 2025 20:36

sim642 reviewed Jan 17, 2025

Simplify generated string

4d77053
