-
Notifications
You must be signed in to change notification settings - Fork 152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: Improve audit time #545
base: main
Are you sure you want to change the base?
Conversation
ed48220
to
b449541
Compare
Based on a wheel of pytorch, I test the speed and size of different compress level of zip. There's some outliner in extract time, but I haven't double check them yet, let's just ignore them. Based on size, 0->1 reduced over 50% size and 3->4 reduced >1%, all other levels have a <1% contribution. It should be reasonable to set default level (from 6) to 4, and adding another cli argument to specify compress level (to 0 or 9 if user know what they're doing) also makes sense to me. so files only:
all files:
|
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #545 +/- ##
==========================================
- Coverage 92.07% 91.48% -0.60%
==========================================
Files 22 22
Lines 1553 1597 +44
Branches 285 293 +8
==========================================
+ Hits 1430 1461 +31
- Misses 74 81 +7
- Partials 49 55 +6 ☔ View full report in Codecov by Sentry. |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
With the demo in PR, the time has been reduced from 3m36.515s to 2m44.353s. Concurrent grafting seems not significent, I'll see the result on our real workload later. Only change compress level to 4: 2m53.029s |
Numbers: The repair part still seem potential to be optimized. |
@oraluben, thanks for digging into this. |
For the " do not re-unzip wheel if possible", I'd probably move unzipping at the main_repair/main_show level not to introduce complexity with caching. |
for more information, see https://pre-commit.ci
Motivation: We're using auditwheel to repair our internal PyTorch build, the overall time is ~10m. We're attempting to reduce the redundant operations that we've found, e.g. unzips.
This PR is not finalized yet but reviews are welcome!