Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
update lars partition function removing
lar_row_counts_by_lei
parameter.Originally, it's used to calculate the size of the lar data chunks returned by sql query. this means that we need to run another sql query.
Found out that the chunks list is returned as iterable type so we can use regular loop to into each chunk (and skip the
lar_row_counts_by_lei
sql query)Tested by running 2023 annual and Q3 lars data locally (
lar_raw_parquets_2023
andlar_raw_parquets_2023_q2
outputs) and verified that the parquet output files count and contents are the samecloses #11