forked from tensorflow/tensorflow
Develop upstream sync 230201 #1987
Merged
Before this change, XLA compilation failed when calling `tf.image.non_max_suppression_padded` with an undetermined batch size for the input boxes. The failure occurred because:
1. When this op is called with parameter `sorted_input=False`, `_sort_scores_and_boxes` is called.
2. In `_sort_scores_and_boxes`, if the input boxes have a dynamic batch size, `tf.shape(boxes)[1]` is unknown at XLA compilation time because of the reshape operation.
3. In the while loop body (`supression_loop_body`), `num_tiles` cannot be determined because `tf.shape(boxes)[1]` is unknown during compilation.
4. The op `math_ops.range(num_tiles)` fails to compile because `num_tiles` is unknown at compilation time.
This change fixes the issue by adding `num_boxes` to the reshape op. PiperOrigin-RevId: 504902678
Prior to this change, it only accepted 1, false, or true. PiperOrigin-RevId: 504902827
In a similar fashion to the tfl.sin and tfl.cos legalizations, a tosa.table provides an atan approximation. Additional logic is then used to determine the correct quadrant of the atan2 function. Signed-off-by: Luke Hutton <[email protected]> Change-Id: Iae1384009d825d01e5cf48ad7c3ff8fba77114cf
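The table-plus-quadrant idea described above can be illustrated with a small floating-point sketch. This is not the TOSA legalization itself: the table size, the linear interpolation, the choice to tabulate atan on [0, 1], and the names `AtanTable`, `TableAtan`, and `ApproxAtan2` are all assumptions made for illustration.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>

// Assumed table resolution for this sketch (not taken from the commit).
constexpr std::size_t kTableSize = 257;
constexpr double kPi = 3.14159265358979323846;

// Lookup table holding atan sampled uniformly over [0, 1].
inline const std::array<double, kTableSize>& AtanTable() {
  static const std::array<double, kTableSize> table = [] {
    std::array<double, kTableSize> t{};
    for (std::size_t i = 0; i < kTableSize; ++i) {
      t[i] = std::atan(static_cast<double>(i) / (kTableSize - 1));
    }
    return t;
  }();
  return table;
}

// Approximates atan(r) for r in [0, 1] by linear interpolation in the table.
inline double TableAtan(double r) {
  assert(r >= 0.0 && r <= 1.0);
  const auto& table = AtanTable();
  const double pos = r * (kTableSize - 1);
  const std::size_t lo = static_cast<std::size_t>(pos);
  const std::size_t hi = lo + 1 < kTableSize ? lo + 1 : lo;
  const double frac = pos - static_cast<double>(lo);
  return table[lo] + frac * (table[hi] - table[lo]);
}

// Hypothetical helper: atan2 built from the table plus quadrant fix-ups.
// Infinities and NaNs are not handled in this sketch.
inline double ApproxAtan2(double y, double x) {
  const double ax = std::fabs(x);
  const double ay = std::fabs(y);
  if (ax == 0.0 && ay == 0.0) return 0.0;  // convention chosen for this sketch
  const double larger = ax > ay ? ax : ay;
  const double smaller = ax > ay ? ay : ax;
  double angle = TableAtan(smaller / larger);  // first octant: [0, pi/4]
  if (ay > ax) angle = kPi / 2 - angle;        // reflect across the 45-degree line
  if (x < 0.0) angle = kPi - angle;            // left half-plane
  if (y < 0.0) angle = -angle;                 // lower half-plane
  return angle;
}
```

Tabulating only the ratio of the smaller to the larger magnitude keeps the table input bounded, and the three sign and magnitude checks recover the full range of atan2.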
* remove unnecessary type checks
* add note about numerical behaviour of std::atan2
* improve error message for expected inputs
* undo change updating copyright year
Change-Id: Iea8339da437a5ff3e6fe065c715c2c97e696fdbb
Currently, users have to provide a dummy deleter if they want to declare an ExecutionReference variable. This change solves that issue by declaring ExecutionReference as a class that wraps std::unique_ptr<ExecutionContext, void(*)(ExecutionContext *)>. Ex. struct Test { ExecutionReference exec_ref; }; PiperOrigin-RevId: 504904730
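A minimal sketch of the wrapper idea follows, to show why a class makes `struct Test { ExecutionReference exec_ref; };` compile without a dummy deleter. The `ExecutionContext` stand-in, `DestroyContext`, `DeletedCount`, and the no-op default deleter are assumptions for illustration and are not taken from the actual change.

```cpp
#include <cassert>
#include <memory>
#include <utility>

// Hypothetical stand-in for the real ExecutionContext type.
struct ExecutionContext {
  explicit ExecutionContext(int id) : id(id) {}
  int id;
};

// Counts deletions so callers can observe when the deleter runs.
inline int& DeletedCount() {
  static int count = 0;
  return count;
}

// Hypothetical deleter used by owners of a real context.
inline void DestroyContext(ExecutionContext* ctx) {
  ++DeletedCount();
  delete ctx;
}

// Sketch of a wrapper class. A std::unique_ptr with a function-pointer
// deleter has no usable default constructor, so the wrapper supplies a
// no-op deleter itself and becomes default-constructible.
class ExecutionReference {
 public:
  using Deleter = void (*)(ExecutionContext*);

  ExecutionReference() : ptr_(nullptr, &NoOpDeleter) {}
  ExecutionReference(ExecutionContext* ctx, Deleter deleter)
      : ptr_(ctx, deleter) {
    assert(deleter != nullptr);
  }

  ExecutionContext* get() const { return ptr_.get(); }
  ExecutionContext* operator->() const { return ptr_.get(); }
  explicit operator bool() const { return static_cast<bool>(ptr_); }

 private:
  static void NoOpDeleter(ExecutionContext*) {}
  std::unique_ptr<ExecutionContext, Deleter> ptr_;
};

// With the wrapper, a plain member declaration needs no deleter argument.
struct Test {
  ExecutionReference exec_ref;
};
```

Because the only data member is a `std::unique_ptr`, the wrapper stays move-only and the deleter travels with the pointer on move.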
…e_spec_registry.py`. PiperOrigin-RevId: 504905987
PiperOrigin-RevId: 504906696
…hod in TF selective_registration_header_lib. PiperOrigin-RevId: 504910088
PiperOrigin-RevId: 504910311
PiperOrigin-RevId: 504910444
Move verifyRecvOp and verifyInfeedOp with TokenType openxla/stablehlo#852 Move inferFftOp and verifyRngOp with Enums openxla/stablehlo#853 PiperOrigin-RevId: 504910725
… to `framework/type_spec_registry.py`. PiperOrigin-RevId: 504911860
Ensure that there is at least a single non control input to the Identity/IdentityN. PiperOrigin-RevId: 504913650
…` to `framework/type_spec_registry.py`. PiperOrigin-RevId: 504917585
PiperOrigin-RevId: 504920230
PiperOrigin-RevId: 504922596
PiperOrigin-RevId: 504923081
PiperOrigin-RevId: 504928473
Export GetBestAlgorithm from GemmAlgorithmPicker so that runtime autotuning can use it as a subroutine. PiperOrigin-RevId: 504928946
PiperOrigin-RevId: 504930976
The shape refiner reruns shape inference for nested function calls every time a materialized argument is requested. Before this change, the constant folding mechanism used in the refiner stopped traversing the subgraph right after encountering the first unresolved argument. After this change, it continues the traversal to cover all unresolved arguments. This change reduces the preprocessing time of some of our models by 75%. PiperOrigin-RevId: 504931297
PiperOrigin-RevId: 504935198
PiperOrigin-RevId: 504937807
…otuner. PiperOrigin-RevId: 504939086
PiperOrigin-RevId: 504939291
This breaks a cycle between "python/framework/graph_util_impl.py" and "python/framework/convert_to_constants.py". PiperOrigin-RevId: 504939493
PiperOrigin-RevId: 504940089
PiperOrigin-RevId: 504940378
* Cleanup of error messages.
* Cleanup casting inputs/outputs.
* Use CHECK-DAG instead of CHECK for constants.
* Spelling
Change-Id: Ibf01ac5b711944bb2efbd7184820a304cbae8501 Signed-off-by: Luke Hutton <[email protected]>
PiperOrigin-RevId: 504949702
This adds a pass that provides some debug info with which basic line number info can be generated. Adapted from Flang's AddDebugFoundationPass. PiperOrigin-RevId: 506213461
PiperOrigin-RevId: 506217195
PiperOrigin-RevId: 506225078
…izontal_loop_fusion PiperOrigin-RevId: 5062358
This fixes double-free errors and memory leaks, for example when running the HLO fails. The old code path is also kept, as a lot of our code depends on the ability to run the same HLO multiple times without reallocating the input buffers. PiperOrigin-RevId: 506238363
PiperOrigin-RevId: 506239134
PiperOrigin-RevId: 506239156
PiperOrigin-RevId: 506240202
PiperOrigin-RevId: 506243978
PiperOrigin-RevId: 506244287
PiperOrigin-RevId: 506252805
retest Ubuntu-GPU-single please
retest Ubuntu-sanity please
retest Ubuntu-GPU-single please
jayfurmanek approved these changes on Feb 3, 2023
Disable unit tests:
multi_client_test_nccl_local_2gpus Failed UT: multi_client_test_nccl_local_2gpus #1980
device_tracer_test_gpu https://github.com/ROCmSoftwarePlatform/frameworks-internal/issues/3175
To fix the estimator_export error, I put estimator_export back into tf_export tensorflow@3837d0f
cudnn_determinstic_ops_test needs to be checked. https://github.com/ROCmSoftwarePlatform/frameworks-internal/issues/3563
5. tensor_or_memref.h fails to build in the CPU test. Also, roc_blas.cc needs to be upstreamed.
https://github.com/ROCmSoftwarePlatform/frameworks-internal/issues/3547