Develop upstream sync 230201 #1987

Merged 545 commits on Feb 3, 2023


@i-chaochen commented Feb 1, 2023

Disable unit tests:

  1. multi_client_test_nccl_local_2gpus (Failed UT: multi_client_test_nccl_local_2gpus #1980)

  2. device_tracer_test_gpu https://github.com/ROCmSoftwarePlatform/frameworks-internal/issues/3175

  3. To fix the estimator_export error, I put estimator_export back into tf_export (tensorflow@3837d0f).

  4. cudnn_deterministic_ops_test needs to be checked. https://github.com/ROCmSoftwarePlatform/frameworks-internal/issues/3563

  5. tensor_or_memref.h fails to build in the CPU test.

Also, roc_blas.cc needs to be upstreamed.
https://github.com/ROCmSoftwarePlatform/frameworks-internal/issues/3547

tensorflower-gardener and others added 30 commits January 26, 2023 11:59
Before this change, XLA compilation failed when calling `tf.image.non_max_suppression_padded` with an undetermined batch size for the input boxes.

The failure occurs because:
1. When this op is called with `sorted_input=False`, `_sort_scores_and_boxes` is called.
2. In `_sort_scores_and_boxes`, if the input boxes have a dynamic batch size, `tf.shape(boxes)[1]` is unknown at XLA compilation time because of the reshape operation there.
3. In the while loop body (`_suppression_loop_body`), `num_tiles` cannot be determined because `tf.shape(boxes)[1]` is unknown during compilation.
4. The op `math_ops.range(num_tiles)` fails to compile because `num_tiles` is unknown at compilation time.

This change fixes the issue by adding `num_boxes` to the reshape op.

PiperOrigin-RevId: 504902678
Prior to this change, it only accepted 1, false, or true.

PiperOrigin-RevId: 504902827
In a similar fashion to tfl.sin and tfl.cos legalization,
a tosa.table provides an atan approximation. Additional logic
is then used to determine the correct quadrant of the atan2
function.

Signed-off-by: Luke Hutton <[email protected]>
Change-Id: Iae1384009d825d01e5cf48ad7c3ff8fba77114cf
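
For readers unfamiliar with the approach, here is a minimal sketch in plain C++ of the general idea described above: a lookup table approximates atan on [0, 1], and extra logic maps the result to the correct quadrant. This is not the TOSA legalization itself. The table size, the linear interpolation, and the names `BuildAtanTable`, `TableAtan`, and `ApproxAtan2` are assumptions made for illustration only.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>

// Hypothetical table size; the table layout used by the real legalization
// is not shown in the commit message.
constexpr std::size_t kTableSize = 513;
constexpr double kPi = 3.14159265358979323846;

// Samples atan(t) uniformly for t in [0, 1].
inline std::array<double, kTableSize> BuildAtanTable() {
  std::array<double, kTableSize> table{};
  for (std::size_t i = 0; i < kTableSize; ++i) {
    table[i] = std::atan(static_cast<double>(i) /
                         static_cast<double>(kTableSize - 1));
  }
  return table;
}

// Approximates atan(t) for t in [0, 1] by linear interpolation in the table.
inline double TableAtan(double t) {
  static const std::array<double, kTableSize> table = BuildAtanTable();
  assert(t >= 0.0 && t <= 1.0);
  const double pos = t * static_cast<double>(kTableSize - 1);
  const std::size_t lo = static_cast<std::size_t>(pos);
  const std::size_t hi = (lo + 1 < kTableSize) ? lo + 1 : lo;
  const double frac = pos - static_cast<double>(lo);
  return table[lo] + frac * (table[hi] - table[lo]);
}

// Approximates atan2(y, x) for finite inputs. Signed zeros are not
// distinguished, so results can differ from std::atan2 for y == -0.0.
inline double ApproxAtan2(double y, double x) {
  const double ax = std::fabs(x);
  const double ay = std::fabs(y);
  if (ax == 0.0 && ay == 0.0) return 0.0;  // Convention chosen for this sketch.

  // Reduce to a ratio in [0, 1] so the table covers the whole input range.
  double angle = TableAtan(std::min(ax, ay) / std::max(ax, ay));
  if (ay > ax) angle = kPi / 2.0 - angle;  // Reflect across the 45-degree line.
  if (x < 0.0) angle = kPi - angle;        // Left half-plane.
  if (y < 0.0) angle = -angle;             // Lower half-plane.
  return angle;
}
```

With these assumptions, `ApproxAtan2(1.0, -1.0)` lands in the second quadrant and agrees with `std::atan2(1.0, -1.0)` to within the interpolation error of the table.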
* remove unnecessary type checks
* add note about numerical behaviour of std::atan2
* improve error message for expected inputs
* undo change updating copyright year

Change-Id: Iea8339da437a5ff3e6fe065c715c2c97e696fdbb
Currently, users have to provide a dummy deleter if they want to declare an ExecutionReference variable. This change solves that issue by declaring ExecutionReference as a class that wraps std::unique_ptr<ExecutionContext, void(*)(ExecutionContext *)>.

Ex.
struct Test {
  ExecutionReference exec_ref;
};

PiperOrigin-RevId: 504904730
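
A rough sketch of the pattern described above, for illustration only. A `std::unique_ptr` whose deleter is a function pointer cannot be default-constructed, which is why a dummy deleter was needed before; a wrapper class can supply one in its default constructor. The `ExecutionContext` struct here is a stand-in, and the member names (`get`, `NoOpDeleter`, `ptr_`) are assumptions, not the actual class from the commit.

```cpp
#include <cassert>
#include <memory>

// Stand-in type; the real ExecutionContext is not shown in the commit message.
struct ExecutionContext {
  int id;
};

// Hypothetical wrapper around a unique_ptr with a function-pointer deleter.
class ExecutionReference {
 public:
  using Deleter = void (*)(ExecutionContext*);

  // Default construction yields an empty reference with a no-op deleter,
  // so callers no longer have to pass a dummy deleter themselves.
  ExecutionReference() : ptr_(nullptr, &NoOpDeleter) {}

  // Takes ownership of ctx; deleter is invoked when the reference dies.
  ExecutionReference(ExecutionContext* ctx, Deleter deleter)
      : ptr_(ctx, deleter) {}

  ExecutionContext* get() const { return ptr_.get(); }
  ExecutionContext* operator->() const { return ptr_.get(); }
  explicit operator bool() const { return static_cast<bool>(ptr_); }

 private:
  static void NoOpDeleter(ExecutionContext*) {}

  std::unique_ptr<ExecutionContext, Deleter> ptr_;
};

// Mirrors the example from the commit message: the member can now be
// declared without any explicit initialization.
struct Test {
  ExecutionReference exec_ref;
};
```

Under these assumptions, `Test t;` compiles and leaves `t.exec_ref` empty, while a reference constructed with a real context and deleter releases it exactly once on destruction.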
…e_spec_registry.py`.

PiperOrigin-RevId: 504905987
PiperOrigin-RevId: 504906696
…hod in TF selective_registration_header_lib.

PiperOrigin-RevId: 504910088
PiperOrigin-RevId: 504910444
Move verifyRecvOp and verifyInfeedOp with TokenType
openxla/stablehlo#852

Move inferFftOp and verifyRngOp with Enums
openxla/stablehlo#853

PiperOrigin-RevId: 504910725
… to `framework/type_spec_registry.py`.

PiperOrigin-RevId: 504911860
Ensure that there is at least a single non control input to the Identity/IdentityN.

PiperOrigin-RevId: 504913650
…` to `framework/type_spec_registry.py`.

PiperOrigin-RevId: 504917585
Export GetBestAlgorithm from GemmAlgorithmPicker so that runtime autotuning can use it as a subroutine.

PiperOrigin-RevId: 504928946
The shape refiner reruns shape inference for nested function calls every time a materialized argument is requested. Before this change, the constant folding mechanism used in the refiner stopped traversing the subgraph as soon as it reached the first unresolved argument. After this change, it continues the traversal to cover all unresolved arguments.

This change reduces preprocessing time of some of our models by 75%.

PiperOrigin-RevId: 504931297
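
One way to read the change above, sketched on a toy graph: a traversal that stops at the first unresolved argument reports only one argument per pass, whereas a traversal that keeps going reports all of them in a single pass, so fewer reruns are needed. The `Node` struct, the `CollectUnresolvedArgs` function, and the `stop_at_first` flag are hypothetical and do not correspond to the shape refiner's actual code.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Toy graph node; inputs are indices into the graph vector.
struct Node {
  std::string name;
  bool is_unresolved_arg;
  std::vector<std::size_t> inputs;
};

// Walks the subgraph feeding `root` depth-first and returns the names of the
// unresolved arguments it reaches. With stop_at_first == true the walk ends
// at the first one (the old behavior as described); otherwise it continues
// until every reachable unresolved argument has been collected.
std::vector<std::string> CollectUnresolvedArgs(const std::vector<Node>& graph,
                                               std::size_t root,
                                               bool stop_at_first) {
  std::vector<std::string> found;
  std::vector<bool> visited(graph.size(), false);
  std::vector<std::size_t> stack{root};
  while (!stack.empty()) {
    const std::size_t id = stack.back();
    stack.pop_back();
    if (visited[id]) continue;
    visited[id] = true;

    const Node& node = graph[id];
    if (node.is_unresolved_arg) {
      found.push_back(node.name);
      if (stop_at_first) break;
      continue;  // Arguments have no inputs worth exploring.
    }
    for (std::size_t input : node.inputs) stack.push_back(input);
  }
  return found;
}
```

For a graph in which an op depends on two arguments, the first mode returns one name and would need a second pass to discover the other; the second mode returns both at once.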
This breaks a cycle between "python/framework/graph_util_impl.py" and "python/framework/convert_to_constants.py".

PiperOrigin-RevId: 504939493
* Cleanup of error messages.
* Cleanup casting inputs/outputs.
* Use CHECK-DAG instead of CHECK for constants.
* Spelling

Change-Id: Ibf01ac5b711944bb2efbd7184820a304cbae8501
Signed-off-by: Luke Hutton <[email protected]>
PiperOrigin-RevId: 504949702
tyb0807 and others added 19 commits January 31, 2023 22:23
This adds a pass that provides some debug info with which basic line number info can be generated.

Adapted from Flang's AddDebugFoundationPass.

PiperOrigin-RevId: 506213461
PiperOrigin-RevId: 506217195
PiperOrigin-RevId: 506225078
This fixes double-free errors and memory leaks, for example when running the HLO fails.

The old code path is also retained, as a lot of our code depends on the ability to run the same HLO multiple times without reallocating the input buffers.

PiperOrigin-RevId: 506238363
PiperOrigin-RevId: 506239156
PiperOrigin-RevId: 506243978
PiperOrigin-RevId: 506252805
@i-chaochen

retest Ubuntu-GPU-single please
retest Ubuntu-GPU-multi please
retest Ubuntu-CPU please
retest Ubuntu-sanity please

@jayfurmanek

retest Ubuntu-sanity please

@i-chaochen

retest Ubuntu-GPU-single please

@i-chaochen merged commit a61751c into develop-upstream on Feb 3, 2023