Skip to content

Commit

Permalink
Remove inline default transposeHelper and ensure we use the proper ch…
Browse files Browse the repository at this point in the history
…eck via CanUse_hipBlasTransposeHelper_MLFloat16
  • Loading branch information
Ted Themistokleous committed Jan 29, 2025
1 parent 147465f commit ba9c603
Showing 1 changed file with 0 additions and 1 deletion.
1 change: 0 additions & 1 deletion onnxruntime/core/providers/rocm/shared_inc/fpgeneric.h
Original file line number Diff line number Diff line change
Expand Up @@ -501,7 +501,6 @@ inline hipblasStatus_t hipblasTransposeHelper(hipStream_t /*stream*/, hipblasHan
return hipblasDgeam(handle, transa, transb, m, n, alpha, A, lda, beta, B, ldb, C, ldc);
}

inline bool CanUse_hipblasTransposeHelper_MLFloat16(int /*m*/, int /*n*/) { return true; } // CUDA has a limited grid size of 65536, ROCm has higher limits.
hipblasStatus_t hipblasTransposeHelper(hipStream_t stream, hipblasHandle_t, hipblasOperation_t, hipblasOperation_t, int m, int n, const half*, const half* A, int, const half*, const half*, int, half* C, int);

// copy
Expand Down

0 comments on commit ba9c603

Please sign in to comment.