Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MaxFlops reports crazy numbers for Apple M1 #81

Open
jeffhammond opened this issue Nov 7, 2022 · 1 comment
Open

MaxFlops reports crazy numbers for Apple M1 #81

jeffhammond opened this issue Nov 7, 2022 · 1 comment

Comments

@jeffhammond
Copy link

The MaxFlops test reports >100 petaflop/s FP32 for Apple M1, which false suggest that my M1 Air laptop is the second most powerful supercomputer in Europe.

Disabling compiler optimizations (below) makes the problem smaller but does not fix it, as the results are still 4-17 teraflop/s FP32, which is greater than the performance claimed by Apple or reported by third-parties.

static const char* opts = "-cl-mad-enable -cl-no-signed-zeros -cl-opt-disable";
                          //"-cl-unsafe-math-optimizations -cl-finite-math-only";

It would seem that the benchmark should be modified to prevent compilers from removing large portions of the computation.

@jyoung3131
Copy link
Contributor

Hi Jeff - I worked on this benchmark back in the day (2014?) and a large part of our efforts were focused on keeping the compiler from optimizing out computation, sometimes to the point of adding redundant and indirect references. It turns out that compilers are actually really good at optimizing "useless" code, so I'm not surprised that it is reporting erroneous results for the M1.

Realistically, some of the feeds and speeds (ie, BW tests) are still useful, but SHOC has not been fully updated in some time. As such, I'd be very wary of any results from the MaxFlops test unless/until a new version of the code is released.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants