RFC: Remove FAST_FLOAT, TFLOAT #4385

amitdo · 2025-01-24T13:59:49Z

We should unconditionally use 32-bit float.

stweil · 2025-01-24T14:46:52Z

I already started experiments with even smaller float data types (like they are used in GPUs) because this would accelerate the training. Up to now my experiments were not successful, but who knows, this might change in the future with better compiler support and the right libraries. Therefore knowing the code locations and having a special data type is still very helpful.

But I also don't think that anybody still has the need for the old double implementation.

amitdo · 2025-01-26T12:41:06Z

But I also don't think that anybody still has the need for the old double implementation.

My suggestion:

In cmake/autotools,FAST_FLOAT should always be defined (remove or comment out the televant config code).
Remove all the double (64-bit) dot product code.

amitdo added the RFC label Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Remove FAST_FLOAT, TFLOAT #4385

RFC: Remove FAST_FLOAT, TFLOAT #4385

amitdo commented Jan 24, 2025 •

edited

Loading

stweil commented Jan 24, 2025

amitdo commented Jan 26, 2025 •

edited

Loading

RFC: Remove FAST_FLOAT, TFLOAT #4385

RFC: Remove FAST_FLOAT, TFLOAT #4385

Comments

amitdo commented Jan 24, 2025 • edited Loading

stweil commented Jan 24, 2025

amitdo commented Jan 26, 2025 • edited Loading

amitdo commented Jan 24, 2025 •

edited

Loading

amitdo commented Jan 26, 2025 •

edited

Loading