perf(embedding): always request embedding creation as base64 #1312

manekinekko · 2025-02-08T17:09:04Z

Requesting base64 encoded embeddings returns smaller body sizes, on average ~60% smaller than float32 encoded. In other words, the size of the response body containing embeddings in float32 is ~2.3x bigger than base64 encoded embedding.

Closes #1310

I understand that this repository is auto-generated and my pull request may not be merged

Changes being requested

We always request embedding creating encoded as base64, and then decoded them to float32 based on the user's provided encoding_format parameter.

Additional context & links

After running a few benchmarks, requesting base64 encoded embeddings returns smaller body sizes, on average ~60% smaller than float32 encoded. In other words, the size of the response body containing embeddings in float32 is ~2.3x bigger than base64 encoded embedding.

This performance improvement could translate to:

✅ Faster HTTP responses
✅ Less bandwidth used when generating multiple embeddings

This is the result of a request that creates embedding from a 10kb chunk, run 10 times (the number are the size of response body in kb):

Benchmark	Min (ms)	Max (ms)	Mean (ms)	Min (+)	Max (+)	Mean (+)
float32 vs base64	41.742	19616.000	9848.819	40.094 (3.9%)	8351.000 (57.4%)	4206.126 (57.3%)

Read more #1310

Requesting base64 encoded embeddings returns smaller body sizes, on average ~60% smaller than float32 encoded. In other words, the size of the response body containing embeddings in float32 is ~2.3x bigger than base64 encoded embedding. We always request embedding creating encoded as base64, and then decoded them to float32 based on the user's provided encoding_format parameter. Closes openai#1310

RobertCraigie

Thanks!

RobertCraigie · 2025-02-11T11:14:48Z

src/resources/embeddings.ts

+        // Force base64 encoding for vector embeddings creation
+        // See https://github.com/openai/openai-node/issues/1310
+        encoding_format: 'base64',


I don't think we want to always use base64, if the user explicitly asked for a different format we should have the exact same behaviour as we do prior to this PR which is to just let them.

RobertCraigie · 2025-02-11T11:15:23Z

src/resources/embeddings.ts

+            console.log(embeddingBase64Obj);
+            const embeddingBase64Str = embeddingBase64Obj.embedding as unknown as string;
+            embeddingBase64Obj.embedding = Array.from(
+              new Float32Array(Buffer.from(embeddingBase64Str, 'base64').buffer),


Buffer is a Node.js specific API, we need to use something that is available everywhere, would be great to add a generic helper function in core.ts.

RobertCraigie · 2025-02-11T11:15:37Z

src/resources/embeddings.ts

+      return base64Response._thenUnwrap((response) => {
+        if (response && response.data) {
+          response.data.forEach((embeddingBase64Obj) => {
+            console.log(embeddingBase64Obj);


nit

Suggested change

console.log(embeddingBase64Obj);

manekinekko requested a review from a team as a code owner February 8, 2025 17:09

manekinekko force-pushed the perf/wassim-chegham-issue-1310 branch from 7702d54 to 270861b Compare February 8, 2025 17:09

manekinekko mentioned this pull request Feb 8, 2025

Perf: Improve vector embeddings creation by 60% #1310

Open

1 task

RobertCraigie requested changes Feb 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(embedding): always request embedding creation as base64 #1312

perf(embedding): always request embedding creation as base64 #1312

manekinekko commented Feb 8, 2025

RobertCraigie left a comment

RobertCraigie Feb 11, 2025

RobertCraigie Feb 11, 2025

RobertCraigie Feb 11, 2025

perf(embedding): always request embedding creation as base64 #1312

Are you sure you want to change the base?

perf(embedding): always request embedding creation as base64 #1312

Conversation

manekinekko commented Feb 8, 2025

Changes being requested

Additional context & links

RobertCraigie left a comment

Choose a reason for hiding this comment

RobertCraigie Feb 11, 2025

Choose a reason for hiding this comment

RobertCraigie Feb 11, 2025

Choose a reason for hiding this comment

RobertCraigie Feb 11, 2025

Choose a reason for hiding this comment