"Unexpected end of JSON input" when streaming on edge environments (Vercel Edge, Cloudflare Workers) #292

Closed
venables opened this issue Feb 14, 2024 · 16 comments


@venables

The SDK seems to operate fine in a Node.js environment, but when running in an Edge runtime (a browser-like environment) such as Vercel Edge or Cloudflare Workers, streaming gets cut off with the following exception:

Could not parse message into JSON: 
From chunk: [ 'event: content_block_delta' ]

SyntaxError: Unexpected end of JSON input
    at (node_modules/@anthropic-ai/sdk/streaming.mjs:58:39)
    at (app/api/test/route.js:15:19)
    at (node_modules/next/dist/esm/server/future/route-modules/app-route/module.js:189:36)
    at (node_modules/next/dist/esm/server/future/route-modules/app-route/module.js:128:25)
    at (node_modules/next/dist/esm/server/future/route-modules/app-route/module.js:251:29)
    at (node_modules/next/dist/esm/server/web/edge-route-module-wrapper.js:81:20)
    at (node_modules/next/dist/esm/server/web/adapter.js:157:15)

The error is coming from this block: https://github.com/anthropics/anthropic-sdk-typescript/blob/main/src/streaming.ts#L69-L84

The line content is:

{
  event: 'content_block_delta',
  data: '',
  raw: [ 'event: content_block_delta' ]
}

Since the data is an empty string, the JSON parsing blows up. I can bypass this error if I modify the code to ignore empty strings, but that does not seem ideal.
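
For reference, this is roughly the workaround I mean (a minimal sketch, not the SDK's actual code; SSEMessage and parseEvents are hypothetical stand-ins for the decoder internals, and the data shape matches the logged object above):

// Sketch: skip SSE messages whose data payload is empty instead of
// letting JSON.parse throw on ''.
interface SSEMessage {
  event: string | null;
  data: string;
  raw: string[];
}

function* parseEvents(messages: Iterable<SSEMessage>): Generator<unknown> {
  for (const sse of messages) {
    if (sse.data.trim() === '') continue; // e.g. { event: 'content_block_delta', data: '' }
    yield JSON.parse(sse.data);
  }
}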

Reproduction repos:

I put the Streaming example from the Anthropic SDK README into both a Vercel Edge function and a Cloudflare Workers function, with the same failing result.

Note: the error occurs whether we use import "@anthropic-ai/sdk/shims/web"; or not.

Vercel Edge:

I've put together a sample repo using create-next-app and the example from your README: https://github.com/venables/anthropic-edge-stream-error

The file in question is app/api/test/route.ts. If you remove export const runtime = "edge", it works as expected.

This error will not occur locally, since the local environment is Node.js, but when you deploy to Vercel (with runtime = "edge" still in the code), you will consistently get the error.
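
For context, the route is essentially the README streaming example wrapped in a Next.js route handler; here is a minimal sketch of its shape (model, prompt, and response plumbing are illustrative, not the exact repo contents):

// app/api/test/route.ts (illustrative sketch)
import Anthropic from "@anthropic-ai/sdk";

export const runtime = "edge"; // removing this line avoids the error

export async function GET(): Promise<Response> {
  const client = new Anthropic({ apiKey: process.env.ANTHROPIC_API_KEY });

  const stream = await client.messages.create({
    model: "claude-3-opus-20240229",
    max_tokens: 1024,
    messages: [{ role: "user", content: "Say hello" }],
    stream: true,
  });

  const encoder = new TextEncoder();
  const body = new ReadableStream<Uint8Array>({
    async start(controller) {
      // Forward only the text deltas to the client.
      for await (const event of stream) {
        if (event.type === "content_block_delta" && event.delta.type === "text_delta") {
          controller.enqueue(encoder.encode(event.delta.text));
        }
      }
      controller.close();
    },
  });

  return new Response(body, { headers: { "Content-Type": "text/plain" } });
}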

Cloudflare Workers

If you want to reproduce this locally, you can do so using Wrangler and Cloudflare Workers, which spin up a real edge-like environment on your machine.

I created a sample repository here, using Hono as the router: https://github.com/venables/anthropic-stream-error-cf

The file in question here is src/index.ts

Running that locally and hitting the endpoint will fail.

@rattrayalex
Collaborator

Thanks for reporting!

cc @RobertCraigie can you take a look at this?

@izuchukwu

izuchukwu commented Mar 5, 2024

Hi y'all! @rattrayalex @RobertCraigie We're experiencing this same issue in Node.js environments as well. We've confirmed it in both Node.js and Bun after migrating to the Messages API today to adopt Claude 3. We don't experience this with the legacy Text Completions streaming API.

Update! It was not exactly the same issue. Instead, the new .stream API consistently fails for us with this same error, but we found we can use the old .create({stream: true, ...}) API with the new Messages API, and this works fine on Node.js & Bun. So if .stream fails for you in Node environments, .create({stream: true, ...}) still works. Hope this helps someone else!
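
To make the distinction concrete, here is a minimal sketch of the two call styles (model and prompt are placeholders, not our actual values):

import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic();

async function demo() {
  // The .stream() helper, which was consistently failing for us:
  const helper = client.messages.stream({
    model: "claude-3-opus-20240229",
    max_tokens: 1024,
    messages: [{ role: "user", content: "Hello" }],
  });
  helper.on("text", (text) => process.stdout.write(text));
  await helper.finalMessage();

  // The raw .create({ stream: true }) form, which worked for us on Node.js & Bun:
  const raw = await client.messages.create({
    model: "claude-3-opus-20240229",
    max_tokens: 1024,
    messages: [{ role: "user", content: "Hello" }],
    stream: true,
  });
  for await (const event of raw) {
    if (event.type === "content_block_delta" && event.delta.type === "text_delta") {
      process.stdout.write(event.delta.text);
    }
  }
}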

@rattrayalex
Collaborator

rattrayalex commented Mar 5, 2024

Thank you for the update @izuchukwu ! We'll take a look at it shortly!

Could you share a script, ideally including prompt, that reproduces the problem you're seeing?

EDIT: we were not able to reproduce this locally.

@izuchukwu

izuchukwu commented Mar 5, 2024

Hi, unfortunately, we're now running into this issue with .create({stream: true, ...}) as well.

It's hard for me to identify the prompt because it happens when we run multiple large prompts in parallel. I can confirm it is the same problem - sse.event is either content_block_delta or content_block_start, but sse.data is an empty string, not a JSON object, so the yield JSON.parse(sse.data) line throws an error.

We consistently see the error, but exactly when we see it is very inconsistent, presumably because it's dependent on the server sending an empty string. We're able to run small prompts just fine. The prompts that trigger it write multiple paragraphs (~5) before tripping the error.

Here's a snippet. It's embedded in a library, so most options are passed as variables.

const stream = await client.messages.create({
    messages,
    model,
    max_tokens: 4096,
    temperature: options?.temp,
    top_p: options?.topP,
    system,
    stream: true
})

for await (const event of stream) {
  if (event.type !== 'content_block_delta') continue
  const chunk = event.delta.text
  
  // Process stream
  const shouldContinue = await onComplete?.(chunk) // onComplete is an async callback function
  if (!_.isNil(shouldContinue) && !shouldContinue) {
    stream.controller.abort()
  }
}

@lawetis

lawetis commented Mar 5, 2024

Sorry to add to this, but I've encountered the same problem here as well.

@stephtr

stephtr commented Mar 5, 2024

As venables said, patching the streaming.js file (see below) keeps the stream from aborting. Interestingly, a few tokens then still go missing in the response. (edit: that also happens without the patch)

EDIT: for a fix, see the response from dzhng

@tvytlx

tvytlx commented Mar 5, 2024

Any update on this issue? @rattrayalex

@RobertCraigie
Collaborator

Hey, would it be possible for anyone to share a request ID for a request that failed in this way? Unfortunately, we can't reproduce it.

You can get a request ID by setting the DEBUG env var to true; in your logs you'll see something like this:

Anthropic:DEBUG:response 200 https://api.anthropic.com/v1/messages Headers {
  [Symbol(map)]: [Object: null prototype] {
    date: [ 'Tue, 05 Mar 2024 17:54:09 GMT' ],
    'content-type': [ 'text/event-stream; charset=utf-8' ],
    'transfer-encoding': [ 'chunked' ],
    connection: [ 'keep-alive' ],
    'cache-control': [ 'no-cache' ],
    'request-id': [ 'req_01Vr3DL4pMCDk2kNHujJkpwf' ],
    via: [ '1.1 google' ],
    'cf-cache-status': [ 'DYNAMIC' ],
    server: [ 'cloudflare' ],
    'cf-ray': [ '85fbf82b8be16aaa-MAN' ]
  }
}

In this case the request ID was req_01Vr3DL4pMCDk2kNHujJkpwf.

@nyacg

nyacg commented Mar 5, 2024

Unfortunately, when I enable debug logging on Vercel the headers aren't printed 😞

Anthropic:DEBUG:response 200 https://api.anthropic.com/v1/messages Headers { } ReadableStream { }

I also applied the suggested patch #292 (comment), which seems to reduce the error frequency but does not bring it down to zero.
Edit: I don't think the patch was being applied properly; it now is, and I'm getting dropped tokens.

@nyacg

nyacg commented Mar 5, 2024

Doing some debugging myself, it looks like a potential bug in the LineDecoder.

Here's an extract of the logs where an error occurs. I'm printing the output of this.decodeText(chunk); in the LineDecoder.

Note: logs go from bottom to top

ata: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"ky"}}
event: content_block_delta d
data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" with"}} event: content_block_delta data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" its"}} event: content_block_delta data: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":" smo"}}
ERROR in iterMessages, sse:  '{"event":"content_block_delta","data":"","raw":["event: content_block_delta"]}' 
event: content_block_delta
ata: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":","}}
event: content_block_delta d
ata: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"uda"}}
event: content_block_delta d
ata: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"\n\nGo"}}

We get the output "Gouda, its smoky", i.e. the " with" delta is dropped.

Generally, the chunks that LineDecoder.decode gets fed are either:

  1. event: content_block_delta d
    then
  2. ata: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"<the delta>"}}

However, an error occurs when the first chunk is just event: content_block_delta (possibly with a trailing space). This leads to an SSE with sse.data set to an empty string, which is what throws the error. If we just continue instead of throwing, we then get the next SSE, which carries the delta.

@dzhng

dzhng commented Mar 6, 2024

OK, I fixed it. @nyacg was on the right track; the issue is in the LineDecoder class in streaming.ts.

Specifically, it's because the decode() method is directly ported from this Python implementation, but it missed an important behavioral difference between JS's split() method and Python's splitlines() method.

The error happens when decode() receives any input that ends in a newline, e.g.:
event: content_block_delta\r\n OR \r\n

In both of these cases, JS's split() method will add an extra empty string to the end of the lines array, whereas Python's splitlines() method will not. This causes empty lines to be passed through to the SSE decoding layer, which is what triggers this issue.

This is also why the previous patch doesn't work: it dropped tokens because the extra empty line caused whole data packets to be ignored.

It's got nothing to do with the edge environment; I suspect some network configuration on edge causes SSE packets to be smaller, making this issue more noticeable.
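
A quick way to see the difference in plain Node.js (the regexp below approximates the SDK's newline splitting; the Python results are shown as comments for comparison):

// JS: a trailing newline yields an extra empty element.
console.log("event: content_block_delta\r\n".split(/\r\n|[\n\r]/g));
// -> [ 'event: content_block_delta', '' ]
console.log("\r\n".split(/\r\n|[\n\r]/g));
// -> [ '', '' ]

// Python, by contrast:
//   "event: content_block_delta\r\n".splitlines()  ->  ['event: content_block_delta']
//   "\r\n".splitlines()                            ->  ['']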

For now, you can patch the package really easily:

--- a/streaming.js
+++ b/streaming.js
@@ -266,6 +266,9 @@ class LineDecoder {
         }
         const trailingNewline = LineDecoder.NEWLINE_CHARS.has(text[text.length - 1] || '');
         let lines = text.split(LineDecoder.NEWLINE_REGEXP);
+        if (trailingNewline) {
+            lines.pop();
+        }
         if (lines.length === 1 && !trailingNewline) {
             this.buffer.push(lines[0]);
             return [];

This accounts for the different split() behavior and aligns it with Python's splitlines() behavior.
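
If you manage patches with patch-package (an assumption about your setup, not something this thread requires), the flow is roughly: edit node_modules/@anthropic-ai/sdk/streaming.js as shown above, then run npx patch-package @anthropic-ai/sdk and commit the generated patch file so it is re-applied on install.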

I already created a patch for my llm-api lib if anyone wants the patch files: Commit for patch

@RobertCraigie
Collaborator

Ahh thank you so much for the detailed investigation and proposed patch @dzhng! We'll test and port this over to our side ASAP.

@rattrayalex
Collaborator

rattrayalex commented Mar 6, 2024

Fixed in #312, which should be released shortly.

@RobertCraigie
Collaborator

This fix was released in v0.17.0!

@izuchukwu

Amazing turnaround time on this, thank you both @rattrayalex & @RobertCraigie

@rattrayalex
Collaborator

Thank you for the details, help, and patience @izuchukwu , @dzhng , @nyacg , @venables , and others!
