
When using stream output, chunks can be emitted out of order #697

Closed
sennoy11012 opened this issue May 9, 2024 · 0 comments

Comments


Bug Description

When we use streaming output, code fragments in the response can arrive out of order.

Reason: chatCompletionStream in class OpenAiApi:
public Flux<ChatCompletionChunk> chatCompletionStream(ChatCompletionRequest chatRequest) {
    Assert.notNull(chatRequest, "The request body can not be null.");
    Assert.isTrue(chatRequest.stream(), "Request must set the stream property to true.");
    AtomicBoolean isInsideTool = new AtomicBoolean(false);
    log.debug("openai.chat-request:{}", JsonUtils.toJson(chatRequest));
    return this.webClient.post().uri("/v1/chat/completions")
            .body(Mono.just(chatRequest), ChatCompletionRequest.class)
            .retrieve()
            .bodyToFlux(String.class)
            .takeUntil(SSE_DONE_PREDICATE)
            .filter(SSE_DONE_PREDICATE.negate())
            .map(content -> ModelOptionsUtils.jsonToObject(content, ChatCompletionChunk.class))
            .map(chunk -> {
                if (this.chunkMerger.isStreamingToolFunctionCall(chunk)) {
                    isInsideTool.set(true);
                }
                return chunk;
            })
            .windowUntil(chunk -> {
                if (isInsideTool.get() && this.chunkMerger.isStreamingToolFunctionCallFinish(chunk)) {
                    isInsideTool.set(false);
                    return true;
                }
                return !isInsideTool.get();
            })
            .concatMapIterable(window -> {
                Mono<ChatCompletionChunk> monoChunk = window.reduce(
                        new ChatCompletionChunk((String) null, (List) null, (Long) null,
                                (String) null, (String) null, (String) null),
                        (previous, current) -> this.chunkMerger.merge(previous, current));
                return List.of(monoChunk);
            })
            // flatMap subscribes to the inner Monos eagerly and emits results
            // in completion order, so chunk ordering is not guaranteed here.
            .flatMap(mono -> mono);
}

In the final flatMap, the inner Monos are subscribed to eagerly and their results are emitted in completion order, not subscription order. When chunks arrive quickly, the merged windows can therefore be emitted out of order; an order-preserving operator such as concatMap (or flatMapSequential) would avoid this.
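The ordering hazard can be sketched in plain Java without Reactor. Here CompletableFuture stands in for the per-window Monos (this is an illustrative analogy, not the issue's actual code): an eager, completion-ordered merge behaves like flatMap, while awaiting each result in subscription order behaves like concatMap.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

public class ChunkOrderDemo {

    // Hypothetical stand-in for the per-window reduce(): an async "merge"
    // whose completion time varies per chunk.
    static CompletableFuture<String> mergeAsync(
            String chunk, long delayMs, ScheduledExecutorService scheduler) {
        CompletableFuture<String> f = new CompletableFuture<>();
        scheduler.schedule(() -> f.complete(chunk), delayMs, TimeUnit.MILLISECONDS);
        return f;
    }

    // flatMap-like: results are collected in COMPLETION order, so a slow
    // earlier chunk is overtaken by a fast later one.
    static List<String> eagerMerge(List<CompletableFuture<String>> futures) throws Exception {
        List<String> out = new CopyOnWriteArrayList<>();
        CompletableFuture.allOf(
                futures.stream()
                        .map(f -> f.thenAccept(out::add))
                        .toArray(CompletableFuture[]::new))
                .get();
        return new ArrayList<>(out);
    }

    // concatMap-like: await each result in SUBSCRIPTION order, so the
    // original chunk order is preserved even if later chunks finish first.
    static List<String> sequentialMerge(List<CompletableFuture<String>> futures) throws Exception {
        List<String> out = new ArrayList<>();
        for (CompletableFuture<String> f : futures) {
            out.add(f.get());
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        ScheduledExecutorService scheduler = Executors.newScheduledThreadPool(2);
        try {
            // chunk1 takes longer to merge than chunk2.
            List<CompletableFuture<String>> a = List.of(
                    mergeAsync("chunk1", 200, scheduler),
                    mergeAsync("chunk2", 50, scheduler));
            System.out.println("flatMap-like (eager):     " + eagerMerge(a));

            List<CompletableFuture<String>> b = List.of(
                    mergeAsync("chunk1", 200, scheduler),
                    mergeAsync("chunk2", 50, scheduler));
            System.out.println("concatMap-like (ordered): " + sequentialMerge(b));
        } finally {
            scheduler.shutdown();
        }
    }
}
```

With these delays the eager merge typically emits [chunk2, chunk1] while the sequential merge emits [chunk1, chunk2], which is the same reordering the issue describes under fast streaming.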

Environment
JDK17

Spring 3.2.0

Spring ai 1.0.0-SNAPSHOT

Service deployment in Singapore

Steps to reproduce

Call chatCompletionStream in the OpenAiApi class to stream response data.

Expected behavior

Chunks are emitted in the order they were received, with no reordering.
