automation: only read complete lines before trying to deserialize #15778

tgummerer · 2024-03-25T17:59:38Z

When tailing the event log in the automation API, we currently have nothing that makes sure we read only complete lines. This means that if the OS happens to flush an incomplete line for whatever reason (or the Go JSON encoder, which we use to write these lines, does), we might read a partially written line and then fail to JSON-decode it.

Since the JSON encoder always writes a newline at the end of each string, we can also make sure that the line we read ends with a newline and otherwise wait for the rest of the line to be written.

The library we use in Go provides a convenient setting for this, while in python and nodejs we need to add some code to do this ourselves.
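
The newline check described above can be sketched in TypeScript roughly as follows. This is an illustration only, not the code from this PR: `CompleteLineBuffer` and its `push` method are hypothetical names, and the sketch assumes the log content arrives as arbitrary string chunks.

```typescript
// Hypothetical helper, not part of the Pulumi SDK: buffers raw chunks and
// only hands back lines that are terminated by a newline.
class CompleteLineBuffer {
    // Text received so far that has not yet been terminated by "\n".
    private pending = "";

    // Append a chunk and return every complete line it finishes.
    push(chunk: string): string[] {
        this.pending += chunk;
        const parts = this.pending.split("\n");
        // The last element is whatever follows the final newline
        // (an empty string if the chunk ended exactly on a newline).
        this.pending = parts.pop() ?? "";
        return parts;
    }
}

const buffer = new CompleteLineBuffer();
// The second event is cut off mid-line, so only the first one is returned.
const firstBatch = buffer.push('{"sequence":1}\n{"seq');
// The rest of the second event arrives, newline included.
const secondBatch = buffer.push('uence":2}\n');
```

Each returned line can then be passed to `JSON.parse`, since a line is only released once its terminating newline has been seen.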

Fixes #15235
Fixes #15652
Fixes #9269 (This is closed already, but never had a proper resolution afaics)
Fixes #6768

It would be nice to add a typescript test here as well, but I'm not sure how to do that without marking the readLines function non-private. But I don't know typescript well, so any hints of how to do that would be appreciated!

Checklist

- I have run make tidy to update any new dependencies
- I have run make lint to verify my code passes the lint check
  - I have formatted my code using gofumpt
- I have added tests that prove my fix is effective or that my feature works
- I have run make changelog and committed the changelog/pending/<file> documenting my change
- Yes, there are changes in this PR that warrant bumping the Pulumi Cloud API version


pulumi-bot · 2024-03-25T18:08:57Z

Changelog

[uncommitted] (2024-03-26)

Bug Fixes

[auto/{go,nodejs,python}] Make sure to read complete lines before trying to deserialize them as engine events
#15778

Frassle · 2024-03-25T18:20:52Z

Guess we need a similar fix in dotnet?

tgummerer · 2024-03-26T09:22:58Z

Guess we need a similar fix in dotnet?

Yes probably, good call! I'll look into that.

This is the equivalent of pulumi/pulumi#15778 for dotnet. When tailing the event log we currently don't make sure that we read complete lines. Lines can be flushed by the OS while they are still being written, leading to incomplete JSON. The JSON encoder always ends each line with a newline, so we can stitch the lines back together even if we occasionally read a partly written line.

tgummerer · 2024-03-26T13:34:26Z

sdk/nodejs/automation/stack.ts

+        let partialLine = "";
        lineSplitter.on("line", (line) => {
            let event: EngineEvent;
            try {
-                event = JSON.parse(line);
+                line = partialLine + line;
+                partialLine = "";
+                event = JSON.parse(line);
                callback(event);
            } catch (e) {
-                log.warn(`Failed to parse engine event
-If you're seeing this warning, please comment on https://github.com/pulumi/pulumi/issues/6768 with the event and any
-details about your environment.
-
-Event: ${line}\n${e.toString()}`);
+                partialLine += line;
+                return;


Unfortunately nodejs doesn't seem to include the newline character in the on line events, so we'll have to go off catching exceptions here.
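
The exception-based stitching in the diff above can be sketched as a standalone handler. This is a minimal sketch and not the exact code that was merged: `makeLineHandler` and `EngineEventLike` are hypothetical names, and it assumes every event is a JSON object written on a single line.

```typescript
// Stand-in for the SDK's EngineEvent type (assumption for this sketch).
type EngineEventLike = Record<string, unknown>;

// Returns a handler suitable for a readline "line" listener. Lines that fail
// to parse are treated as the start of a partially written event and are
// prepended to the next line.
function makeLineHandler(callback: (event: EngineEventLike) => void): (line: string) => void {
    let partialLine = "";
    return (line: string): void => {
        const candidate = partialLine + line;
        let event: EngineEventLike;
        try {
            event = JSON.parse(candidate);
        } catch {
            // Most likely an incomplete line: keep it and wait for the rest.
            partialLine = candidate;
            return;
        }
        partialLine = "";
        callback(event);
    };
}

const received: EngineEventLike[] = [];
const handleLine = makeLineHandler((event) => received.push(event));
handleLine('{"sequence":');   // incomplete, buffered
handleLine('1}');             // completes the first event
handleLine('{"sequence":2}'); // complete on its own
```

Unlike the diff, this sketch calls `callback` outside the `try` block so that an exception thrown by the callback is not mistaken for a partial line. That is a design choice of the sketch, not something the PR states.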

tgummerer · 2024-03-26T14:31:47Z

pulumi/pulumi-dotnet#245 is the corresponding dotnet PR.

justinvp · 2024-03-26T23:13:33Z

It would be nice to add a typescript test here as well, but I'm not sure how to do that without marking the readLines function non-private. But I don't know typescript well, so any hints of how to do that would be appreciated!

Unfortunately nodejs doesn't seem to include the newline character in the on line events, so we'll have to go off catching exceptions here.

Yeah, we should have a test for Node.js. It doesn't look like readLines needs to be a private member of the class -- I think we could move it out to be a top-level exported function marked /** @internal */ so it can be called from a test.

/** @internal */
export async function readLines(logPath: string, callback: (event: EngineEvent) => void): Promise<ReadlineResult> {
    // ...
}


Tentative changelog:

### Features

- [docs] Implement constructor syntax examples for every resource in typescript, python, csharp and go [#15624](#15624)
- [docs] Implement YAML constructor syntax examples in the docs [#15791](#15791)
- [engine] Send output values with property dependency information to transform functions [#15637](#15637)
- [engine] Add a --continue-on-error flag to pulumi destroy [#15727](#15727)
- [sdk/go] Make `property.Map` keyed by `string` not `MapKey` [#15767](#15767)
- [sdk/nodejs] Make function serialization work with typescript 4 and 5 [#15761](#15761)
- [sdk/python] Improve the error message when depends_on is passed objects of the wrong type [#15737](#15737)

### Bug Fixes

- [auto/{go,nodejs,python}] Make sure to read complete lines before trying to deserialize them as engine events [#15778](#15778)
- [cli/plugin] Fix installing local language plugins on Windows [#15715](#15715)
- [engine] Don't delete stack outputs on failed deployments [#15754](#15754)
- [engine] Fix a panic when updating provider version in a run using --target [#15716](#15716)
- [engine] Handle that Assets & Archives can be returned from providers without content. [#15736](#15736)
- [engine] Fix the engine trying to delete a protected resource caught in a replace chain [#15776](#15776)
- [sdkgen/docs] Add missing newline for `Coming soon!` [#15783](#15783)
- [programgen/dotnet] Fix generated code for a list of resources used in resource option DependsOn [#15773](#15773)
- [programgen/{dotnet,go}] Fixes emitted code for object expressions assigned to properties of type Any [#15770](#15770)
- [sdk/go] Fix lookup of plugin and program dependencies when using Go workspaces [#15743](#15743)
- [sdk/nodejs] Export automation.tag.TagMap type [#15774](#15774)
- [sdk/python] Wait only for pending outputs in the Python SDK, not all pending asyncio tasks [#15744](#15744)

### Miscellaneous

- [sdk/nodejs] Reorganize function serialization tests [#15753](#15753)
- [sdk/nodejs] Move mockpackage tests to closure integration tests [#15757](#15757)

tgummerer requested a review from a team as a code owner March 25, 2024 17:59

Frassle approved these changes Mar 25, 2024


tgummerer force-pushed the tg/read-complete-lines-only branch 3 times, most recently from 8147684 to 2490263 Compare March 26, 2024 08:59

run make tidy

2ffb399

tgummerer force-pushed the tg/read-complete-lines-only branch from 2490263 to 2ffb399 Compare March 26, 2024 09:21

tgummerer mentioned this pull request Mar 26, 2024

automation: always read complete lines pulumi/pulumi-dotnet#245

Closed

fix python test

f58a699

tgummerer commented Mar 26, 2024


make nodejs work

954247d

tgummerer force-pushed the tg/read-complete-lines-only branch from fac105a to 954247d Compare March 26, 2024 14:09

tgummerer enabled auto-merge March 26, 2024 14:31

tgummerer added this pull request to the merge queue Mar 26, 2024

Merged via the queue into master with commit 1339f96 Mar 26, 2024
49 checks passed

tgummerer deleted the tg/read-complete-lines-only branch March 26, 2024 15:14

justinvp mentioned this pull request Mar 27, 2024

Prepare for v3.112.0 release #15794

Merged

justinvp mentioned this pull request Mar 27, 2024

Freeze v3.112.0 #15799

Merged

MJLongstreth-tiq mentioned this pull request Apr 11, 2024

v3.112.0: Vuln packages #15918

Closed
