Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

automation: only read complete lines before trying to deserialize #15778

Merged
merged 4 commits into from
Mar 26, 2024

Conversation

tgummerer
Copy link
Collaborator

@tgummerer tgummerer commented Mar 25, 2024

When tailing the event log in automation API we currently have nothing that makes sure we read only complete lines. This means if the OS happens to flush an incomplete line for whatever reason (or the Go JSON encoder does, which we're using to write these lines), we might read a line that is incompletely written, and thus will fail to JSON decode it.

Since the JSON encoder always writes a newline at the end of each string, we can also make sure that the line we read ends with a newline and otherwise wait for the rest of the line to be written.

The library we use in Go provides a convenient setting for this, while in python and nodejs we need to add some code to do this ourselves.

Fixes #15235
Fixes #15652
Fixes #9269 (This is closed already, but never had a proper resolution afaics)
Fixes #6768

It would be nice to add a typescript test here as well, but I'm not sure how to do that without marking the readLines function non-private. But I don't know typescript well, so any hints of how to do that would be appreciated!

Checklist

  • I have run make tidy to update any new dependencies
  • I have run make lint to verify my code passes the lint check
    • I have formatted my code using gofumpt
  • I have added tests that prove my fix is effective or that my feature works
  • I have run make changelog and committed the changelog/pending/<file> documenting my change
  • Yes, there are changes in this PR that warrants bumping the Pulumi Cloud API version

When tailing the event log in automation API we currently have nothing
that makes sure we read only complete lines.  This means if the OS
happens to flush an incomplete line for whatever reason (or the Go
JSON encoder does, which we're using to write these lines), we might
read a line that is incompletely written, and thus will fail to JSON
decode it.

Since the JSON encoder always writes a newline at the end of each
string, we can also make sure that the line we read ends with a
newline and otherwise wait for the rest of the line to be written.

The library we use in Go provides a convenient setting for this, while
in python and nodejs we need to add some code to do this ourselves.
@tgummerer tgummerer requested a review from a team as a code owner March 25, 2024 17:59
@pulumi-bot
Copy link
Contributor

pulumi-bot commented Mar 25, 2024

Changelog

[uncommitted] (2024-03-26)

Bug Fixes

  • [auto/{go,nodejs,python}] Make sure to read complete lines before trying to deserialize them as engine events
    #15778

@Frassle
Copy link
Member

Frassle commented Mar 25, 2024

Guess we need a similar fix in dotnet?

@tgummerer tgummerer force-pushed the tg/read-complete-lines-only branch 3 times, most recently from 8147684 to 2490263 Compare March 26, 2024 08:59
@tgummerer
Copy link
Collaborator Author

Guess we need a similar fix in dotnet?

Yes probably, good call! I'll look into that.

tgummerer added a commit to pulumi/pulumi-dotnet that referenced this pull request Mar 26, 2024
This is the equivalent as pulumi/pulumi#15778
for dotnet.  When tailing the event log we currently don't make sure
that we read complete lines.  Lines can be flushed in the middle by
the OS while they are being written down, leading to incomplete JSON.

The JSON encoder always ends each line with a newline, so we can
stitch the lines back together even if we occasionally read a partly
written line.
tgummerer added a commit to pulumi/pulumi-dotnet that referenced this pull request Mar 26, 2024
This is the equivalent as pulumi/pulumi#15778
for dotnet.  When tailing the event log we currently don't make sure
that we read complete lines.  Lines can be flushed in the middle by
the OS while they are being written down, leading to incomplete JSON.

The JSON encoder always ends each line with a newline, so we can
stitch the lines back together even if we occasionally read a partly
written line.
tgummerer added a commit to pulumi/pulumi-dotnet that referenced this pull request Mar 26, 2024
This is the equivalent as pulumi/pulumi#15778
for dotnet.  When tailing the event log we currently don't make sure
that we read complete lines.  Lines can be flushed in the middle by
the OS while they are being written down, leading to incomplete JSON.

The JSON encoder always ends each line with a newline, so we can
stitch the lines back together even if we occasionally read a partly
written line.
tgummerer added a commit to pulumi/pulumi-dotnet that referenced this pull request Mar 26, 2024
This is the equivalent as pulumi/pulumi#15778
for dotnet.  When tailing the event log we currently don't make sure
that we read complete lines.  Lines can be flushed in the middle by
the OS while they are being written down, leading to incomplete JSON.

The JSON encoder always ends each line with a newline, so we can
stitch the lines back together even if we occasionally read a partly
written line.
tgummerer added a commit to pulumi/pulumi-dotnet that referenced this pull request Mar 26, 2024
This is the equivalent as pulumi/pulumi#15778
for dotnet.  When tailing the event log we currently don't make sure
that we read complete lines.  Lines can be flushed in the middle by
the OS while they are being written down, leading to incomplete JSON.

The JSON encoder always ends each line with a newline, so we can
stitch the lines back together even if we occasionally read a partly
written line.
tgummerer added a commit to pulumi/pulumi-dotnet that referenced this pull request Mar 26, 2024
This is the equivalent as pulumi/pulumi#15778
for dotnet.  When tailing the event log we currently don't make sure
that we read complete lines.  Lines can be flushed in the middle by
the OS while they are being written down, leading to incomplete JSON.

The JSON encoder always ends each line with a newline, so we can
stitch the lines back together even if we occasionally read a partly
written line.
Comment on lines 126 to 136
let partialLine = "";
lineSplitter.on("line", (line) => {
let event: EngineEvent;
try {
event = JSON.parse(line);
line = partialLine + line;
partialLine = "";
event = JSON.parse(line);
callback(event);
} catch (e) {
log.warn(`Failed to parse engine event
If you're seeing this warning, please comment on https://github.com/pulumi/pulumi/issues/6768 with the event and any
details about your environment.

Event: ${line}\n${e.toString()}`);
partialLine += line;
return;
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unfortunately nodejs doesn't seem to include the newline character in the on line events, so we'll have to go off catching exceptions here.

@tgummerer
Copy link
Collaborator Author

pulumi/pulumi-dotnet#245 is the corresponding dotnet PR.

@tgummerer tgummerer added this pull request to the merge queue Mar 26, 2024
Merged via the queue into master with commit 1339f96 Mar 26, 2024
49 checks passed
@tgummerer tgummerer deleted the tg/read-complete-lines-only branch March 26, 2024 15:14
@justinvp
Copy link
Member

It would be nice to add a typescript test here as well, but I'm not sure how to do that without marking the readLines function non-private. But I don't know typescript well, so any hints of how to do that would be appreciated!

Unfortunately nodejs doesn't seem to include the newline character in the on line events, so we'll have to go off catching exceptions here.

Yeah, we should have a test for Node.js. It doesn't look like readLines needs to be a private member of the class -- I think we could move it out to be a top-level exported function marked /** @internal */ so it can be called from a test.

/** @internal */
export async function readLines(logPath: string, callback: (event: EngineEvent) => void): Promise<ReadlineResult> {
    // ...
}

github-merge-queue bot pushed a commit that referenced this pull request Mar 27, 2024
Tentative changelog:

### Features

- [docs] Implement constructor syntax examples for every resource in
typescript, python, csharp and go
  [#15624](#15624)

- [engine] Send output values with property dependency information to
transform functions
  [#15637](#15637)

- [engine] Add a --continue-on-error flag to pulumi destroy
  [#15727](#15727)

- [sdk/go] Make `property.Map` keyed by `string` not `MapKey`
  [#15767](#15767)

- [sdk/python] Improve the error message when depends_on is passed
objects of the wrong type
  [#15737](#15737)


### Bug Fixes

- [auto/{go,nodejs,python}] Make sure to read complete lines before
trying to deserialize them as engine events
  [#15778](#15778)

- [cli/plugin] Fix installing local language plugins on Windows
  [#15715](#15715)

- [engine] Don't delete stack outputs on failed deployments
  [#15754](#15754)

- [engine] Fix a panic when updating provider version in a run using
--target
  [#15716](#15716)

- [engine] Handle that Assets & Archives can be returned from providers
without content.
  [#15736](#15736)

- [engine] Fix the engine trying to delete a protected resource caught
in a replace chain
  [#15776](#15776)

- [sdkgen/docs] Add missing newline for `Coming soon!`
  [#15783](#15783)

- [programgen/dotnet] Fix generated code for a list of resources used in
resource option DependsOn
  [#15773](#15773)

- [programgen/{dotnet,go}] Fixes emitted code for object expressions
assigned to properties of type Any
  [#15770](#15770)

- [sdk/go] Fix lookup of plugin and program dependencies when using Go
workspaces
  [#15743](#15743)

- [sdk/nodejs] Export automation.tag.TagMap type
  [#15774](#15774)

- [sdk/python] Wait only for pending outputs in the Python SDK, not all
pending asyncio tasks
  [#15744](#15744)


### Miscellaneous

- [sdk/nodejs] Reorganize function serialization tests
  [#15753](#15753)

- [sdk/nodejs] Move mockpackage tests to closure integration tests
  [#15757](#15757)
@justinvp justinvp mentioned this pull request Mar 27, 2024
github-merge-queue bot pushed a commit that referenced this pull request Mar 28, 2024
Tentative changelog:

### Features

- [docs] Implement constructor syntax examples for every resource in
typescript, python, csharp and go
  [#15624](#15624)

- [docs] Implement YAML constructor syntax examples in the docs
  [#15791](#15791)

- [engine] Send output values with property dependency information to
transform functions
  [#15637](#15637)

- [engine] Add a --continue-on-error flag to pulumi destroy
  [#15727](#15727)

- [sdk/go] Make `property.Map` keyed by `string` not `MapKey`
  [#15767](#15767)

- [sdk/nodejs] Make function serialization work with typescript 4 and 5
  [#15761](#15761)

- [sdk/python] Improve the error message when depends_on is passed
objects of the wrong type
  [#15737](#15737)


### Bug Fixes

- [auto/{go,nodejs,python}] Make sure to read complete lines before
trying to deserialize them as engine events
  [#15778](#15778)

- [cli/plugin] Fix installing local language plugins on Windows
  [#15715](#15715)

- [engine] Don't delete stack outputs on failed deployments
  [#15754](#15754)

- [engine] Fix a panic when updating provider version in a run using
--target
  [#15716](#15716)

- [engine] Handle that Assets & Archives can be returned from providers
without content.
  [#15736](#15736)

- [engine] Fix the engine trying to delete a protected resource caught
in a replace chain
  [#15776](#15776)

- [sdkgen/docs] Add missing newline for `Coming soon!`
  [#15783](#15783)

- [programgen/dotnet] Fix generated code for a list of resources used in
resource option DependsOn
  [#15773](#15773)

- [programgen/{dotnet,go}] Fixes emitted code for object expressions
assigned to properties of type Any
  [#15770](#15770)

- [sdk/go] Fix lookup of plugin and program dependencies when using Go
workspaces
  [#15743](#15743)

- [sdk/nodejs] Export automation.tag.TagMap type
  [#15774](#15774)

- [sdk/python] Wait only for pending outputs in the Python SDK, not all
pending asyncio tasks
  [#15744](#15744)


### Miscellaneous

- [sdk/nodejs] Reorganize function serialization tests
  [#15753](#15753)

- [sdk/nodejs] Move mockpackage tests to closure integration tests
  [#15757](#15757)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment