
[BUG] Default CRON causes extremely high memory usage until Out of Memory exception #5312

Open
rboylesDev opened this issue Apr 30, 2024 · 8 comments
Labels: bug (Something isn't working)

@rboylesDev

Description

We are using Elsa 3 with the default scheduling. Whenever we have a Timer or CRON-triggered workflow, we notice a jump in memory usage each time the workflow is triggered, and this memory is never garbage collected. We are running Elsa on Azure Container Apps with 1 GB of RAM. With a CRON expression that runs every 15 minutes, the app survives for about two hours until the container crashes with an out-of-memory exception and is automatically restarted. We see the same memory growth locally, but with significantly more RAM it only becomes a problem after running for days.

Steps to Reproduce

Create a simple workflow with a Timer or CRON trigger, even one that just writes to the console. Let it run on its schedule and observe the memory allocations.
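
For reference, a repro along these lines might look roughly like the sketch below, written against Elsa 3's programmatic workflow API. This is an illustration rather than the reporter's actual workflow: the class name is invented, and the namespaces, the `CanStartWorkflow` flag and the cron expression format may need adjusting for the exact 3.x version and cron parser in use.

```csharp
using Elsa.Scheduling.Activities;
using Elsa.Workflows;
using Elsa.Workflows.Activities;

// Hypothetical repro workflow: a CRON trigger that fires every 15 minutes,
// followed by a single console write.
public class CronReproWorkflow : WorkflowBase
{
    protected override void Build(IWorkflowBuilder builder)
    {
        builder.Root = new Sequence
        {
            Activities =
            {
                new Cron
                {
                    CronExpression = new("*/15 * * * *"), // every 15 minutes
                    CanStartWorkflow = true               // mark the activity as a workflow trigger
                },
                new WriteLine("CRON fired")
            }
        };
    }
}
```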

Expected Behavior

While a memory jump isn't unexpected, having GC run and release memory is expected.

Actual Behavior

Memory appears to not be de-allocated.

Screenshots

Here is a sample of memory usage from our container app showing memory rising until the app crashes and restarts.
[screenshot]

Environment

  • Elsa Package Version: Specify the version. If you've cloned the repo, mention if you are using the latest source code from the main branch.
  • Operating System: Include your OS version (e.g., Windows 10, macOS Big Sur, Ubuntu 18.04).
  • Browser and Version: If applicable, include your browser and version (e.g., Chrome 92).

Troubleshooting Attempts

Initially we thought it might be the amount of work that the workflow is trying to do. But even a simple workflow of writing a console message or dispatching another workflow will still continue to allocate memory.

@rboylesDev rboylesDev added the bug Something isn't working label Apr 30, 2024

glime-ai bot commented Apr 30, 2024

Based on the context provided, it seems that you are experiencing a memory leak issue with Elsa 3 when using Timer or CRON-triggered workflows. This is a known issue that has been reported by other users as well, as seen in GitHub issue #4884. That issue describes a memory leak related to CodeActivity, which may be similar to what you are experiencing with Timer or CRON activities.

To address this issue, you can try the following steps:

  1. Update Elsa to the latest version: Ensure that you are using the latest version of Elsa, as the issue might have been addressed in a newer release.

  2. Monitor Garbage Collection (GC): Use tools like dotMemory to monitor memory allocations and GC. This can help you identify if the memory is indeed not being collected; a minimal logging sketch is included after this list.

  3. Implement IDisposable: If you have custom activities, ensure that they implement IDisposable and properly dispose of any unmanaged resources.

  4. Check for Subscriptions: Ensure that any event handlers or message subscriptions are being properly unsubscribed or disposed of to prevent memory leaks.

  5. Review Configuration: Review your Elsa configuration to ensure that you are not unintentionally retaining workflow instances in memory. For example, check if you are using a persistent workflow store and if the configuration of the store is correct.

  6. Report the Issue: If none of the above steps help, and you are using the latest version of Elsa, consider reporting the issue on the Elsa GitHub repository. Provide detailed information about your environment, the version of Elsa you are using, and the steps to reproduce the issue.

  7. Seek Community Help: You can also seek help from the Elsa community on platforms like Stack Overflow or the Elsa Slack channel. Other users or the developers might have encountered similar issues and could have solutions.

  8. Workaround: As a temporary workaround, you might consider implementing a mechanism to periodically restart your service to clear the memory until a permanent fix is available.
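
To make point 2 above concrete, the following is a minimal, hypothetical heap-logging sketch (a plain .NET hosted service, not an Elsa API) that logs managed heap statistics once a minute so that growth can be correlated with the 15-minute schedule. Register it with `services.AddHostedService<HeapMonitor>()`; `TotalCommittedBytes` requires .NET 6 or later.

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;
using Microsoft.Extensions.Hosting;
using Microsoft.Extensions.Logging;

// Hypothetical diagnostic service: periodically logs GC heap figures so you can
// see whether memory keeps growing across scheduled runs or is eventually collected.
public class HeapMonitor : BackgroundService
{
    private readonly ILogger<HeapMonitor> _logger;

    public HeapMonitor(ILogger<HeapMonitor> logger) => _logger = logger;

    protected override async Task ExecuteAsync(CancellationToken stoppingToken)
    {
        while (!stoppingToken.IsCancellationRequested)
        {
            var info = GC.GetGCMemoryInfo();

            _logger.LogInformation(
                "Heap: {HeapMb:F1} MB, committed: {CommittedMb:F1} MB, gen2 collections: {Gen2}",
                info.HeapSizeBytes / (1024d * 1024d),
                info.TotalCommittedBytes / (1024d * 1024d),
                GC.CollectionCount(2));

            await Task.Delay(TimeSpan.FromMinutes(1), stoppingToken);
        }
    }
}
```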

Please note that without specific details about your environment and the version of Elsa you are using, it's challenging to provide a precise solution. If you can provide more details, I may be able to offer more targeted advice.

This comment was generated by Glime.

@rboylesDev
Author

Oh, I forgot to mention, we are using version 3.1.2 of Elsa. So I do not believe it is the same issue as the Code Activity memory leak.

@sfmskywalker sfmskywalker self-assigned this May 6, 2024
@sfmskywalker
Member

Confirmed:

[screenshot]

@rboylesDev thanks for reporting!

@sfmskywalker sfmskywalker added this to the Elsa 3.2 milestone May 6, 2024
@sfmskywalker
Member

Actually, when I let it keep running, it seems to purge memory:

[screenshot]

So now I am not really sure there is a memory leak 🤔

@sfmskywalker
Member

Unless the "23,4 unreachable" is a hint.

@sfmskywalker
Member

@rboylesDev Just to make sure: is the application using non-memory stores for all of the modules:

  • Workflow Management
  • Workflow Runtime

Unless you explicitly configure the persistence provider for these two, they will use the Memory provider by default, which will most certainly lead to an increase in memory consumption as new workflow instances and execution records are stored in an in-memory dictionary.
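
For comparison, a minimal sketch of what an explicit EF Core + SQL Server configuration for both modules typically looks like in Elsa 3 is shown below. The extension method names assume the Elsa.EntityFrameworkCore and Elsa.EntityFrameworkCore.SqlServer packages and may differ slightly between 3.x releases; treat this as an illustration, not the reporter's actual setup.

```csharp
using Elsa.EntityFrameworkCore.Extensions;
using Elsa.EntityFrameworkCore.Modules.Management;
using Elsa.EntityFrameworkCore.Modules.Runtime;
using Elsa.Extensions;

var builder = WebApplication.CreateBuilder(args);
var connectionString = builder.Configuration.GetConnectionString("Elsa");

builder.Services.AddElsa(elsa =>
{
    // Workflow Management: persist definitions and instances in SQL Server
    // instead of the default in-memory store.
    elsa.UseWorkflowManagement(management =>
        management.UseEntityFrameworkCore(ef => ef.UseSqlServer(connectionString)));

    // Workflow Runtime: persist triggers, bookmarks and execution records too.
    elsa.UseWorkflowRuntime(runtime =>
        runtime.UseEntityFrameworkCore(ef => ef.UseSqlServer(connectionString)));
});

var app = builder.Build();
app.Run();
```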

@rboylesDev
Author

We are using Elsa with EF Core and SQL Server. I believe these are configured correctly as it is a very simple code setup.

[screenshot]

@cristinamudura cristinamudura removed this from the Elsa 3.2 milestone May 16, 2024
@rboylesDev
Author

Minor update on our end: we decided to try the Quartz scheduler instead of the built-in scheduler. The result was the same, allocating ~200 MB per scheduled workflow run and never appearing to release it. What is interesting is that running the same workflow manually does not show the same jump in allocation.
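
One way to tell whether that ~200 MB is genuinely rooted (a real leak) or simply has not been collected yet is to force a full, blocking collection after a scheduled run and compare successive readings. A hypothetical helper, plain .NET and illustrative only:

```csharp
using System;

public static class HeapCheck
{
    // Call after a scheduled run completes, e.g. from a temporary diagnostics
    // endpoint. If the "after" value climbs by roughly the same amount on every
    // run, the memory is still reachable from somewhere (a leak); if it drops
    // back, the growth is uncollected garbage the GC has not yet needed to reclaim.
    public static void Report()
    {
        var before = GC.GetTotalMemory(forceFullCollection: false) / (1024d * 1024d);

        GC.Collect();
        GC.WaitForPendingFinalizers();
        GC.Collect();

        var after = GC.GetTotalMemory(forceFullCollection: true) / (1024d * 1024d);
        Console.WriteLine($"Heap before forced GC: {before:F1} MB, after: {after:F1} MB");
    }
}
```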

Project status: In Progress
Development: no branches or pull requests
3 participants