
Added bulk object allocator (memory pool) for syntax nodes #173

Open

wants to merge 20 commits into master
Conversation

vintagedave (Contributor)

This branch implements a memory pool for each syntax node class: nodes are allocated from a pool rather than from FastMM. When a node is freed, its memory is returned to the pool, but the pool's large allocation is not released; in other words, repeated allocations and frees will not fragment memory. This comes at the cost of slightly higher memory usage for the life of the process (typically a few hundred kilobytes).

This is turned off by default, and when off has absolutely no effect on the code.

This is part of my quest to reduce memory fragmentation in pre-XE8 IDEs for Navigator and Bookmarks. They re-parse files with almost every change (with a small delay) and over many hours this can result in many, many allocations and frees of the syntax node classes. Since other memory operations occur in between these, and since the pre-XE8 IDE has a small address space (2GB) further exacerbated by .Net's allocator sitting in the same process, fragmentation can occur.

This is not necessary for occasional use of DelphiAST, e.g. running it once. It may help significantly when DelphiAST is run many times in a row, as in the Navigator/Bookmarks use case, or when parsing several thousand files in a large project.
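For illustration, a per-class pool of this kind is usually hooked in by overriding `NewInstance` and `FreeInstance`. The sketch below is not the PR's actual code; `TNodePool`, its `Acquire`/`Release` methods, and the `USENODEPOOL` define are hypothetical names:

```pascal
type
  TSyntaxNode = class
  private
    class var FPool: TNodePool; // hypothetical per-class pool instance
  public
    class function NewInstance: TObject; override;
    procedure FreeInstance; override;
  end;

class function TSyntaxNode.NewInstance: TObject;
begin
{$IFDEF USENODEPOOL}
  // Take a block from the pool instead of the memory manager;
  // InitInstance zeroes it and sets up the VMT pointer as usual.
  Result := InitInstance(FPool.Acquire(InstanceSize));
{$ELSE}
  Result := inherited NewInstance;
{$ENDIF}
end;

procedure TSyntaxNode.FreeInstance;
begin
{$IFDEF USENODEPOOL}
  CleanupInstance;     // finalize managed fields (strings, interfaces)
  FPool.Release(Self); // the block returns to the pool, not to FastMM
{$ELSE}
  inherited FreeInstance;
{$ENDIF}
end;
```

With the define off, both methods fall through to the inherited behaviour, matching the "no effect when disabled" claim above.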

vintagedave and others added 13 commits August 4, 2015 14:20
…lso very importantly memory fragmentation).

Added StringCache.pas.
Changed attributes to use the string cache.
Note: this requires a define, USESTRINGCACHE, which is off by default. The cache is not threadsafe.
…ionary of attributes, but attributes no longer work as a dictionary.)
…en threadsafe, any individual instance has its get/add method contents wrapped in a lock, to protect the internal structures. The instance is never cleared when the count of objects using it drops to 0, since that's a bit more complex to get right: it should only clear while the refcount is 0, but that would have to be locked to ensure it isn't changed while clearing is happening, which severely slows incrementing/decrementing the count. I couldn't think of a good compare-lock-exchange algorithm, so just don't bother; when threadsafe, it's never cleared while alive.
…both operations; now holds it for both. Safe to enter a CS twice.)
This reverts commit 95233e9.
… This is off by default, but when on will reduce memory fragmentation at the cost of slightly higher permanent memory usage through the lifetime of the process
@RomanYankovsky (Owner)

Thanks, @vintagedave!

But FastMM uses memory pools too, doesn't it? I mean, do you have any benchmark that shows your memory management is better than FastMM's? It's a very sensitive area; we have to be very careful.

@vintagedave (Contributor, Author)

That's tricky. I have empirical evidence, which is memory stability in the IDE. I haven't done a full test, which would require tracking all FastMM allocations and freed areas of memory.

FastMM cannot ever optimize for a specific class. It can only optimize by size, and it does have pools per memory size (small, medium, and I think it goes straight to VirtualAlloc for large allocations). My suspicion is that the fragmentation is occurring at a higher level, e.g. it may be releasing entire blocks back to the OS and then re-acquiring them, so the fragmentation is at that level. It will always free memory, whereas this code does not: it is a static pool that never shrinks, which means fragmentation cannot occur because there is no repeated releasing and re-acquisition.

For my purposes (IDE plugins) this works and seems to be important (there is a noticeable stability increase in the IDE, especially older versions), so I will keep this in a private fork even if it's not merged. Personally, I suspect this would benefit FixInsight too, if you parse large (say, thousand-unit-plus) projects. Note that it's not enabled by default; if merged, no-one will notice any change, because unless it's built with the define there is no code change.

@RomanYankovsky (Owner)

I'm OK to merge this. I just want to be 150% sure, that's why I'm asking.

As far as I understand, this pull request includes StringCache, so if I merge this, the previous pull request is not needed. Right?

Let's wait a couple of days so everyone can give their opinion. @Wosi and @sglienke use DelphiAST intensively, so I want them to be happy too.

@vintagedave (Contributor, Author)

Yes, I branched from my string cache branch, since it's part 2 of the memory fragmentation code. The string cache is IMO the more important of the two.

@sglienke (Contributor)

To be honest, using a simple object pool, where you request objects and then put them back, would have been enough. If you preallocate n objects, IMO you have the same effect. Messing with InitInstance and allocating memory yourself for those objects to use seems like unnecessary overkill and a possible source of errors (is it thread-safe? I don't think so).

@vintagedave (Contributor, Author)

It's not threadsafe, no. Of course, it's not used unless you turn it on: when compiled without the define, the code is absolutely unchanged from the current code.

The main thing with reusing objects in an object pool, instead of a memory pool, is ensuring that they are initialised correctly. With this technique, the constructor etc. runs as normal, and memory is zeroed. With a reused object, you need to ensure that every time one is reused, all fields are reset. Without an extensive code inspection, I'm not confident of this in DAST, since the code relies on new objects being in the state they are constructed in, i.e. it assumes defaults without setting everything explicitly.

@sglienke (Contributor)

sglienke commented Apr 25, 2016

Resetting fields is done by calling TObject.CleanupInstance, and initializing by calling TObject.InitInstance. Managing the objects' lifetime and memory is not the purpose of the objects/classes themselves but of someone else; of course, you then have to refactor a fair bit of the DAST code to not just call Create/Free on these classes/objects but use an allocator/factory/object pool (you name it).

But to me this would be the cleaner approach, and easier to maintain when moving to a more typed node tree in the future, because then you don't need to put that custom allocation code into every node class.
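A rough sketch of the object-pool approach described here, assuming a hypothetical `TNodePool` wrapper around DelphiAST's node class (`InitInstance` and `CleanupInstance` are real `TObject` methods; the pool type and its method names are illustrative):

```pascal
uses
  System.Generics.Collections;

type
  TNodePool = class
  private
    FFree: TStack<TSyntaxNode>; // recycled, finalized instances
  public
    constructor Create;
    destructor Destroy; override;
    function Acquire: TSyntaxNode;
    procedure Release(Node: TSyntaxNode);
  end;

constructor TNodePool.Create;
begin
  inherited Create;
  FFree := TStack<TSyntaxNode>.Create;
end;

destructor TNodePool.Destroy;
begin
  FFree.Free;
  inherited;
end;

function TNodePool.Acquire: TSyntaxNode;
begin
  if FFree.Count > 0 then
  begin
    Result := FFree.Pop;
    // Re-zero the recycled memory and reset the VMT pointer so the
    // instance looks freshly constructed. Note: any work the class's
    // constructor does beyond zeroing fields would need to be re-run.
    TSyntaxNode.InitInstance(Result);
  end
  else
    Result := TSyntaxNode.Create; // pool empty: allocate normally
end;

procedure TNodePool.Release(Node: TSyntaxNode);
begin
  Node.CleanupInstance; // finalize managed fields before recycling
  FFree.Push(Node);
end;
```

The caller then uses Acquire/Release instead of Create/Free; only the pool touches InitInstance/CleanupInstance, so the node classes themselves need no custom allocation code.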

@vintagedave (Contributor, Author)

Hmm. Well, I could certainly rework this to do that instead. That would be a better option?

@sglienke (Contributor)

sglienke commented Aug 5, 2022

I have been looking into minimizing heap allocations by refactoring several methods in TPasSyntaxTreeBuilder.
I also refactored the node classes in DelphiAST.Classes to avoid some of the overhead that comes from InitInstance and CleanupInstance (which we don't need if we clear all fields ourselves). This refactoring has not been finished, but is part of a larger performance improvement project for DelphiAST.

So IMO this can be closed, as the changes are quite outdated by now.
