Remove RecordBase to speedup processing #57

xmedeko · 2018-03-07T12:44:46Z

Fix #55
The second commit limits the data processor getattr.

dtcooper · 2018-03-07T20:00:00Z

Looks decent. Thanks for submitting. Need more time to review more intimately. What sort of performance gains do you observe.

xmedeko · 2018-03-07T21:09:00Z

The first commit is about 10-15%, the second commit about 5%.

The second commit moved caching to the instance of processor, so, if someone uses a new processor every time and have small FITs, may have some performance degradation, maybe. Anyway, maybe the scrubbing is to complicated and may be simplified? Do not know.

Fix dtcooper#55

xmedeko · 2018-03-09T08:19:13Z

I've added BaseType.size caching to the first commit, which is kind of micro-optimization.

xmedeko · 2018-03-16T10:11:41Z

Update: I've run benchmarks on my Win32 Intel i5 2.5GHz machine previously. I've tried it on our Linux x64 (Ubuntu Xenial) Intel Xeon 2.6GHz server, and it's 5x faster. (all running Python 3.5.2). IMHO it cannot be caused just by the faster processor, there must be some performance problem with Python on Windows or on x32 arch.

Anyway, removing RecordBase brings more flexibility to class constructor, e.g. see #58.

xmedeko · 2018-04-04T07:48:16Z

Just about the commit: FitFileDataProcessor cache methods not just method names
I thing the data_processor should be removed from the FitFile completely. It may be designed as a wrapper around DataMessage, so a user may plugin into the parsing process externally. Something like:

with FitFile(...) as ffile:
    pr = FitFileDataProcessor(ffile)
    for m in pr.get_messages(...):
        # m is processed

This way, we do not need any static caching, it's up to the user.

pR0Ps · 2018-04-25T12:20:27Z

fitparse/processors.py

+            return method
+
+        scrubbed_method_name = scrub_method_name(method_name)
+        try:


Can do method = getattr(self, scrubbed_method_name, None) to use None as the default if the attribute doesn't exist.

getattr is slow for large FITs (e.g. 10 hours records). It's the purpose of this commit to avoid it and speed the processors.

I understand that. I'm suggesting using getattr(self, scrubbed_method_name, None) instead of the try/except for the initial caching.

pR0Ps · 2018-04-25T12:35:59Z

fitparse/records.py

    def __repr__(self):
        return '<FieldType: %s (%s)>' % (self.name, self.base_type)


-class MessageType(RecordBase):
+class MessageType():


missing object?

Yep, missing the object, thanks.

xmedeko · 2018-04-25T12:51:02Z

@pR0Ps thanks for CR. Meanwhile I've got mroe insight into the problem with the processors a bit. So I would discard the commit about the processors, make a issue for that where we can debate it more in depth.

pR0Ps · 2018-04-25T12:53:23Z

If your solution to the processors is the one you suggested above, I wouldn't recommend it. Making things "up to the user" is just pushing the problem along. If most users are going to want the values out of the FitFile object as standard SI units (likely), then it makes sense for that to be the default. The common case should be easy, and the advanced case (get raw values) should be possible. This is the way it is currently set up now.

xmedeko force-pushed the recordbase branch from 04d51db to 2da6961 Compare March 7, 2018 19:45

Remove RecordBase to speedup processing

efafdd2

Fix dtcooper#55

xmedeko force-pushed the recordbase branch from 2da6961 to 54b8452 Compare March 9, 2018 08:15

xmedeko mentioned this pull request Mar 9, 2018

Add FitFileEncoder for writing FIT files #58

Open

FitFileDataProcessor cache methods not just method names

51cd245

xmedeko force-pushed the recordbase branch from 54b8452 to 3fd140d Compare March 23, 2018 19:08

xmedeko force-pushed the recordbase branch from 3fd140d to 51cd245 Compare April 12, 2018 20:20

pR0Ps reviewed Apr 25, 2018

View reviewed changes

xmedeko mentioned this pull request Jun 11, 2018

Refactoring and optimization by decorators #72

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove RecordBase to speedup processing #57

Remove RecordBase to speedup processing #57

xmedeko commented Mar 7, 2018 •

edited

dtcooper commented Mar 7, 2018

xmedeko commented Mar 7, 2018

xmedeko commented Mar 9, 2018 •

edited

xmedeko commented Mar 16, 2018

xmedeko commented Apr 4, 2018

pR0Ps Apr 25, 2018

xmedeko Apr 25, 2018

pR0Ps Apr 25, 2018

pR0Ps Apr 25, 2018

xmedeko Apr 25, 2018

xmedeko commented Apr 25, 2018

pR0Ps commented Apr 25, 2018

Remove RecordBase to speedup processing #57

Are you sure you want to change the base?

Remove RecordBase to speedup processing #57

Conversation

xmedeko commented Mar 7, 2018 • edited

dtcooper commented Mar 7, 2018

xmedeko commented Mar 7, 2018

xmedeko commented Mar 9, 2018 • edited

xmedeko commented Mar 16, 2018

xmedeko commented Apr 4, 2018

pR0Ps Apr 25, 2018

Choose a reason for hiding this comment

xmedeko Apr 25, 2018

Choose a reason for hiding this comment

pR0Ps Apr 25, 2018

Choose a reason for hiding this comment

pR0Ps Apr 25, 2018

Choose a reason for hiding this comment

xmedeko Apr 25, 2018

Choose a reason for hiding this comment

xmedeko commented Apr 25, 2018

pR0Ps commented Apr 25, 2018

xmedeko commented Mar 7, 2018 •

edited

xmedeko commented Mar 9, 2018 •

edited