Fixing a performance issue in SSE client #221


Merged
merged 8 commits into from Nov 30, 2018

Conversation

@hiranya911 (Contributor) commented Nov 14, 2018

Improves the performance of the streaming RTDB API by checking for the end of a field more efficiently. Specifically, instead of matching a regex on each iteration, we keep track of the last 4 characters seen and perform a simple string comparison.
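As a rough illustration of the last-4-characters idea (the function name, terminator string, and character-stream interface below are hypothetical, not the PR's actual code):

```python
# Hypothetical sketch: detect the end of an SSE field by tracking only
# the last 4 characters seen, instead of re-running a regex over the
# whole accumulated buffer on every iteration.

END_OF_FIELD = '\r\n\r\n'  # assumed 4-character terminator

def read_field(chars):
    """Consume characters until the end-of-field marker appears."""
    buf = []
    tail = ''
    for char in chars:
        buf.append(char)
        tail = (tail + char)[-4:]  # keep only the last 4 characters
        if tail == END_OF_FIELD:   # plain string comparison, no regex
            break
    return ''.join(buf)
```

The comparison against a fixed 4-character window is O(1) per character, whereas re-matching a regex against the whole buffer grows with the buffer's length.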

Here are some preliminary test results to get an idea of the improvement.

| Node Size | Time to process (Before) | Time to process (After) |
|-----------|--------------------------|-------------------------|
| 100K      | 40.83s                   | 1.73s                   |
| 1M        | -                        | 11.36s                  |

Fixes #198

@daniel-ziegler commented

Doesn't work for me at all on Mac OS X -- I simply don't receive any listen events. This definitely needs to be tested with a real HTTP stream.

Also note that while this should hopefully improve the constant factor, it's still O(N^2).

@hiranya911 (Contributor, Author) commented

There seems to be a requests bug here, which I've reported at psf/requests#4876

The proposed solution essentially reduces N by reading data in chunks. With the fix, I've tested payloads (in the unit tests) as large as 10MB, which can be processed in under a second.
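A minimal sketch of the chunked-reading idea on a generic file-like stream (this illustrates the principle only; the `iter_chunks` name is hypothetical and this is not the code from the linked fix):

```python
import io

def iter_chunks(stream, chunk_size=1024):
    """Yield fixed-size chunks from a file-like stream.

    Reading a chunk at a time amortizes the per-iteration overhead:
    the number of Python-level loop iterations drops from O(total
    bytes) to O(total bytes / chunk_size).
    """
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:  # an empty read signals end of stream
            break
        yield chunk
```

For example, a 10MB payload read in 1KB chunks takes roughly ten thousand loop iterations instead of ten million single-byte reads.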

@hiranya911 (Contributor, Author) commented Nov 17, 2018

Attached is another solution I've been tinkering with. It doesn't change how we read content from requests -- it just changes how we handle the incoming data:

  • Using a list to buffer incoming data, as opposed to string concatenation, which is inefficient at large sizes.
  • Getting rid of the regex check on each iteration. The alternative check I've implemented assumes \n\n as the only acceptable terminator, which seems to be correct for Firebase.
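The two points above can be sketched roughly as follows (assuming \n\n as the sole terminator, as stated; the `iter_events` name and the character-stream interface are hypothetical):

```python
# Hypothetical sketch combining both points: buffer characters in a
# list (cheap appends) and join once per event, checking for the
# assumed '\n\n' terminator with a plain comparison instead of a regex.

def iter_events(chars):
    """Yield complete events from an iterable of characters."""
    buf = []
    prev = ''
    for char in chars:
        buf.append(char)
        if prev == '\n' and char == '\n':  # end of event
            yield ''.join(buf)             # single O(n) join per event
            buf = []
            prev = ''
        else:
            prev = char
```

Because each character is appended once and joined once, the whole pass is linear in the input size, unlike repeated string concatenation, which copies the growing buffer on every append.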

With this I was able to read a 1MB node from a production database in under 10 seconds. Feedback welcome.

sse.txt

@daniel-ziegler commented

I like that solution better, since it is indeed linear time. Of course, for maximum performance we'd need to figure out how to read in bigger chunks rather than process each character individually in Python, but I'm happy with the improvement.

@hiranya911 (Contributor, Author) commented

@daniel-ziegler I've cleaned up the second solution and committed it. Can you give it a try? I've added my performance numbers to the PR description.

@daniel-ziegler commented

Seems to work; obviously I haven't tested it very thoroughly.

@hiranya911 hiranya911 assigned hiranya911 and unassigned bklimt Nov 30, 2018