fix: actually handle sessions in parallel #308
Conversation
LGTM, but I have some questions so that I understand.
According to https://cloud.google.com/functions/docs/concepts/exec#auto-scaling_and_concurrency
Each instance of a function handles only one concurrent request at a time. This means that while your code is processing one request, there is no possibility of a second request being routed to the same instance. Thus the original request can use the full amount of resources (CPU and memory) that you requested.
So in the context of Cloud Functions, this parallelism will never be used, is that right? If CF is all we cared about, we wouldn't need this change.
However, this parallelism may still be useful in cases where this framework is deployed directly to Cloud Run, which by default has a concurrency value of 80. Is that the thinking?
Since I'm guessing that 32 and 64 were chosen as nice round numbers, we might want to consider using 80 somewhere to match Cloud Run's default concurrency.
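Concretely, the suggestion amounts to something like the following (the constant name is hypothetical, not taken from this PR):

```cpp
#include <cstddef>

// Hypothetical constant, not from this PR: cap in-flight sessions at Cloud
// Run's default per-instance concurrency (80) rather than an arbitrary 64.
inline constexpr std::size_t kMaxConcurrentSessions = 80;
```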
Codecov Report
@@            Coverage Diff             @@
##             main     #308      +/-   ##
==========================================
+ Coverage   55.47%   55.48%   +0.01%
==========================================
  Files         562      562
  Lines       15105    15205     +100
==========================================
+ Hits         8380     8437      +57
- Misses       6725     6768      +43
Continue to review full report at Codecov.
Probably.
Yes.
Doh. I should have looked this up, completely forgot. Will change.
We were creating a thread to handle each request, and then promptly blocking until the thread completed. With this change we leave the thread running. Before accepting a new connection we clean up the memory resources for old sessions (not that they are that big). We block and stop accepting requests once 64 sessions are running; at that point it is probably better to let the hosting environment (Cloud Run, Cloud Functions, whatever) create new instances to handle the additional load.
Fixes #309
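For context, here is a rough sketch of the pattern described above. It is not the framework's actual code: the `SessionPool` name and its members are assumptions, and `std::async` stands in for the raw thread handling in the real change, but it shows the same three ideas (leave the session thread running, reap finished sessions before accepting more work, and block at the cap).

```cpp
// Rough sketch only: names and structure are assumptions, not this PR's code.
#include <chrono>
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <future>
#include <list>
#include <mutex>

class SessionPool {
 public:
  explicit SessionPool(std::size_t max_sessions) : max_sessions_(max_sessions) {}

  // Start `session` on its own thread and return immediately, instead of
  // joining right away (which serialized requests). Blocks only while
  // `max_sessions_` sessions are already running.
  void StartSession(std::function<void()> session) {
    std::unique_lock<std::mutex> lk(mu_);
    // Reclaim the (small) resources held by sessions that already finished.
    sessions_.remove_if([](std::future<void> const& f) {
      return f.wait_for(std::chrono::seconds(0)) == std::future_status::ready;
    });
    // At the cap, stop accepting new work and let the hosting environment
    // (Cloud Run, Cloud Functions, whatever) scale out with more instances.
    cv_.wait(lk, [this] { return active_ < max_sessions_; });
    ++active_;
    sessions_.push_back(std::async(
        std::launch::async, [this, s = std::move(session)]() mutable {
          s();
          std::lock_guard<std::mutex> g(mu_);
          --active_;
          cv_.notify_all();
        }));
  }

 private:
  std::size_t max_sessions_;
  std::mutex mu_;
  std::condition_variable cv_;
  std::size_t active_ = 0;
  // Declared last so it is destroyed first; each future from std::async
  // blocks in its destructor, so shutdown drains the running sessions.
  std::list<std::future<void>> sessions_;
};
```

In this sketch, constructing `SessionPool pool(64)` in the accept loop reproduces the limit described above; constructing it with 80 instead would match Cloud Run's default per-instance concurrency, as suggested in the review.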