Get String#crypt working with multi-ractor in cases where !HAVE_CRYPT_R #13567

luke-gruber · 2025-06-09T21:35:02Z

In commit 12f7ba5, ractor safety was added to String#crypt, however in certain cases it can cause a deadlock. When we lock a native mutex, we cannot allocate ruby objects because they might trigger GC which starts a VM barrier. If the barrier is triggered and other native threads are waiting on this mutex, they will not be able to be woken up in order to join the barrier. To fix this, we don't allocate ruby objects when we hold the lock.

The following could reproduce the problem:

strings = []
10_000.times do |i|
  strings << "my string #{i}"
end

STRINGS = Ractor.make_shareable(strings)

rs = []
100.times do
  rs << Ractor.new do
    STRINGS.each do |s|
      s.dup.crypt(s.dup)
    end
  end
end
while rs.any?
  r, obj = Ractor.select(*rs)
  rs.delete(r)
end

I will not be adding tests because I am almost finished a PR to enable running test-all test cases inside many ractors at once, which is how I found the issue.

jhawthorn · 2025-06-10T00:37:13Z

string.c

+    size_t res_size = strlen(res)+1;
+    char *dup = malloc(res_size); // need to hold onto lock while duplicating a potentially static buffer
+    memcpy(dup, res, res_size);


Note for readers: We have to do this because of our #define strdup ruby_strdup. We want to malloc here, not xmalloc. And I don't think there's a portable way to determine the size of crypt's output buffer.

res = (strdup)(res); can avoid the macro.

jhawthorn · 2025-06-10T00:37:55Z

string.c

+    // Don't allocate a ruby object while holding this lock, we could hit a VM barrier, which
+    // causes a deadlock if other ractors are waiting on this lock.
+    result = rb_str_new_cstr(dup);
+    free(dup);


If rb_str_new_cstr erorrs, this pointer might leak.

I think that may not be worth dealing with as crypt is semi-deprecated https://bugs.ruby-lang.org/issues/14915

I prefer that (very unlikely) leak to the deadlock that will occur in the test suite under ractors.

nobu · 2025-06-10T00:38:54Z

string.c

-    result = rb_str_new_cstr(res);
+
+    size_t res_size = strlen(res)+1;
+    char *dup = malloc(res_size); // need to hold onto lock while duplicating a potentially static buffer


The encrypted result would be small enough in alloca.
And this copy should not be needed if crypt_r is available.

Thanks for the review. I've updated the PR to use alloca.
Edit: Some CI failures I'll look into it Friday.

In commit 12f7ba5, ractor safety was added to String#crypt, however in certain cases it can cause a deadlock. When we lock a native mutex, we cannot allocate ruby objects because they might trigger GC which starts a VM barrier. If the barrier is triggered and other native threads are waiting on this mutex, they will not be able to be woken up in order to join the barrier. To fix this, we don't allocate ruby objects when we hold the lock. The following could reproduce the problem: ```ruby strings = [] 10_000.times do |i| strings << "my string #{i}" end STRINGS = Ractor.make_shareable(strings) rs = [] 100.times do rs << Ractor.new do STRINGS.each do |s| s.dup.crypt(s.dup) end end end while rs.any? r, obj = Ractor.select(*rs) rs.delete(r) end ``` I will not be adding tests because I am almost finished a PR to enable running test-all test cases inside many ractors at once, which is how I found the issue. Co-authored-by: jhawthorn <[email protected]>

luke-gruber · 2025-06-25T19:17:43Z

@nobu, @jhawthorn: ready for re-review

luke-gruber force-pushed the fix_string_crypt_with_ractors branch 3 times, most recently from 8d25d6f to eb97984 Compare June 9, 2025 22:02

jhawthorn reviewed Jun 10, 2025

View reviewed changes

jhawthorn approved these changes Jun 10, 2025

View reviewed changes

nobu reviewed Jun 10, 2025

View reviewed changes

luke-gruber force-pushed the fix_string_crypt_with_ractors branch from eb97984 to e65f343 Compare June 12, 2025 20:38

This comment has been minimized.

Sign in to view

luke-gruber force-pushed the fix_string_crypt_with_ractors branch from e65f343 to 1d25406 Compare June 25, 2025 17:45

luke-gruber force-pushed the fix_string_crypt_with_ractors branch from 1d25406 to f97389e Compare June 25, 2025 18:38

jhawthorn approved these changes Jun 25, 2025

View reviewed changes

jhawthorn merged commit 328e302 into ruby:master Jun 25, 2025
84 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Get String#crypt working with multi-ractor in cases where !HAVE_CRYPT_R #13567

Get String#crypt working with multi-ractor in cases where !HAVE_CRYPT_R #13567

Uh oh!

luke-gruber commented Jun 9, 2025 •

edited

Loading

Uh oh!

jhawthorn Jun 10, 2025

Uh oh!

nobu Jun 10, 2025

Uh oh!

jhawthorn Jun 10, 2025 •

edited

Loading

Uh oh!

nobu Jun 10, 2025

Uh oh!

luke-gruber Jun 12, 2025 •

edited

Loading

Uh oh!

This comment has been minimized.

luke-gruber commented Jun 25, 2025

Uh oh!

Uh oh!

Uh oh!

Get String#crypt working with multi-ractor in cases where !HAVE_CRYPT_R #13567

Get String#crypt working with multi-ractor in cases where !HAVE_CRYPT_R #13567

Uh oh!

Conversation

luke-gruber commented Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhawthorn Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

nobu Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

jhawthorn Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nobu Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

luke-gruber Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

This comment has been minimized.

luke-gruber commented Jun 25, 2025

Uh oh!

Uh oh!

Uh oh!

luke-gruber commented Jun 9, 2025 •

edited

Loading

jhawthorn Jun 10, 2025 •

edited

Loading

luke-gruber Jun 12, 2025 •

edited

Loading