Arbitrary Syscall Invocation #235

Frank01001 · 2025-04-19T14:39:36Z

This pull request implements the long-awaited arbitrary system call invocation.

API: d.invoke_syscall("write", 1, d.regs.rbp, 0x10)

Addresses #169 #225

… called

…scall

Build mistakes + doc mistakes resolution

…d exception in syscall handler callback

…s stopped inside a syscall

… syscalls

…nother branch

Frank01001 · 2025-04-24T20:07:06Z

Finally works on i386 as well, Now just AArch64 remaining

io-no · 2025-04-25T08:45:31Z

docs/quality_of_life/arbitrary_code_execution.md

+</div>
+Additionally, when the syscall is a [`fork`](https://man7.org/linux/man-pages/man2/fork.2.html), [`vfork`](https://man7.org/linux/man-pages/man2/vfork.2.html) or [`clone`](https://man7.org/linux/man-pages/man2/clone.2.html), the function will also restore the state in the child process / thread. This is done by copying the registers from the parent process / thread to the child process / thread.
+
+As you can see, registers values are restored after the syscall is executed to reduce the chances of the process crashing. However, be mindful that the syscall is indeed executed. Thus, the state of the process will have changed.


Maybe it may be worth adding explicitely the clarification that the memory is not restored since it is intended to be so

io-no · 2025-04-25T08:56:27Z

libdebug/architectures/amd64/amd64_ptrace_register_holder.py

@@ -385,6 +385,7 @@ def apply_on_thread(self: Amd64PtraceRegisterHolder, target: ThreadContext, targ

        # setup generic syscall properties
        target_class.syscall_number = _get_property_64("orig_rax")
+        target_class.syscall_num_register = _get_property_64("rax")


I think this is not really needed. Under arbitrary syscall calling, we control the on_enter status, hence, we can use syscall_number for this purpose

I would still need a variable to tell the status handler what syscall I want to hijack. Also, to me this addition disambiguates the meaning of the syscall_number property (which will create confusion in someone else other than me when they try to set the value for some other weird use case).

You can just build the on_enter callback there, and pass it the syscall number:

[...] # Rest of invoke_syscall def on_enter_invoke(t, _): t.syscall_number = syscall [...] # Rest of invoke_syscall

Doesn't this work? I agree that the weirdness with syscall_number has to be fixed, I don't think that adding a new attribute is really the proper way to do it.

I guess I have to agree with Alessandro, this time

io-no · 2025-04-25T08:56:55Z

libdebug/architectures/aarch64/aarch64_ptrace_register_holder.py

@@ -267,6 +267,7 @@ def apply_on_thread(self: Aarch64PtraceRegisterHolder, target: ThreadContext, ta
        target_class.instruction_pointer = _get_property_64("pc")

        # setup generic syscall properties
+        target_class.syscall_num_register = _get_property_64("x8")


see comment for amd64

io-no · 2025-04-25T08:57:25Z

libdebug/architectures/amd64/compat/i386_over_amd64_ptrace_register_holder.py

@@ -108,6 +108,7 @@ def apply_on_thread(self: I386OverAMD64PtraceRegisterHolder, target: ThreadConte

        # setup generic syscall properties
        target_class.syscall_number = _get_property_32("orig_rax")
+        target_class.syscall_num_register = _get_property_32("rax")


see comment for amd64

io-no · 2025-04-25T08:58:19Z

libdebug/architectures/i386/i386_ptrace_register_holder.py

@@ -164,6 +164,7 @@ def apply_on_thread(self: I386PtraceRegisterHolder, target: ThreadContext, targe

        # setup generic syscall properties
        target_class.syscall_number = _get_property_32("orig_eax")
+        target_class.syscall_num_register = _get_property_32("eax")


io-no · 2025-04-25T09:43:48Z

libdebug/debugger/internal_debugger.py

+
+        if not self._is_in_background():
+            self.__polling_thread_command_queue.put((self.__threaded_cont_to_syscall, (thread,)))
+            self.__polling_thread_command_queue.put((self.__threaded_wait, ()))


if you do the wait, you should do the join. No? It should work also as it is but idk, just double check for race or edge cases

io-no · 2025-04-25T09:49:07Z

libdebug/debugger/internal_debugger.py

+
+        is_cloning_event = syscall_name in ["fork", "vfork", "clone", "clone3"]
+
+        if is_cloning_event:


separate function, please

io-no · 2025-04-25T09:53:24Z

libdebug/debugger/internal_debugger.py

+                child.syscall_number = syscall_number
+
+                # - Restore registers
+                child.step()


this was fine for def con, not for production

io-no · 2025-04-25T10:01:56Z

libdebug/debugger/internal_debugger.py

+                    if isinstance(getattr(thread.regs, reg_name), int | float) and reg_name != "_thread_id":
+                        setattr(child.regs, reg_name, getattr(thread.regs, reg_name))
+            else:
+                # If the syscall is a fork, we need to fix the state of the new process


is not a fork???

io-no · 2025-04-25T10:05:13Z

libdebug/debugger/internal_debugger.py

+                    if isinstance(getattr(thread.regs, reg_name), int | float) and reg_name != "_thread_id":
+                        setattr(child.regs, reg_name, getattr(thread.regs, reg_name))
+        return retval
+


what about base pointer, stack pointer and pippo stuff for a new thread?

MrIndeciso · 2025-04-26T07:52:18Z

libdebug/architectures/amd64/amd64_thread_context.py

@@ -25,3 +25,8 @@ def __init__(self: Amd64ThreadContext, thread_id: int, registers: Amd64PtraceReg

        # Register the thread properties
        self._register_holder.apply_on_thread(self, Amd64ThreadContext)
+
+    @property
+    def num_syscall_args(self: Amd64ThreadContext) -> int:


I appreciate the abstract method and everything LoL, but for once I am going to have to say that I don't think we need this yet. There are only a couple of platforms where syscalls do not take 6 arguments, we support none of them.
I'd rater have a concrete method that returns 6 in the abstract ThreadContext, and we override that method in the concrete subclass only for the platforms where we need it (if we ever decide to support PowerPC 32, for example)

Ok. Btw, I think ARM32 (which is arguably more likely than PowerPC) is also 7.

Mhmm I should look into support for arm32 over aarch64 actually, I thought that they were incompatible.
In any case, I would just override that method for arm32. Less lines of code is better (in my opinion) ((in this case)).

MrIndeciso · 2025-04-26T08:03:34Z

libdebug/debugger/internal_debugger.py

+                reg_bit_count = get_platform_gp_register_size(self.arch) * 8
+                negative_threshold = 2 ** (reg_bit_count - 1)
+
+                if new_pid >= negative_threshold:


pid_t is a signed 32 bit integer on all platforms, you could just check that new_pid.bit_length() <= 31, and it's not a platform-dependent check

MrIndeciso · 2025-04-26T08:06:41Z

test/scripts/syscall_invocation_test.py

+
+        # Invoke the syscall
+        if PLATFORM == "i386":
+            # On i386, the mmap syscall has a different signature: it takes a struct instead of the arguments directly


You can just call mmap_pgoff on i386 instead of mmap

So you can avoid all of this

MrIndeciso · 2025-04-26T08:07:12Z

libdebug/debugger/internal_debugger.py

+            self.handled_syscalls[syscall_number].on_enter_user is None
+            and self.handled_syscalls[syscall_number].on_exit_user is None
+        ):
+            if not self._is_in_background():


I think that this flow is not totally correct for cloning functions (and some others).

If I invoke a normal syscall, this happens:
process is in group stop -> thread does SYSCALL and hits on_enter -> thread does SYSCALL and hits on_exit -> end of invoke_syscall. All is fine, I think.

If I invoke fork/clone/whatever, I think that this is what happens:
process is in group stop -> thread does SYSCALL and hits on_enter -> thread does SYSCALL -> kernel notifies us of a new FORK/CLONE_EVENT -> ptrace_status_handler receives the event and then does a process-wide cont -> thread hits on_exit -> end of invoke_syscall. So this has unexpected side-effects for multithreaded process, I think, and I am also not sure how this doesn't break the flow for single-threaded processes, because the main thread gets cont'd too.

Now, what happens if I invoke the exit syscall from a thread? We should support thread-suicide I think, and I honestly don't know if and how this flow could handle that.

I was thinking about this, and I found some other syscalls that call for "special treatment":

seccomp: I don't remember how we have it implemented now, but this is another syscall that (I suppose) generates a third event in-between the two SYSCALL stops.

execve/execveat: these probably break libdebug in general, not only when injected, but we should probably add an error if the user attempts to invoke them, because we would definitely break.

Thinking about the whole syscall invocation thingy, I've actually been wondering if we should think of a non-blocking implementation too:

r = d.run() d.invoke_syscall("read", d.regs.rax, 0x10) r.sendline(b"provola")

This makes no sense in a real script, I know, but it would deadlock everything because the read syscall will never terminate, waiting for the sendline right after. Am I hallucinating? Probably. Is there an actual sane case of something like this that could happen in a real script? I think so, actually.

So should invoke_syscall be non-blocking like cont? Should we have an optional non-blocking mode? Does this make sense? This whole syscall invocation thing has been like opening a whole can of worms, I think.

cc @io-no the "we should do it for consistency" API connoisseur, I think I gave him enough material to insult me for a week in this message.

I think the third event is not an issue if it occurs within the context of a handled syscall, since any continue operation in libdebug becomes a ptrace_syscall. I could suggest maintaining a similar mechanism also for the invocation.

void LibdebugPtraceInterface::cont_thread(Thread &t) { if (ptrace(handle_syscall ? PTRACE_SYSCALL : PTRACE_CONT, t.tid, NULL, t.signal_to_forward) == -1) { throw std::runtime_error("ptrace cont failed"); } t.signal_to_forward = 0; }

The seccomp event will be managed internally during the wait loop. Therefore, it is sufficient to have a wait somewhere to prevent the event from causing issues.
Is this what you meant, or did I misunderstand?
That said, I agree that seccomp handling should be improved overall.

I might have to agree for the second time today regarding the API. It makes sense to have a non-blocking API — this would make it consistent with the rest of the API, not just continue (we still need to fix step and a few other APIs that are currently blocking).
The only intended way to wait for the program to stop should be through d.wait, in my opinion — just my two cents.

At this point, maybe the entire mechanism of invoke_syscall could be managed through special handles.
An exit handle, transparent to the user, would manage restoring the original state of the process and simply stop the process at the point where the syscall was invoked.

I think the third event is not an issue if it occurs within the context of a handled syscall, since any continue operation in libdebug becomes a ptrace_syscall. I could suggest maintaining a similar mechanism also for the invocation.

Yeah, it's not a problem if we do the handlers and everything, I was just saying that it is probably a problem in the current implementation of invoke_syscall, so the issue is not only with clone and fork.

MrIndeciso · 2025-04-26T08:11:29Z

test/scripts/syscall_invocation_test.py

+        elif PLATFORM == "i386":
+            ret = d.invoke_syscall("clone", clone_flags, stack_base, stack_base + 0x04, d.regs.gs, stack_base + 0x08)
+        elif PLATFORM == "aarch64":
+            # To retrieve the TLS base, we need to use the TPIDR_EL0 register


The tls parameter is nullable in the clone syscall.
You are not setting the CLONE_SETTLS flag, so I don't think that this is needed, just pass 0x0 and it will work.
I think this is the same for the other two parameters as well, parent_tid and child_tid.

MrIndeciso · 2025-04-26T08:13:15Z

libdebug/debugger/internal_debugger.py

+        if any(not isinstance(arg, int) for arg in args):
+            raise TypeError("All arguments must be integers.")
+
+        self._ensure_process_stopped()


This is not really needed.

MrIndeciso · 2025-04-26T08:15:17Z

libdebug/debugger/internal_debugger.py

+        Returns:
+            int: The return value of the syscall.
+        """
+        # Initial checks to ensure the syscall can be invoked


I think we should check that the thread is on an executable page before injecting a syscall. What if we are on a RW page? Or if we are at the boundary of an executable page and we don't have enough space for a syscall instruction?

Yes, since we inject it before the current IP, we should.

Frank01001 and others added 30 commits November 1, 2024 23:27

feat: first prototype (based on nanobind). WIP - not yet working

43f9cb5

debug: debugging setup, thread is dead by the time the nb function is…

4ce078c

… called

Merge branch 'dev', remote-tracking branch 'origin' into arbitrary-sy…

68c41ed

…scall

feat: got the first correct arbitrary write syscall working. :D

5653dcb

chore: decluttering and refactoring

603f960

Merge branch 'dev' into arbitrary-syscall

ae71c8d

fix: missing return

a3c01cc

feat: created file for useful constants and parsing

cf013b8

fix: Generalized error handling

0bec031

Merge branch 'dev' into arbitrary-syscall

9528512

feat: prototype of the python conversion of the arbitrary syscall code

9840a79

fix: debugging prints to debug the addition of emulated syscall

ad6a7c3

Merge branch 'dev' into arbitrary-syscall

75d7db6

feat: working prototype of syscall handling during invocation

f720b51

feat: prototype of error parsing for pprint_syscalls

e9b49d7

Merge pull request #219 from libdebug/dev

cfbcc3e

Build mistakes + doc mistakes resolution

Merge remote-tracking branch 'origin' into arbitrary-syscall

d08e30e

feat: further progress on parameter parsing for pprint_syscall

16710c0

feat: partial implementation of arg parsing

3e1486d

fix: finally the invoke syscall works (allegedly)

a50765e

feat: overhaul of syscall invocation to handle fork and other cases

b828689

Merge branch 'dev' into arbitrary-syscall

59cdf89

test: many fixes and intensive testing (WIP)

8662cb4

fix: fork is still broken, but the rest should be fixed

c6811f3

feat: more progress on the mapping of syscall arguments

72e4733

feat: more syscall arg define parsing

dec1efd

feat: implemented syscall parsing from value map

0edf8f0

Merge remote-tracking branch 'origin/dev' into arbitrary-syscall

9c3c9d1

fix: ugly fix used at def con, not for production purposes

f4ced83

feat: added checks for clone syscall

33130f8

Frank01001 added 8 commits April 19, 2025 13:47

fix: all tests now run correctly, including invocation in callback an…

0f93b13

…d exception in syscall handler callback

docs: documentation of arbitrary syscall invocation

bf741c0

docs: minor changes to uniform some icons for API

cd164d4

fix: added initial check to ensure we are not inside another syscall

0b41925

fix: implemented function in status handler to check if the process i…

bdc2fdc

…s stopped inside a syscall

test: removed old TODO and added test for the new check

f180731

docs: documented the behavior of callbacks and pprints while invoking…

04d2fa8

… syscalls

chore: removed syscall arg parsing feature, which has been moved to a…

1ff7e08

…nother branch

Frank01001 added the enhancement New feature or request label Apr 19, 2025

Frank01001 self-assigned this Apr 19, 2025

Frank01001 added 4 commits April 19, 2025 16:48

fix: realigned leftovers of postponed feature

0640841

fix: extended sorrounding NOPs to other archs

7b63efc

fix: removed another trace of syscall arg parsing

270d4c6

test: generalized tests to other archs

32205cd

This was linked to issues Apr 22, 2025

Syscall Number is not what it says it is #225

Open

Arbitrary System Call #169

Open

Frank01001 added 8 commits April 22, 2025 22:09

test: fix other skill issues in test

8837b4b

test: this will fix the shellcode test being single arch

fec836c

test: i386 fixes

362e509

test: this might finally work

017f454

fix: changed wrong register for aarch64

38d240b

fix: fixed wrong handling of clone syscall

417915d

test: more fixes for tests on other archs

cec9255

test: fixed test for i386 and hopefully also for aarch64

b9728f5

io-no reviewed Apr 25, 2025

View reviewed changes

Frank01001 added 2 commits April 26, 2025 09:33

fix: misc fixes (incomplete)

6bb29a0

Merge remote-tracking branch 'origin/dev' into arbitrary-syscall

0067f45

MrIndeciso reviewed Apr 26, 2025

View reviewed changes

fix: prettier but not working

ee5b920

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Arbitrary Syscall Invocation #235

Arbitrary Syscall Invocation #235

Frank01001 commented Apr 19, 2025

Frank01001 commented Apr 24, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

Frank01001 Apr 26, 2025

MrIndeciso Apr 26, 2025

io-no Apr 28, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

io-no Apr 25, 2025

MrIndeciso Apr 26, 2025

Frank01001 Apr 26, 2025

MrIndeciso Apr 28, 2025 •

edited

Loading

MrIndeciso Apr 26, 2025

MrIndeciso Apr 26, 2025

MrIndeciso Apr 26, 2025

MrIndeciso Apr 26, 2025

MrIndeciso Apr 27, 2025

io-no Apr 28, 2025

MrIndeciso Apr 28, 2025

MrIndeciso Apr 26, 2025

MrIndeciso Apr 26, 2025

MrIndeciso Apr 26, 2025

Frank01001 Apr 26, 2025


		is_cloning_event = syscall_name in ["fork", "vfork", "clone", "clone3"]

		if is_cloning_event:

Arbitrary Syscall Invocation #235

Are you sure you want to change the base?

Arbitrary Syscall Invocation #235

Conversation

Frank01001 commented Apr 19, 2025

Frank01001 commented Apr 24, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MrIndeciso Apr 28, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MrIndeciso Apr 28, 2025 •

edited

Loading