0% found this document useful (0 votes)

41 views19 pages

Munich Rust 2020

William Woodruff's presentation at the Rust Munich Meetup discusses steganography in x86 binaries, introducing a tool called steg86 that hides messages within binary programs using semantic duals of x86 instructions. The tool is capable of embedding and extracting messages while maintaining the integrity of the original binary, leveraging the complexity of x86 instruction encoding. Woodruff also addresses the challenges and limitations of this approach, including code/data disambiguation and detection issues.

Uploaded by

rtloweb

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views19 pages

Munich Rust 2020

Uploaded by

rtloweb

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

steg86: hiding messages in x86 binaries

rust munich meetup

william woodruff

august 25 2020
agenda

I yours truly
I steganography?
I steg on programs
I x86 instruction encoding
I steg86
yours truly

I william woodruff
I @8x5clPW2 • yossarian.net • blog.yossarian.net

I senior security engineer @ trail of bits

I work: program analysis research, mostly in LLVM
I disclaimer: independent talk, not representing employer

I open source: member of homebrew, miscellaneous contributor

steganography?

I “hiding data within data”

I not cryptography
I different techniques for
different data
I popular targets:
I images
I sound files
I plain text
I what about programs?
steg on programs

programs are a natural choice

for steg
I can be very large (lots of
info capacity)
I complex binary formats
(PE, Mach-O, ELF)
I complex instruction
encodings (x86/AMD64,
ARM w/ Thumb)
I present on every computer,
not inherently suspicious
steg on programs: approaches

I hide information in stack layout, register selection

I problem: need the program’s source
I problem: need to maintain a compiler. . .

I hide information in the format itself (e.g. segment order)

I problem: specific to a format, may not apply to others

I rewrite the program after compilation

I ex: add eax, -50 → sub eax, 50
I problem: code/data disambiguation (difficult to solve)
I problem: relocations, position independent code (-fPIC)
I problem: CPU-level semantics (arithmetic, status flags)

I can we do better?
x86 instruction encoding
I variable length (up to 15 bytes)
I extremely complex (decades of compat, overloaded fields)
I rich source/sink combinations
I register-to-register (mov ebx, eax)
I register-to-memory (mov dword [1337], eax)
I memory-to-register (mov eax, dword [1337])
I immediate-to-register (mov eax, 1337)
I immediate-to-memory (mov dword [1337], 1337)
x86 instruction encoding: modr/m

I essentially an 8-bit lookup table of (some) operand encodings

I doesn’t cover all possible operands, for historical reasons. . .

I simplest case: encodes one or two operands

I reg/opcode field: one register operand
I r/m field: one register or memory operand
I enables mem-to-reg, reg-to-mem, reg-to-reg operations
x86 instruction encoding: xor

opcode instruction
31 /r xor r/m32, r32
33 /r xor r32, r/m32

I reg-to-mem, mem-to-reg, reg-to-reg, ...

I there are two reg-to-reg encodings!
I 31 C0 → mov eax, eax
I 33 C0 → also mov eax, eax!
I they’re even the same size!
I 64-bit variants (w/ REX prefix) work too!
steg86

I central conceit: each reg-to-reg pair represents one bit of

information
I with enough bits, we can hide messages!

I binary format independent

I uses goblin to unpack PE/ELF/Mach-O binaries

I encodings are the same size, so PIC/relocations aren’t broken

I uses iced for decoding/encoding/semantics

I ~700 lines of rust total (much of it constants)

I CLI: steg86 {profile,embed,extract}
steg86: semantic duals
I it turns out there are a bunch of these
I 9 instructions (add, adc, sub, sbb, and, or, xor, mov, cmp)
I 4 variants (8, 16, 32, 64-bit) each1

I each dual gives us 1 bit of information

I minus a little space for a header with metadata

I how common are these instructions?

$ steg86 profile /bin/bash
Summary for /bin/bash:
175828 total instructions
27957 potential semantic pairs
27925 bits of information capacity (3490 bytes)
I not bad!

1
actually 3 in any particular CPU mode. . .
steg86: semantic duals

each pair represents (false, true). . .

static SEMANTIC_PAIRS: &[(Code, Code)] = &[
// ADD
(Code::Add_rm8_r8, Code::Add_r8_rm8),
(Code::Add_rm16_r16, Code::Add_r16_rm16),
(Code::Add_rm32_r32, Code::Add_r32_rm32),
(Code::Add_rm64_r64, Code::Add_r64_rm64),

// ... snip ...

];
steg86: profiling

for every instruction in the program. . .

// skip instructions we don't support
if !SUPPORTED_OPCODES.contains(&instruction.code()) {
continue;
}

// skip non reg-to-reg instructions

if instruction.op0_kind() != OpKind::Register
|| instruction.op1_kind() != OpKind::Register
{
continue;
}

offsets.push(instruction.ip() as usize);
steg86: embedding
for each candidate instruction. . .
let new_code = {
let tuple = SEMANTIC_PAIRS
.iter()
.find(|&&t| old_code == t.0 || old_code == t.1)
.unwrap();

match (bit, tuple.0 == old_code) {

(false, true) | (true, false) => {
// already correct!
continue;
}
(false, false) => tuple.0,
(true, true) => tuple.1,
}
};
steg86: embedding

let new_instruction = Instruction::with_reg_reg(

new_code,
instruction.op0_register(),
instruction.op1_register(),
);
let new_len = encoder
.encode(&new_instruction, offset as u64)
.map_err(|s| anyhow!(s))?;

// ... snip ...

text_copy
.data
.splice(
offset..(offset + new_len), encoder.take_buffer());
steg86: results
binary diff:
$ cargo install steg86

$ echo "hello!" > message.txt

$ steg86 embed \
/bin/bash test.steg \
< message.txt

$ steg86 extract test.steg

hello!
steg86: next steps

I other tricks
I test reg1, reg2 is the same as test reg2, reg1
I same with xchg
I multi-byte nops

I deficiencies
I code/data disambiguation is impossible in the general case
I many open problems in program analysis reduce to this
I partial workarounds: CFG recovery, jump table identification

I very easy to detect (real compilers stick to one encoding)

thank you!
slides: yossarian.net/publications#munich-rust-2020
github: woodruffw/steg86
blog post: hiding messages in x86 binaries using semantic duals
contact: [email protected] / @8x5clPW2
links and prior work

I A86 assembler (1980s!)

I HYDAN (2004)
I ARMaHYDAN (2019, PoC||GTFO)

The Most Notorious "Talker" Runs The World's Greatest Clan Vol 3
No ratings yet
The Most Notorious "Talker" Runs The World's Greatest Clan Vol 3
339 pages
Algebraic Geometry - A First Course - Joe Harris - Harvard University
86% (7)
Algebraic Geometry - A First Course - Joe Harris - Harvard University
337 pages
Whirlpool Schema
No ratings yet
Whirlpool Schema
11 pages
Tutorial 09 Sol
No ratings yet
Tutorial 09 Sol
5 pages
Assembler Verilog
No ratings yet
Assembler Verilog
9 pages
Kelly Strategy for Investors
50% (2)
Kelly Strategy for Investors
7 pages
RPRT
No ratings yet
RPRT
6 pages
Tenses: S + V1/s/es S + Tobe (Is, Am, Are) + C
No ratings yet
Tenses: S + V1/s/es S + Tobe (Is, Am, Are) + C
3 pages
P 1515 - Design and Contstruction of Anchored and Strutted Sheet Pile Walls Iin Soft Clay PDF
No ratings yet
P 1515 - Design and Contstruction of Anchored and Strutted Sheet Pile Walls Iin Soft Clay PDF
36 pages
8086 Programming: Compiled By: Chandra Thapa October 23, 2012
No ratings yet
8086 Programming: Compiled By: Chandra Thapa October 23, 2012
76 pages
Apndxd
No ratings yet
Apndxd
42 pages
Some Basic Concepts of Chemistry
No ratings yet
Some Basic Concepts of Chemistry
19 pages
Op Codes
No ratings yet
Op Codes
42 pages
MIPS Assembly Code Exam 2019/20
No ratings yet
MIPS Assembly Code Exam 2019/20
6 pages
ARM Assembly Shellcode Guide
No ratings yet
ARM Assembly Shellcode Guide
66 pages
Y86 Programmer-Visible State
No ratings yet
Y86 Programmer-Visible State
14 pages
Processor Architecture
No ratings yet
Processor Architecture
25 pages
Integers Floating Point: N N S E
No ratings yet
Integers Floating Point: N N S E
4 pages
L11 Datapath1
No ratings yet
L11 Datapath1
49 pages
RISC-V ISA Lectures
100% (1)
RISC-V ISA Lectures
65 pages
ARM Architecture Overview
No ratings yet
ARM Architecture Overview
6 pages
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
No ratings yet
Offensive Security & Reverse Engineering (OSRE) : Ali Hadi
110 pages
The DLX Instruction Set
No ratings yet
The DLX Instruction Set
13 pages
Machine Language
No ratings yet
Machine Language
51 pages
Micro 7
No ratings yet
Micro 7
79 pages
Computer Architecture Basics
No ratings yet
Computer Architecture Basics
24 pages
VLSI
No ratings yet
VLSI
48 pages
x64 Assembly and ASCII Reference
No ratings yet
x64 Assembly and ASCII Reference
4 pages
Kavi Bhai Santokh Singh
No ratings yet
Kavi Bhai Santokh Singh
4 pages
16 MachineLang
No ratings yet
16 MachineLang
83 pages
Chapter 3 Instructions ARM
No ratings yet
Chapter 3 Instructions ARM
35 pages
x86 Assembly Tutorial: COS 318: Fall 2017
No ratings yet
x86 Assembly Tutorial: COS 318: Fall 2017
23 pages
MIPS Instruction Formats & Registers
No ratings yet
MIPS Instruction Formats & Registers
25 pages
80x85 Format
No ratings yet
80x85 Format
17 pages
ARM-Inst Summary
No ratings yet
ARM-Inst Summary
2 pages
Computer Instruction Sets Guide
No ratings yet
Computer Instruction Sets Guide
31 pages
Intel I
No ratings yet
Intel I
72 pages
2 Mips Architecture
No ratings yet
2 Mips Architecture
70 pages
PLDI Week 02 X86lite
No ratings yet
PLDI Week 02 X86lite
30 pages
6 Machine - Intro v2
No ratings yet
6 Machine - Intro v2
29 pages
3 - ARMv8-A Architecture
No ratings yet
3 - ARMv8-A Architecture
67 pages
MA6452 S&NM 1 - by Civildatas - Com 12
No ratings yet
MA6452 S&NM 1 - by Civildatas - Com 12
50 pages
Design and Manufacturing of Carbon Fiber Composite Drive Shaft As An Alternative To Conventional Steel Drive Shaft
No ratings yet
Design and Manufacturing of Carbon Fiber Composite Drive Shaft As An Alternative To Conventional Steel Drive Shaft
10 pages
Instruction Encoding: - The ISA Defines
No ratings yet
Instruction Encoding: - The ISA Defines
25 pages
Isas and Y86-64: Samira Khan
No ratings yet
Isas and Y86-64: Samira Khan
49 pages
Hoc Sinh Gioi 8 - 2022
No ratings yet
Hoc Sinh Gioi 8 - 2022
10 pages
Experiment 16: Heat Conduction
No ratings yet
Experiment 16: Heat Conduction
6 pages
Current Affairs Weekly Q&A PDF February 2023 2nd Week by AffairsCloud 1
No ratings yet
Current Affairs Weekly Q&A PDF February 2023 2nd Week by AffairsCloud 1
79 pages
01 Lecture02
No ratings yet
01 Lecture02
78 pages
RRB Alp Xam: Study Material For Quantative Aptitude
No ratings yet
RRB Alp Xam: Study Material For Quantative Aptitude
12 pages
1.5.2 Strategy As Position: Why Strategy Execution Fails
No ratings yet
1.5.2 Strategy As Position: Why Strategy Execution Fails
12 pages
CAO Fall 2024 Lecture 04 Instruction Set Architecture RISC V Machine Language Microarchitecture
No ratings yet
CAO Fall 2024 Lecture 04 Instruction Set Architecture RISC V Machine Language Microarchitecture
42 pages
Multi-Core Computer Architecture: Instruction Encoding
No ratings yet
Multi-Core Computer Architecture: Instruction Encoding
14 pages
ES Alcoholic Beverages
No ratings yet
ES Alcoholic Beverages
10 pages
Optimize Y86-64 Pipelined Processor
No ratings yet
Optimize Y86-64 Pipelined Processor
10 pages
Refrigerants: The Pragmatic Solution of Today
No ratings yet
Refrigerants: The Pragmatic Solution of Today
2 pages
Reflection Paper Guide for "The Billionaire"
No ratings yet
Reflection Paper Guide for "The Billionaire"
4 pages
Critical Thinking Exercise: "Wild Child: The Story of Feral Children"
No ratings yet
Critical Thinking Exercise: "Wild Child: The Story of Feral Children"
2 pages
Career Development As A Management Accou
No ratings yet
Career Development As A Management Accou
19 pages
Business 70 PDF
No ratings yet
Business 70 PDF
1 page
UV Stable Waterproof Membrane Guide
No ratings yet
UV Stable Waterproof Membrane Guide
3 pages
Namma Kalvi 12th Zoology Question Bank em 217045
No ratings yet
Namma Kalvi 12th Zoology Question Bank em 217045
45 pages
Cse331 L3 Arm Isa
No ratings yet
Cse331 L3 Arm Isa
103 pages
160719a0cd3011 - 29094359708
No ratings yet
160719a0cd3011 - 29094359708
2 pages
CSCE 5610 Computer System Architecture: Content
No ratings yet
CSCE 5610 Computer System Architecture: Content
7 pages
WeekARM Assy Slides
No ratings yet
WeekARM Assy Slides
17 pages
Instruction Encoding: CS223 Computer Architecture & Organization
No ratings yet
Instruction Encoding: CS223 Computer Architecture & Organization
15 pages
Lecture-3-07 01 2025
No ratings yet
Lecture-3-07 01 2025
16 pages
Riscv Isa Full For Lab4
No ratings yet
Riscv Isa Full For Lab4
44 pages
Partial Derivatives Quiz Analysis
No ratings yet
Partial Derivatives Quiz Analysis
8 pages
Electric Fan
No ratings yet
Electric Fan
1 page
Expanding Codes
No ratings yet
Expanding Codes
30 pages
MIPS Instruction Format
No ratings yet
MIPS Instruction Format
19 pages
Vacuum Test Procedure (VCP)
No ratings yet
Vacuum Test Procedure (VCP)
5 pages
1-Introduction Au Systeme D'exploitation 2 (Emu8086)
No ratings yet
1-Introduction Au Systeme D'exploitation 2 (Emu8086)
39 pages
ch2 1
No ratings yet
ch2 1
54 pages
2024-Spring - 2242-Biol-1345-001 3
No ratings yet
2024-Spring - 2242-Biol-1345-001 3
5 pages
Lesson 2.1 - Intro + x86-x64 Assembly
No ratings yet
Lesson 2.1 - Intro + x86-x64 Assembly
33 pages
Latin American Veggie Meal Plan
No ratings yet
Latin American Veggie Meal Plan
2 pages
CIE IGNITE Season 01 Ideathon Idea Submissions
No ratings yet
CIE IGNITE Season 01 Ideathon Idea Submissions
255 pages
(78s) (2018) (Azeria) HITB-LAB - ARM ExploitationLab
No ratings yet
(78s) (2018) (Azeria) HITB-LAB - ARM ExploitationLab
78 pages
MIPS
No ratings yet
MIPS
22 pages
Lecture 4
No ratings yet
Lecture 4
109 pages
Lecture 6
No ratings yet
Lecture 6
54 pages
Cse331 l3 Arm Isa
No ratings yet
Cse331 l3 Arm Isa
101 pages
Program Encodings Assembly
No ratings yet
Program Encodings Assembly
12 pages
4 Disassembler Example
No ratings yet
4 Disassembler Example
13 pages
RISC-V Chap3 Lab1
No ratings yet
RISC-V Chap3 Lab1
6 pages
Info1112 A1: Stdchip: Due: Sunday The 14 September 2025, 11:59Pm Aest
No ratings yet
Info1112 A1: Stdchip: Due: Sunday The 14 September 2025, 11:59Pm Aest
18 pages

Munich Rust 2020

Uploaded by

Munich Rust 2020

Uploaded by

steg86: hiding messages in x86 binaries

rust munich meetup

I senior security engineer @ trail of bits

I open source: member of homebrew, miscellaneous contributor

I “hiding data within data”

programs are a natural choice

I hide information in stack layout, register selection

I hide information in the format itself (e.g. segment order)

I rewrite the program after compilation

I essentially an 8-bit lookup table of (some) operand encodings

I simplest case: encodes one or two operands

I reg-to-mem, mem-to-reg, reg-to-reg, ...

I central conceit: each reg-to-reg pair represents one bit of

I binary format independent

I encodings are the same size, so PIC/relocations aren’t broken

I ~700 lines of rust total (much of it constants)

I each dual gives us 1 bit of information

I how common are these instructions?

each pair represents (false, true). . .

// ... snip ...

for every instruction in the program. . .

// skip non reg-to-reg instructions

match (bit, tuple.0 == old_code) {

let new_instruction = Instruction::with_reg_reg(

// ... snip ...

$ echo "hello!" > message.txt

$ steg86 extract test.steg

I very easy to detect (real compilers stick to one encoding)

I A86 assembler (1980s!)

You might also like