feat: deterministic distributed inference #20

@mascharkh

Description

Current state: different GPUs produce different results for the same inference, which causes verification failures; an auditor cannot reproduce the prover's output.
Desired state: bit-identical outputs across A40, T4, and 2080 GPUs, using FP64 with controlled rounding.
Actions: add forward_f64() layers, a RoundingLog struct, a --deterministic flag, and Ray worker integration.
Success criteria: with the prover on an A40 and the auditor on a T4, both get identical outputs; the rounding log stays under 2 MB; overhead stays under 1.5x.
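
For context, here is a minimal sketch of why results can diverge across devices and why wider accumulation helps. It is not the proposed forward_f64() implementation. It assumes that a main source of mismatch is floating-point reduction order, which can differ between GPU kernels. The helper names (`to_f32`, `sum_f32`, `sum_f64_then_round`) are hypothetical, and FP32 is emulated on the CPU with the standard library only.

```python
import struct


def to_f32(x: float) -> float:
    """Round a Python float (FP64) to the nearest FP32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def sum_f32(values, order):
    """Accumulate in emulated FP32, rounding after every addition."""
    acc = 0.0
    for i in order:
        acc = to_f32(acc + values[i])
    return acc


def sum_f64_then_round(values, order):
    """Accumulate in FP64 and round to FP32 once, at the end."""
    acc = 0.0
    for i in order:
        acc += values[i]
    return to_f32(acc)


# Three FP32-representable inputs, summed in two different orders
# (standing in for two devices with different reduction trees).
values = [1e8, 1.0, -1e8]
order_a = (0, 1, 2)
order_b = (0, 2, 1)

if __name__ == "__main__":
    # FP32 accumulation: 1e8 + 1.0 rounds back to 1e8, so the orders disagree.
    print(sum_f32(values, order_a), sum_f32(values, order_b))  # 0.0 1.0
    # FP64 accumulation with a single final rounding: the orders agree here.
    print(sum_f64_then_round(values, order_a),
          sum_f64_then_round(values, order_b))  # 1.0 1.0
```

FP64 accumulation narrows the set of inputs where order matters but does not eliminate it, since FP64 addition is not associative either. Presumably this is where controlled rounding and the RoundingLog come in, but that is an inference from the issue text, not a statement of the design.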

Related to #19
