π§ββοΈ
Sorcerer
07
π¦π§βπ»πΌοΈπ§ββοΈ
- Kerala
-
13:54
(UTC +05:30) - https://www.sonubabu.me/
- https://orcid.org/0009-0001-2493-7995
- @sonuxb
Highlights
Pinned Loading
-
Anthropic-Alignment-Fake-Demo-Extension
Anthropic-Alignment-Fake-Demo-Extension PublicTraining-Aware Behavior and Compliance Shifts in Small Open-Source LLMs
-
temporal-refusal-timing-pilot
temporal-refusal-timing-pilot PublicPilot framework for trajectory-sensitive refusal timing evaluation and mitigation diagnostics.
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


