Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embeddings recursively. This helps us understand user behaviour on…

Python 359 36 Updated Sep 10, 2025

LRudL / sad

Situational Awareness Dataset

HTML 41 6 Updated Dec 14, 2024

EleutherAI / cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 822 42 Updated Jul 29, 2025

jordansauce / obfuscated_backdoors

Forked from abhay-sheshadri/sae_experiments

Code for reproducing sections 4 and 6.2 of the paper "Obfuscated Activations Bypass LLM Latent-Space Defenses"

Jupyter Notebook 2 3 Updated Feb 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Maheep Chaudhary MaheepChaudhary

Achievements

Achievements

Highlights

Block or report MaheepChaudhary

Stars

MaheepChaudhary / safetynet

Blkalkin / Optimal-TestTime

jacobhilton / deep_learning_curriculum

emergent-misalignment / emergent-misalignment

TeunvdWeij / sandbagging

LukeBailey181 / obfuscated-activations

xingjunm / Awesome-Large-Model-Safety

simplescaling / s1

google-deepmind / treescope

huggingface / smolagents

browser-use / browser-use

longtermrisk / openweights

unslothai / unsloth

567-labs / kura

LRudL / sad

EleutherAI / cookbook

jordansauce / obfuscated_backdoors

ApolloResearch / apd

facebookresearch / llm-transparency-tool

anthropics / hh-rlhf

UFO-101 / auto-circuit

EleutherAI / elk-generalization

ejnnr / cupbearer

EleutherAI / training-jacobian

adamkarvonen / SAEBench

guanyingc / latex_paper_writing_tips

bhoov / distributed_DAM

facebookresearch / detectron2

git-disl / awesome_LLM-harmful-fine-tuning-papers

UIC-Liu-Lab / ContinualLM