Kelly Hohman (Mysticbirdie)

Full stack AI engineer

hallucination-elimination-benchmark Public

Multi-tier benchmark: Cultural grounding + Triad Engine eliminates LLM hallucination across Claude 4.6, GPT-5.2, Mistral 7B, Gemini 2.5 Pro. Raw 15-58% → 95-100% accuracy on 222 adversarial QA pair…

Python 6 1
image-cultural-accuracy-benchmark Public

Benchmark measuring historical accuracy of AI-generated images. 24 image pairs (3 characters × 8 scenes) set in Rome 110 CE, comparing naive prompts vs culturally-grounded prompts. Blinded A/B eval…

Python 2
DevTrail Public

Universal Memory Layer for Developers: a system that automatically captures, indexes, and makes searchable everything you do across your development tools. It extracts conversation history, code c…

Python 2
triad-rome-benchmark Public template

Cultural AI benchmark demonstrating 100% accuracy

Python 1
GlobalChitchat Public

JavaScript
ACGBM Public

Forked from teslasolar/ACGBM

ACG Barcode Monsters

JavaScript