Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View Sruthi-sk's full-sized avatar

Block or report Sruthi-sk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Stop AI agents from half-building features. Ship complete code in one session.

TypeScript 225 19 Updated Dec 24, 2025

This repository accompanies the research paper "Sandbagging Auditing Games" on detecting sandbagging in frontier AI systems. We provide access to the model organisms used in the paper and tools for…

Python 4 Updated Dec 15, 2025

James' cookbook of evaluations and finetuning experiments

Python 16 1 Updated Dec 15, 2025

✨ Monorepo containing most of BlueDot Impact's custom software.

TypeScript 24 30 Updated Dec 24, 2025

Inference API for many LLMs and other useful tools for empirical research

Python 89 24 Updated Dec 24, 2025

Code for the paper: Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models

Jupyter Notebook 10 5 Updated Oct 7, 2025

ControlArena is a collection of settings, model organisms and protocols - for running control experiments.

Python 144 81 Updated Dec 18, 2025

MedRAX: Medical Reasoning Agent for Chest X-ray - ICML 2025

Python 1,067 185 Updated Oct 31, 2025

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,952 153 Updated Nov 16, 2025
Jupyter Notebook 857 540 Updated Nov 12, 2025

Repository for the "Chain-of-Thought Reasoning In The Wild Is Not Always Faithful" paper

HTML 30 18 Updated Nov 28, 2025

[NeurIPS 2024] CoSy is an automatic evaluation framework for textual explanations of neurons.

Jupyter Notebook 19 2 Updated Jun 20, 2025

A curated list of foundation models, datasets, and tools for biosignals

97 11 Updated Dec 12, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,181 841 Updated May 12, 2025

⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.

Jupyter Notebook 100 27 Updated Oct 27, 2025

The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.

Python 3,234 633 Updated Dec 24, 2025
Python 23 4 Updated Oct 3, 2025

Weekly seminar on transformers/LLMs at the University of Wyoming

Jupyter Notebook 3 Updated Jul 1, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 16,057 1,274 Updated Jan 18, 2025
Python 26 17 Updated Dec 18, 2025

LLM101n: Let's build a Storyteller

35,956 1,962 Updated Aug 1, 2024
Python 61 14 Updated Dec 17, 2025

Experimental LLM interface exploring new ways to use AI to improve human thinking

TypeScript 19 2 Updated Mar 5, 2025

Web based agent that can control your computer.

Python 3 Updated Feb 24, 2025

The 2024 edition of The Nature of Code with p5.js. Includes Notion workflow and build system.

HTML 1,690 133 Updated Sep 4, 2025

A Python implementation of a project that classifies the valence and context of several thousand pig calls, extended to have a web interface.

Python 1 1 Updated Mar 18, 2025
Jupyter Notebook 1,073 174 Updated Dec 22, 2025
Next