Stars
Provider-agnostic, open-source evaluation infrastructure for language models
The glamourous AI coding agent for your favourite terminal 💘
Trae Agent is an LLM-based agent for general-purpose software engineering tasks.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…
Generate a timeline of your day, automatically
crizCraig / open-webui
Forked from open-webui/open-webui. User-friendly WebUI for LLMs (Formerly Ollama WebUI)
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
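A minimal usage sketch of the Levenshtein package's two core calls, `distance` and `ratio`; the exact similarity value in the comment is approximate and shown only for illustration.

```python
import Levenshtein

# Minimum number of single-character edits (insert, delete, substitute)
print(Levenshtein.distance("kitten", "sitting"))  # 3

# Normalised string similarity in [0, 1]; higher means more similar
print(Levenshtein.ratio("kitten", "sitting"))  # roughly 0.6
```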
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
📄🧠 PageIndex: Document Index for Reasoning-based RAG
An associative memory system that stores and retrieves experiences using the 5W1H framework (Who, What, When, Where, Why, How) and content-addressable memory.
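A hypothetical sketch of the idea behind a 5W1H-keyed, content-addressable store; this is not the project's actual API, and all names here (`Experience`, `MemoryStore`, `store`, `recall`) are invented for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Experience:
    # The 5W1H fields describing a single stored experience
    who: str
    what: str
    when: str
    where: str
    why: str
    how: str

class MemoryStore:
    """Toy content-addressable memory: any subset of 5W1H fields can serve as the query."""

    def __init__(self) -> None:
        self._memories: list[Experience] = []

    def store(self, exp: Experience) -> None:
        self._memories.append(exp)

    def recall(self, **query: str) -> list[Experience]:
        # Return every experience whose fields match all supplied query values
        return [m for m in self._memories
                if all(getattr(m, field) == value for field, value in query.items())]

memory = MemoryStore()
memory.store(Experience(who="agent", what="ran tests", when="2024-06-01",
                        where="CI", why="pre-merge check", how="pytest"))
print(memory.recall(who="agent", where="CI"))
```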
Frontier Models playing the board game Diplomacy.
Native mobile client for OpenWebUI. Chat with your self‑hosted AI.
Run Claude Code in a somewhat safe and isolated YOLO mode
A CLI tool for analyzing Claude Code/Codex CLI usage from local JSONL files.
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
Collection of evals for Inspect AI
Public repository containing METR's DVC pipeline for eval data analysis
SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.