🐢 Open-Source Evaluation & Testing library for LLM Agents
-
Updated
Oct 10, 2025 - Python
🐢 Open-Source Evaluation & Testing library for LLM Agents
Agentic testing for agentic codebases
Deliver safe & effective language models
MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.
GPT4Go: AI-Powered Test Case Generation for Golang 🧪
A Python library for verifying code properties using natural language assertions.
👁 零代码零标注 CV AI 自动化测试工具 🚀 免除大量人工画框和打标签等,直接零代码快速自动化测试 CV 计算机视觉 AI 人工智能图像识别算法:行人检测、动植物分类、人脸识别、OCR 车牌识别、旋转校正、舞蹈姿态、抠图分割 等,还可一键 下载测试报告、导出训练和测试数据集
Open-source framework for stress-testing LLMs and conversational AI. Identify hallucinations, policy violations, and edge cases with scalable, realistic simulations. Join the discord: https://discord.gg/ssd4S37WNW
Übungsaufgaben zum Buch "Basiswissen KI-Testen"
Prompture is an API-first library for requesting structured JSON output from LLMs (or any structure), validating it against a schema, and running comparative tests between models.
Agent testing library that uses an agent to test your agent, in Go.
A CLI for testing your UI. Easy
Agent testing library that uses an agent to test your agent, in Typescript.
Integration of OpenAI with Pytest to automate API test generation.
🚀 ARM64 Browser Automation for Claude Code - SaaS testing on 80 Raspberry Pi budget. The first solution that works where Playwright/Puppeteer fail on ARM64. Autonomous testing without human debugging.
Evaluate - The Robust LLM Testing Framework 🦀
Turn plain English into Robot Framework files with AI. No dependencies, no hassle — just validated, ready-to-run tests
Public whitepaper on AI testing strategies in healthcare using prompt engineering and LLMs.
Burro is a command-line interface (CLI) tool built with Deno for evaluating Large Language Model (LLM) outputs. It provides a straightforward way to run different types of evaluations with secure API key management.
Add a description, image, and links to the ai-testing topic page so that developers can more easily learn about it.
To associate your repository with the ai-testing topic, visit your repo's landing page and select "manage topics."