Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View polaris79's full-sized avatar

Highlights

  • Pro

Block or report polaris79

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.

74 5 Updated Oct 15, 2025

πŸ’» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

1,055 58 Updated Aug 17, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 99,882 11,329 Updated Jan 12, 2026

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 120,329 16,956 Updated Jan 11, 2026

AgentScope: Agent-Oriented Programming for Building LLM Applications

Python 15,382 1,308 Updated Jan 12, 2026

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…

Python 3,962 289 Updated Dec 28, 2025

πŸ™ Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 69,020 7,357 Updated Dec 29, 2025

Production-ready platform for agentic workflow development.

Python 125,594 19,535 Updated Jan 12, 2026

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 840 63 Updated Jul 1, 2024

A tool for evaluating LLMs

TypeScript 428 45 Updated May 10, 2024

An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]

SAS 384 39 Updated May 20, 2024

The complete stack for AI Engineers: framework, runtime and control plane.

Python 36,804 4,870 Updated Jan 12, 2026

πŸ¦œπŸ”— The platform for reliable agents.

Python 123,978 20,427 Updated Jan 12, 2026

showing various ways to serve Keras based stable diffusion

Jupyter Notebook 111 5 Updated Feb 28, 2023

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,759 807 Updated Dec 8, 2022

Scrapes the Robinhood API to retrieve + store popularity and price data.

JavaScript 688 195 Updated Oct 31, 2024

Models and examples built with TensorFlow

Python 77,694 45,342 Updated Jan 10, 2026

Learning to Rank in TensorFlow

Python 2,781 479 Updated Mar 18, 2024

ClickModels is a small set of Python scripts for the user click models initially developed at Yandex. A Click Model is a probabilistic graphical model used to predict search engine click data from …

Python 239 71 Updated Jun 6, 2018

A PyTorch implementation of Paragraph Vectors (doc2vec)

Python 1 Updated Sep 20, 2017

A tensorflow implementation of Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

Python 188 85 Updated Aug 4, 2016