HarshTrivedi

Harsh Trivedi HarshTrivedi

🤖 Building AI agents & interactive environments. E.g., 🌍 AppWorld (https://appworld.dev)

132 followers · 36 following

Research Scientist @ Allen Institute for AI
United States
http://harshtrivedi.me/
@harsh3vedi
in/harshjtrivedi

Achievements

x3 x2

Achievements

x3 x2

Highlights

Organizations

Stars

tomerwolgithub / monaco

https://huggingface.co/datasets/allenai/MoNaCo_Benchmark

Python 6 Updated Aug 21, 2025

snap-stanford / relbench

RelBench: Relational Deep Learning Benchmark

Python 300 71 Updated Oct 24, 2025

pyg-team / pytorch-frame

Tabular Deep Learning Library for PyTorch

Python 713 68 Updated Oct 27, 2025

astral-sh / ty

An extremely fast Python type checker and language server, written in Rust.

Python 13,087 131 Updated Oct 27, 2025

laude-institute / sandboxes

Python 24 15 Updated Oct 27, 2025

PrimeIntellect-ai / verifiers

Environments for LLM Reinforcement Learning

Python 3,381 402 Updated Oct 26, 2025

gepa-ai / gepa

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,399 99 Updated Oct 26, 2025

Shangyint / langProBe

Python 25 6 Updated Jun 12, 2025

princeton-pli / hal-harness

Python 172 32 Updated Oct 22, 2025

pdasigi / mcp-playground

Jinja 3 Updated Aug 27, 2025

awslabs / unified-text2sql-benchmark

UNITE: A Unified Benchmark for Text-to-SQL Evaluation

Python 82 2 Updated May 30, 2025

laude-institute / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 969 347 Updated Oct 27, 2025

HarshTrivedi / packing-unpacking-pytorch-minimal-tutorial

Minimal tutorial on packing and unpacking sequences in pytorch

Python 209 17 Updated Feb 9, 2019

StonyBrookNLP / multee

Repository for Repurposing Entailment for Multi-Hop Question Answering Tasks, NAACL19

Python 29 7 Updated May 4, 2020

StonyBrookNLP / teabreac

Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22

Python 19 Updated Jun 23, 2023

StonyBrookNLP / ircot

Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23

Jsonnet 236 32 Updated Jun 12, 2024

StonyBrookNLP / musique

Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022

Python 173 18 Updated Jun 12, 2024

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 71,028 2,160 Updated Oct 27, 2025

StonyBrookNLP / appworld-leaderboard

🌍 Leaderboard Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL2024

Python 6 4 Updated Oct 15, 2025

nektos / act

Run your GitHub Actions locally 🚀

Go 66,577 1,760 Updated Oct 1, 2025

sotopia-lab / awesome-social-agents

A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.

TypeScript 98 23 Updated Jun 3, 2024

sotopia-lab / sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 256 38 Updated Oct 6, 2025

camel-ai / oasis

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

Python 2,012 222 Updated Oct 16, 2025

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 7,609 830 Updated Oct 14, 2025

sveltejs / svelte

web development for the rest of us

JavaScript 84,495 4,662 Updated Oct 26, 2025

koaning / uvtrick

A fun party trick to run Python code from another venv into this one.

Python 205 7 Updated Mar 15, 2025

Paitesanshi / LLM-Agent-Survey

2,856 156 Updated Feb 20, 2025

urschrei / pyzotero

Pyzotero: a Python client for the Zotero API

Python 1,117 121 Updated Oct 26, 2025

samkhur006 / awesome-llm-planning-reasoning

A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.

303 16 Updated Feb 28, 2025

StonyBrookNLP / appworld

🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource Paper.

Python 295 39 Updated Oct 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harsh Trivedi HarshTrivedi

Achievements

Achievements

Highlights

Organizations

Block or report HarshTrivedi

Stars

tomerwolgithub / monaco

snap-stanford / relbench

pyg-team / pytorch-frame

astral-sh / ty

laude-institute / sandboxes

PrimeIntellect-ai / verifiers

gepa-ai / gepa

Shangyint / langProBe

princeton-pli / hal-harness

pdasigi / mcp-playground

awslabs / unified-text2sql-benchmark

laude-institute / terminal-bench

HarshTrivedi / packing-unpacking-pytorch-minimal-tutorial

StonyBrookNLP / multee

StonyBrookNLP / teabreac

StonyBrookNLP / ircot

StonyBrookNLP / musique

astral-sh / uv

StonyBrookNLP / appworld-leaderboard

nektos / act

sotopia-lab / awesome-social-agents

sotopia-lab / sotopia

camel-ai / oasis

simular-ai / Agent-S

sveltejs / svelte

koaning / uvtrick

Paitesanshi / LLM-Agent-Survey

urschrei / pyzotero

samkhur006 / awesome-llm-planning-reasoning

StonyBrookNLP / appworld