- Research Scientist @ Allen Institute for AI
- United States
- http://harshtrivedi.me/
- @harsh3vedi
- in/harshjtrivedi
Highlights
- Pro
Stars
https://huggingface.co/datasets/allenai/MoNaCo_Benchmark
RelBench: Relational Deep Learning Benchmark
Tabular Deep Learning Library for PyTorch
An extremely fast Python type checker and language server, written in Rust.
Environments for LLM Reinforcement Learning
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
UNITE: A Unified Benchmark for Text-to-SQL Evaluation
A benchmark for LLMs on complicated tasks in the terminal
Minimal tutorial on packing and unpacking sequences in pytorch
Repository for Repurposing Entailment for Multi-Hop Question Answering Tasks, NAACL19
Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22
Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23
Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022
An extremely fast Python package and project manager, written in Rust.
Leaderboard Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL2024
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
OASIS: Open Agent Social Interaction Simulations with One Million Agents.
Agent S: an open agentic framework that uses computers like a human
A fun party trick to run Python code from another venv into this one.
Pyzotero: a Python client for the Zotero API
A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.
AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource Paper.