Stars
A general purpose scientific writer
A program analysis, verification, and optimization framework
MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 60.2% Avg@8 score on the challenging GAIA benchmark.
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
ZJU-PL / RepoAudit
Forked from PurCL/RepoAuditAn autonomous LLM-agent for large-scale, repository-level code auditing
ConcoLLMic: the first language- and theory-agonistic concolic execution engine via LLM agents
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Bear is a tool that generates a compilation database for clang tooling.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
An app that brings language models directly to your phone.
The book "Performance Analysis and Tuning on Modern CPU"
Minimal reproduction of DeepSeek R1-Zero
Math & CS awesome List, distinguished by proof and logic technique
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
A curated list of resources for using LLMs to develop more competitive grant applications.
An Ethereum Dynamic Analyzer, a.k.a, open-sourced transaction explorer similar to Phalcon/EthTx/TxTracer
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
Research Artifact of ISSTA 2024 Paper: See the Forest, not Trees: Unveiling and Escaping the Pitfalls of Error-Triggering Inputs in Neural Network Testing
Code for ACL 2024 paper: A Critical Study of What Code-LLMs (Do not) Learn
wtf is a distributed, code-coverage guided, customizable, cross-platform snapshot-based fuzzer designed for attacking user and / or kernel-mode targets running on Microsoft Windows and Linux user-m…
⚙️ A curated list of static analysis (SAST) tools and linters for all programming languages, config files, build tools, and more. The focus is on tools which improve code quality.
Large Language Model guided Protocol Fuzzing (NDSS'24)