This repository contains the code for the website and leaderboard of the SEC-bench project.
To learn more about SEC-bench, please check out the main code repository along with the main paper, SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks.
git clone https://github.com/SEC-bench/sec-bench.github.io
cd sec-bench.github.io
pnpm install
pnpm devWe express our deepest gratitude to the creators of the Multi-SWE-bench and SWE-bench datasets. This repository is a modified version of their original website repository.
If you found SEC-bench helpful for your work, please cite as follows:
@article{lee2025sec,
title={SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks},
author={Lee, Hwiwon and Zhang, Ziqi and Lu, Hanxiao and Zhang, Lingming},
journal={arXiv preprint arXiv:2506.11791},
year={2025}
}
This project is licensed under Apache License 2.0. See the LICENSE flie for details.