Agents4PLC

A Multi-Agent System for Programmable Logic Controller (PLC) Code Generation based on IEC-61131 standard. The paper see: Agents4PLC: Automating Closed-loop PLC Code Generation and Verification in Industrial Control Systems using LLM-based Agents.

Important Notice: Due to collaboration agreements with industrial partners, we are unable to release the full implementation of our multi-agent framework at this time. However, we provide comprehensive resources to reproduce our research, including benchmark datasets, evaluation methodologies, demos, formal verification tools, and reproduction code for key baselines. Researchers can rebuild the framework following the detailed methodology described in our paper.

Dataset

Dataset v2.0 – New Extended Release

The dataset accompanies a new extension of Agents4PLC with a newly extended benchmark dataset of 70 medium cases and 3 hard cases. If you need functionality code to execute the formal specs, see [https://github.com/Luoji-zju/Agents4PLC_release]

Purpose & Role of the Dataset

In Agents4PLC we established the first verifiable benchmark for automatically generating IEC-61131-3 Structured-Text (ST) code from natural-language requirements. The dataset serves three goals:

Provide formal specifications (nuXmv / PLCverif compatible) so that correctness can be proven automatically.
Supply human-checked reference ST code that compiles and satisfies the specs.
Enable fair comparison of more complex PLC-code generation tasks with advanced tasks.

What's New in Dataset v2.0

Item	v1.0 (arXiv)	v2.0 (this release)
#tasks	23	96 (23 + 73)
Task IDs	not identified	M1-M70, H1-H3
Granularity	easy / medium	medium / hard
Quality gates	basic manual review	manual review with PLC engineers + fine-tuned LLM evaluator
Annotations	spec + reference	+ difficulty tag

All 73 new tasks are tagged with new prefix to avoid collisions with v1.0.

** New in this release**: Added High-Fidelity Dataset with 21 industrial tasks collected from programming competitions (released 2024-2025, sourced from [https://github.com/cangkui/AutoPLC] with our added properties) specifically for efficiency and reliability analysis (RQ2). This dataset is designed to minimize pre-training contamination risks in LLM evaluations.

File Layout

benchmark_v1/
│   ├── easy.jsonl
│   └── medium.jsonl
benchmark_v2/
│   ├── medium.jsonl
│   └── hard.jsonl
│   └── high_fidelity.jsonl  # New dataset for efficiency analysis
├── readme.md

The dataset is composed of benchmark dataset and RAG dataset, where released benchmarks are collected from open-source Github programs and we manually convert instruction into formalized and plcverif compatiable properties.

Source Code Organization

benchmark_v1/ and benchmark_v2/ show the constructed tasks for plc code generation & verification task with patterns compatiable for plcverif verification.

result/ demonstrate the experiment statistics and detailed responses.

prompts/ contain specified prompts for each agent, including agent prompt engineering ways mentioned in our paper to enhance ability of agents.

LangChain/, MetaGPT/, LLM4PLC_reproduce/ or other folders corresponding to a certain multi agent framework's contain the implementation of the framework. You are free to create one for your own task, as long as the output satisfy the verification standards. (not available)

Notice: you should adjust content in config otherwise our framework could not work!

Documents

For tools related to st compilation and smv verification not provided in src, see src/README.md. For benchmark's construction process and how to use it, see benchmark/readme.md.

Key Updates in This Release:

✅ LLM4PLC reproduction code updated: Complete implementation of the LLM4PLC[https://github.com/AICPS/LLM_4_PLC] baseline with automation script, component interfaces, and file system interactions for fair comparison.
✅ New logs in simple-demo: Updated demonstration logs showing the full multi-agent workflow trajectory, including structured context handoff between agents and asynchronous integration with formal verification environments.
✅ High-Fidelity dataset in benchmark_v2: Added 21 complex industrial tasks specifically designed for efficiency and reliability analysis (RQ2) with minimal risk of pre-training contamination.

Recommended Agent Related Tools & Works

Despite that our detailed code cannot be released due to cooperation with company, the following propose some recommended types to adjust multi-agent system, enhance ability of certain agents and reproduce experiments in out paper:

Agent	Link
MetaGPT	https://github.com/geekan/MetaGPT
LangGraph	https://github.com/langchain-ai/langgraph
ChatDev	https://github.com/OpenBMB/ChatDev
MapCoder	MapCoder: Multi-Agent Code Generation for Competitive Problem Solving

Citation

If you want to use this dataset, please cite:

@article{liu2024agents4plc,
  title={Agents4plc: Automating closed-loop plc code generation and verification in industrial control systems using llm-based agents},
  author={Liu, Zihan and Zeng, Ruinan and Wang, Dongxia and Peng, Gengyun and Wang, Jingyi and Liu, Qiang and Liu, Peiyu and Wang, Wenhai},
  journal={arXiv preprint arXiv:2410.14209},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LLM4PLC_reproduce		LLM4PLC_reproduce
benchmark_v1		benchmark_v1
benchmark_v2		benchmark_v2
evaluate		evaluate
pics		pics
simple-demo		simple-demo
src		src
.gitignore		.gitignore
LICENSE		LICENSE
config_template.py		config_template.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agents4PLC

Dataset

Purpose & Role of the Dataset

What's New in Dataset v2.0

File Layout

Source Code Organization

Documents

Key Updates in This Release:

Recommended Agent Related Tools & Works

Citation

About

Uh oh!

Releases

Packages

Languages

License

Luoji-zju/Agents4PLC_release

Folders and files

Latest commit

History

Repository files navigation

Agents4PLC

Dataset

Purpose & Role of the Dataset

What's New in Dataset v2.0

File Layout

Source Code Organization

Documents

Key Updates in This Release:

Recommended Agent Related Tools & Works

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages