SeedBench is a benchmark suite for the seed/agriculture industry's LLM (Large Language Model) evaluation. It is designed to test the model's performance at two stages: pretraining and SFT (Supervised Fine-Tuning).
-
Notifications
You must be signed in to change notification settings - Fork 3
[ACL 2025 main] SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Scienceš¾
License
open-sciencelab/SeedBench
Ā
Ā
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Ā | Ā | |||
Ā | Ā | |||
Ā | Ā | |||
Ā | Ā | |||
Repository files navigation
About
[ACL 2025 main] SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Scienceš¾