The performance metrics of the code LLM

![Image](https://github.com/user-attachments/assets/d2e31731-d7bb-4e83-9f1f-7b0e92d181f9)
Hi authors, the performance metrics of the code LLM are not explicitly mentioned in the paper. Is the metric the score? And how is it calculated?