Update README.md
README.md
@@ -225,6 +225,7 @@ Below we report the evaluation results for K2-V2 after supervised fine-tuning (S
 | **MBPP** | 71.0 | 75.8 | 84.8 | 87.6 | 91.6 | 82.8 | 83.8 | 96.2 | 80.0 |
 | **HumanEval** | 82.3 | 91.5 | 91.5 | 96.3 | 96.3 | 97.6 | 89.6 | 94.5 | 85.4 |
 | **LCBv6** | 39.9 | 51.3 | 67.0 | 67.9 | 67.6 | 67.8 | 79.2 | 72.8 | 36.7 |
+| **IFEVAL** | 73.2 | 82.7 | 89.6 | 80.1 | 88.7 | 88.7 | 89.6 | 88.7 | 85.7 |
 
 Please refer to our [Tech Report](https://www.llm360.ai/reports/K2_V2_report.pdf) for detailed evaluation results.
 