Commit Graph

30 Commits

Author SHA1 Message Date
jyong
a6209a27db Merge remote-tracking branch 'origin/feat/evaluation' into feat/evaluation 2026-03-17 18:36:11 +08:00
jyong
6d836e798b evaluation runtime 2026-03-17 18:35:58 +08:00
FFXN
e6e668d1d9 feat: Adapt EvaluationMetricName. 2026-03-17 16:08:57 +08:00
jyong
f692def738 evaluation runtime 2026-03-17 15:26:39 +08:00
jyong
f81bcf53e3 evaluation runtime 2026-03-16 18:08:46 +08:00
jyong
f60084fc43 Merge remote-tracking branch 'origin/feat/evaluation' into feat/evaluation 2026-03-13 16:55:19 +08:00
jyong
2ed0805c13 evaluation runtime 2026-03-13 16:54:23 +08:00
FFXN
c51f3219aa Merge remote-tracking branch 'origin/feat/evaluation' into feat/evaluation 2026-03-13 10:11:14 +08:00
FFXN
c68194093e feat: Parse the expression to get the input parameters for the evaluation workflow. 2026-03-13 10:09:38 +08:00
FFXN
18198b88ff feat: Parse the expression to get the input parameters for the evaluation workflow. 2026-03-13 09:45:13 +08:00
jyong
c0fac68f2d evaluation runtime 2026-03-12 17:21:57 +08:00
jyong
08c5200aa1 evaluation runtime 2026-03-12 17:21:46 +08:00
jyong
4555c98d30 evaluation runtime 2026-03-12 16:24:39 +08:00
jyong
1d248053e6 evaluation runtime 2026-03-12 14:32:36 +08:00
jyong
8ea3729fe9 evaluation runtime 2026-03-11 19:57:46 +08:00
jyong
a83a28bf70 evaluation runtime 2026-03-11 17:31:11 +08:00
jyong
2bd48e62a3 evaluation runtime 2026-03-10 17:37:28 +08:00
jyong
7a065b3f42 evaluation runtime 2026-03-10 17:37:20 +08:00
jyong
dabad46393 evaluation runtime 2026-03-09 15:56:03 +08:00
jyong
2b3f5adfab Merge remote-tracking branch 'origin/feat/evaluation' into feat/evaluation 2026-03-09 15:17:50 +08:00
jyong
2ffd7e519f evaluation runtime 2026-03-09 15:17:35 +08:00
FFXN
9340ee8af4 feat: Implement snippet_generate_service.py. 2026-03-06 14:28:08 +08:00
FFXN
b160dce4db feat: Implement customized evaluation in BaseEvaluationInstance. 2026-03-05 14:30:39 +08:00
FFXN
7149af3dac Merge remote-tracking branch 'origin/feat/evaluation' into feat/evaluation 2026-03-05 13:38:35 +08:00
FFXN
99d3c645b8 feat: Implement customized evaluation in BaseEvaluationInstance. 2026-03-05 13:36:05 +08:00
FFXN
ce0c2ea3bd feat: Implement customized evaluation in BaseEvaluationInstance. 2026-03-05 13:30:26 +08:00
jyong
13c0d6eddb evaluation runtime 2026-03-04 19:20:08 +08:00
jyong
4e593df662 evaluation runtime 2026-03-04 18:43:58 +08:00
FFXN
7251bffae1 feat: implement customized evaluation with workflow, and add judgment condition after evaluate_metrics. 2026-03-04 14:46:24 +08:00
jyong
a3cf1a18a3 evaluation runtime 2026-03-03 16:01:13 +08:00