Are you sure you want to reset the application? This will delete all test runs and results. This action cannot be undone.
[[Math.round(selectedRun.summary.success_rate * 100)]]%
[[selectedRun.summary.total_tests]]
[[selectedRun.summary.successful_tests]]
[[formatCompactRuntime(selectedRun.summary.runtime_seconds)]]
[[selectedRun.summary.score_average ? selectedRun.summary.score_average.toFixed(2) : 'N/A']]
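The summary fields above could be computed in one place rather than inline in the template. The sketch below is illustrative only: `formatSummary` is a hypothetical name, and this `formatCompactRuntime` is a guess at what that helper does, since its real implementation is not shown here.

```javascript
// Hypothetical stand-in: the real formatCompactRuntime is not shown in the
// source, so this output format (e.g. "2m 5s") is an assumption.
function formatCompactRuntime(seconds) {
  if (!Number.isFinite(seconds) || seconds < 0) return 'N/A';
  const total = Math.round(seconds);
  const h = Math.floor(total / 3600);
  const m = Math.floor((total % 3600) / 60);
  const s = total % 60;
  if (h > 0) return `${h}h ${m}m`;
  if (m > 0) return `${m}m ${s}s`;
  return `${s}s`;
}

// Hypothetical helper that mirrors the template expressions one by one.
function formatSummary(summary) {
  return {
    successRate: `${Math.round(summary.success_rate * 100)}%`,
    totalTests: summary.total_tests,
    successfulTests: summary.successful_tests,
    runtime: formatCompactRuntime(summary.runtime_seconds),
    // Same truthiness check as the template: a missing average and an
    // average of exactly 0 both render as 'N/A'.
    scoreAverage: summary.score_average
      ? summary.score_average.toFixed(2)
      : 'N/A',
  };
}
```

Note that the truthiness check on `score_average` treats an average of 0 the same as a missing value; a `!= null` check would distinguish the two if that matters for agent-mode scoring.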
| Test Name | Category | Status | Expected | Actual |
|---|---|---|---|---|
| [[result.test_name || 'Unnamed Test']] | [[result.category || 'No Category']] | [[ result.success ? 'Pass' : 'Fail' ]] | [[result.validation.expected]] [[result.validation.expected.join(', ')]] [[JSON.stringify(result.validation.expected)]] No validation data | [[result.output || 'No output']] |
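The Expected cell in the row above appears to hold several alternative renderings of the same value (plain, comma-joined, JSON, and a fallback message). A minimal sketch of how one function might choose among them follows. `formatExpected` is a hypothetical name, and the branching conditions are assumptions, since the template's own conditions are not visible here.

```javascript
// Hypothetical helper: picks one rendering for the "Expected" column.
// The order of the checks is an assumption, not taken from the template.
function formatExpected(validation) {
  if (!validation || validation.expected === undefined || validation.expected === null) {
    return 'No validation data';
  }
  const expected = validation.expected;
  // Arrays are shown as a comma-separated list.
  if (Array.isArray(expected)) return expected.join(', ');
  // Other objects are serialized so nested fields stay visible.
  if (typeof expected === 'object') return JSON.stringify(expected);
  // Strings, numbers, and booleans are shown as-is.
  return String(expected);
}
```

Guarding against a missing `validation` object first also avoids a `TypeError` when a result carries no validation data at all.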
[[result.category]]
[[selectedRun.results[selectedTestIndex].input]]
Size: [[ model.model_info.model_metadata.size ]]
Quantization: [[ model.model_info.model_metadata.quantization ]]
Standard mode for testing exact matches or content validation
Agent mode for evaluating support ticket resolution workflows and scoring
Adjust response randomness (0 = deterministic, 1 = creative)
Upload one or more JSON test suite files
[[ file.testCount ]] test(s)
System prompt: [[ file.systemPrompt.length > 100 ? file.systemPrompt.substring(0, 100) + '...' : file.systemPrompt ]]
No test cases yet. Click "Add Test Case" to create one.