Skip to content

Conversation

@Viktor286
Copy link

@Viktor286 Viktor286 commented Jul 27, 2025

This addition proposes to start adding evaluation tests (evals) to be part of the context of agent app.

General concept:

The main idea of the PR is that evals convey context specific info that corresponds to the alignment of the agent.

Referencing the original 12-factor app, testing is still implicitly supported and encouraged through several of the factors:

Factor 10: Dev/Prod Parity
This encourages keeping development, staging, and production environments as similar as possible. It implies that automated tests should run in environments that closely resemble production.

Factor 5: Build, Release, Run
Since the build phase includes compiling code and running tests, a solid CI/CD pipeline that enforces testing fits naturally here.

Factor 12: Admin Processes
You could technically run test scripts as one-off admin processes.

@CLAassistant
Copy link

CLAassistant commented Jul 27, 2025

CLA assistant check
All committers have signed the CLA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants