Core Concepts

Agents
Understand Agent CI's core philosophy: agents are software applications, not ML models. Learn about multi-prompt architecture, Git-native versioning, and production deployment strategies.
Evaluations (Evals)
Automated tests that validate agent behavior and performance. Learn about accuracy, performance, safety, consistency, and LLM-as-judge evaluation types for comprehensive agent testing.
Continuous Integration (CI/CD)
Apply software engineering best practices to agent development. Learn how Agent CI automates testing, validation, and deployment through Git-based CI/CD pipelines.