Core Concepts

  • Agents
    Understand Agent CI's core philosophy: agents are software applications, not ML models. Learn about multi-prompt architecture, Git-native versioning, and production deployment strategies.

  • Evaluations (Evals)
    Automated tests that validate agent behavior and performance. Learn about accuracy, performance, safety, consistency, and LLM-as-judge evaluation types for comprehensive agent testing.

  • Continuous Integration (CI/CD)
    Apply software engineering best practices to agent development. Learn how Agent CI automates testing, validation, and deployment through Git-based CI/CD pipelines.