Microsoft Foundry Skill

Build and deploy AI applications on Azure AI Foundry using Microsoft's model catalog and AI services

foundry-agent/agent-optimizer/references/eval-yaml.md

# eval.yaml Guidance

Create `eval.yaml` directly when the conversation or `.foundry/agent-metadata*.yaml` already selected the dataset/evaluators. Otherwise ask whether to run `azd ai agent eval generate` or let optimize use built-in defaults.

## Include

```yaml
name: <suite-or-optimization-name>
agent:
  name: <agent-name>
  kind: hosted
  version: "<agent-version>"
  model: <baseline-model-deployment-name>
  config: .agent_configs/baseline/metadata.yaml
dataset:
  local_uri: <path-to-jsonl>
  # name: <foundry-dataset-name>
  # version: "<dataset-version>"
# validation_dataset:
#   name: <validation-dataset-name>
#   version: "<validation-version>"
evaluators:
  - <evaluator-name>
  - name: <custom-evaluator-name>
    version: "<evaluator-version>"
    local_uri: <local-evaluator-json>
options:
  eval_model: <existing-chat-model-deployment-name>
  optimization_model: <allowed-optimizer-model-deployment-name>
  max_candidates: 4
  optimization_config:
    model_search_space:
      - <target-model-deployment-name>
```

Use existing model deployments for `agent.model` and `options.eval_model`; do not assume `gpt-4o`.

For `options.optimization_model`, first verify that the target Foundry project has a deployment whose name is in this allowlist:

- `GPT-5`
- `GPT-5.1`
- `GPT-5.2`
- `GPT-5.4`
- `GPT-5.5`
- `DeepSeek-V4-Pro`
- `DeepSeek-V-3.2`

If none exist, ask the user to deploy one before configuring optimization. Use `options.optimization_config.model_search_space` only for target model candidates that exist in the project; it may include the baseline model when the user wants it compared.
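
The allowlist check and the search-space rule above can be sketched as two small helpers. This is a minimal sketch, not part of azd: it assumes the project's deployment names have already been fetched into a list, that names are matched exactly (case-sensitive), and the function names `pick_optimizer_deployment` and `filter_search_space` are hypothetical.

```python
# Allowlist copied from the list above.
OPTIMIZER_ALLOWLIST = (
    "GPT-5",
    "GPT-5.1",
    "GPT-5.2",
    "GPT-5.4",
    "GPT-5.5",
    "DeepSeek-V4-Pro",
    "DeepSeek-V-3.2",
)


def pick_optimizer_deployment(deployment_names):
    """Return the first project deployment whose name is allowlisted, else None.

    Assumption: names are compared exactly (case-sensitive).
    """
    allowed = set(OPTIMIZER_ALLOWLIST)
    for name in deployment_names:
        if name in allowed:
            return name
    return None


def filter_search_space(candidates, deployment_names):
    """Keep only candidate target models that exist in the project, in order."""
    existing = set(deployment_names)
    return [name for name in candidates if name in existing]
```

A `None` result from `pick_optimizer_deployment` corresponds to the "ask the user to deploy one" branch; a non-empty result from `filter_search_space` is what would go under `options.optimization_config.model_search_space`.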

## Generate evals when inputs are missing

Prefer `eval generate` over older init flows:

```bash
azd ai agent eval generate --dataset <path-to-jsonl>
azd ai agent eval generate --reset-defaults
```

After generation, run `azd ai agent optimize --optimize-model <allowed-optimizer-model-deployment-name>` from the azd project; optimize auto-detects the generated `eval.yaml`.

## Skip

Do not add these fields unless the user explicitly asks and understands the tradeoff:

- `target_attributes`
- `budget`
- `min_improvement`
- `pass_threshold`
- `keep_versions`
- `generation_instruction`
- `max_samples`
- `trace_days`
- legacy `dataset_file`, `dataset_reference`, or `validation_reference` when writing a new file

Keep `target_attributes` omitted so azd can auto-detect optimizable attributes.
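
Before writing a new file, the skip list above can be enforced with a quick scan of the parsed config. The sketch below is illustrative only: `find_discouraged_fields` is a hypothetical helper, and because this guidance does not say where each field would sit in the file, it assumes they may appear at any nesting level and walks the whole mapping.

```python
# Field names copied from the "Skip" list above.
DISCOURAGED_FIELDS = {
    "target_attributes", "budget", "min_improvement", "pass_threshold",
    "keep_versions", "generation_instruction", "max_samples", "trace_days",
    "dataset_file", "dataset_reference", "validation_reference",
}


def find_discouraged_fields(config, path=""):
    """Return dotted paths of discouraged keys found in a parsed eval.yaml.

    Assumption: the fields may appear at any depth, so nested mappings
    and lists are searched recursively.
    """
    found = []
    if isinstance(config, dict):
        for key, value in config.items():
            here = f"{path}.{key}" if path else str(key)
            if key in DISCOURAGED_FIELDS:
                found.append(here)
            found.extend(find_discouraged_fields(value, here))
    elif isinstance(config, list):
        for index, item in enumerate(config):
            found.extend(find_discouraged_fields(item, f"{path}[{index}]"))
    return found
```

An empty result means the draft stays within the recommended fields; any returned path is a prompt to confirm with the user that the field is wanted.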

## Source mapping

| Source | eval.yaml field |
|--------|-----------------|
| effective azd context | `agent.name`, `agent.version`, `agent.kind` |
| baseline config | `agent.model`, `agent.config` |
| selected local dataset JSONL | `dataset.local_uri` |
| selected remote/local dataset | `dataset.name`, `dataset.version`, `dataset.local_uri` |
| selected validation dataset | `validation_dataset` |
| selected Foundry/local evaluators | `evaluators[]` |
| selected judge/eval deployment | `options.eval_model` |
| selected optimizer deployment | `options.optimization_model` |
| selected target model candidates | `options.optimization_config.model_search_space` |

Treat older `dataset_file`, `dataset_reference`, `validation_reference`, `max_iterations`, and `optimization_config.model` as legacy inputs when reading existing files, but write new files with the current contract above.
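
The mapping table can be read as a recipe for assembling the file from already-selected inputs. The following is a rough sketch under stated assumptions: `build_eval_config` and its parameters are hypothetical, it covers only the local-dataset case, and it returns a plain dict in the layout shown under "Include". Serializing that dict to YAML is left out because it needs a third-party library such as PyYAML.

```python
def build_eval_config(name, agent, dataset_path, evaluators, eval_model,
                      optimization_model, search_space=None, max_candidates=4):
    """Assemble a dict mirroring the eval.yaml layout shown under "Include".

    `agent` is a mapping carrying the name, kind, version, model, and config
    values taken from the azd context and the baseline config.
    """
    options = {
        "eval_model": eval_model,
        "optimization_model": optimization_model,
        "max_candidates": max_candidates,
    }
    # Only emit the search space when target model candidates were selected.
    if search_space:
        options["optimization_config"] = {
            "model_search_space": list(search_space),
        }
    return {
        "name": name,
        "agent": dict(agent),
        "dataset": {"local_uri": dataset_path},
        "evaluators": list(evaluators),
        "options": options,
    }
```

Leaving `search_space` empty omits `optimization_config` entirely, which keeps the output free of fields the user did not select.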