Test Time Compute in Agents

Tree search, self-reflection, retry budgets, and allocating compute across agent steps.