<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Agent design on core-agent</title><link>https://go-steer.github.io/core-agent/docs/agent-design/</link><description>Recent content in Agent design on core-agent</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://go-steer.github.io/core-agent/docs/agent-design/index.xml" rel="self" type="application/rss+xml"/><item><title>System instructions</title><link>https://go-steer.github.io/core-agent/docs/agent-design/system-instructions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://go-steer.github.io/core-agent/docs/agent-design/system-instructions/</guid><description>&lt;p&gt;&lt;code&gt;AGENTS.md&lt;/code&gt; is the most-impactful customization surface in &lt;code&gt;core-agent&lt;/code&gt;. The model sees it on every turn. Get it right and the agent behaves consistently across operators and across sessions; get it wrong and the agent acts unpredictably regardless of what skills or tools you wire up.&lt;/p&gt;
&lt;p&gt;This page covers the patterns that work, the failure modes to watch for, and how to iterate on an &lt;code&gt;AGENTS.md&lt;/code&gt; instead of writing it once and hoping.&lt;/p&gt;</description></item><item><title>Skills</title><link>https://go-steer.github.io/core-agent/docs/agent-design/skills/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://go-steer.github.io/core-agent/docs/agent-design/skills/</guid><description>&lt;p&gt;Skills are named, reusable procedures the agent invokes when the task matches the skill&amp;rsquo;s description. This page covers the design patterns: when to write a skill vs. an &lt;code&gt;AGENTS.md&lt;/code&gt; rule, how to write a description that actually triggers, what belongs in the body vs. a &lt;code&gt;references/&lt;/code&gt; file, and how to test that a skill fires when you expect it to.&lt;/p&gt;
&lt;p&gt;For the schema and discovery details (file locations, YAML frontmatter, allow/deny lists), see &lt;a href="https://go-steer.github.io/core-agent/docs/reference/skills/"&gt;Reference → Skills&lt;/a&gt;. This page is the prescriptive companion.&lt;/p&gt;</description></item><item><title>Subagents and wrappers</title><link>https://go-steer.github.io/core-agent/docs/agent-design/subagents-and-wrappers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://go-steer.github.io/core-agent/docs/agent-design/subagents-and-wrappers/</guid><description>&lt;p&gt;Two ways to push work off the parent agent:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Agentic tool wrappers&lt;/strong&gt; (&lt;code&gt;agentic_read_file&lt;/code&gt;, &lt;code&gt;agentic_grep&lt;/code&gt;, &lt;code&gt;agentic_research&lt;/code&gt;, &lt;code&gt;agentic_fetch_url&lt;/code&gt;) — synchronous, bounded, single-purpose. The parent calls them like any other tool; under the hood they spawn a focused subtask on a (typically cheaper) model and return only the digest. Raw tool output never enters the parent&amp;rsquo;s context.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Background subagents&lt;/strong&gt; (&lt;code&gt;spawn_agent&lt;/code&gt;, &lt;code&gt;list_agents&lt;/code&gt;, &lt;code&gt;check_agent&lt;/code&gt;, &lt;code&gt;stop_agent&lt;/code&gt;) — asynchronous, longer-running, multi-turn. The parent dispatches a goal; the subagent works in its own session until done; alerts and completion summaries land back in the parent&amp;rsquo;s chat.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This page covers when to use each, how to actually get the model to use them (the model-side adoption story is non-trivial), and the failure modes worth designing around.&lt;/p&gt;</description></item><item><title>Cost efficiency</title><link>https://go-steer.github.io/core-agent/docs/agent-design/cost-efficiency/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://go-steer.github.io/core-agent/docs/agent-design/cost-efficiency/</guid><description>&lt;p&gt;What actually moves the cost needle on &lt;code&gt;core-agent&lt;/code&gt; sessions, in rough order of impact. Built-in price tracking and per-model breakdowns make the tradeoffs measurable rather than guesswork; this page covers how to read those signals and the patterns that consistently reduce cost without sacrificing capability.&lt;/p&gt;
&lt;p&gt;For the mechanism details see &lt;a href="https://go-steer.github.io/core-agent/docs/reference/context-management/"&gt;Context management&lt;/a&gt; and &lt;a href="https://go-steer.github.io/core-agent/docs/reference/configuration/"&gt;Configuration → pricing&lt;/a&gt;.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="the-cost-hierarchy"&gt;The cost hierarchy&lt;/h2&gt;
&lt;p&gt;A rough sense of where dollars go on a typical coding session:&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Driver&lt;/th&gt;
 &lt;th&gt;Typical share of session cost&lt;/th&gt;
 &lt;th&gt;Lever&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Input tokens (cumulative across turns)&lt;/td&gt;
 &lt;td&gt;70-90%&lt;/td&gt;
 &lt;td&gt;Compaction, checkpoints, agentic wrappers&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Output tokens&lt;/td&gt;
 &lt;td&gt;10-25%&lt;/td&gt;
 &lt;td&gt;Tighter &lt;code&gt;AGENTS.md&lt;/code&gt; (&amp;ldquo;be concise&amp;rdquo;), structured output formats&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Per-turn model rate&lt;/td&gt;
 &lt;td&gt;Multiplier on both&lt;/td&gt;
 &lt;td&gt;Model selection (Pro vs Flash for subtasks)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Subtask + compaction LLM calls&lt;/td&gt;
 &lt;td&gt;2-10%&lt;/td&gt;
 &lt;td&gt;Bounded by infrastructure, not operator-tunable&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;&lt;strong&gt;The input-token share is the killer.&lt;/strong&gt; Every turn re-sends the entire conversation history to the model. By turn 30, you&amp;rsquo;re paying input-token prices for everything that came before, plus the new turn. The single biggest lever on cost is preventing that input-token total from growing without bound — which is exactly what compaction, checkpoints, and agentic wrappers do.&lt;/p&gt;</description></item></channel></rss>