Keep this to three required sources. The optional refresher is only for resetting API mechanics before the drill.
Required
A direct engineering treatment of context engineering as the next step beyond prompt engineering, with practical framing for just-in-time context, memory, selection, compression, and isolation.
Read for: the vocabulary and architecture tradeoffs behind reliable agent context assembly.
Required
The current OpenAI guide to reducing context size in long-running interactions while carrying forward the state needed for future turns.
Read for: compression policy, state continuity, context thresholds, and latency control in extended workflows.
Required
A detailed source on cacheable prompt prefixes, cache breakpoints, multi-turn caching, TTL choices, invalidation, and how prompt structure affects cost.
Read for: cache-aware prompt layout and the operational cost model of large static context.
Optional refresher
A short reset on persistent conversation state, previous response chaining, and when the platform can carry context instead of your app resending the full transcript.
Use only if: stateful conversation mechanics feel fuzzy before the drill.