Appendix A — Prerequisites and Further Reading

This appendix is intentionally outward-pointing. The core book teaches how to make an agentic system observable, evaluable, policy-bound, and shippable. It deliberately does not survey the full taxonomy of agent techniques, and it does not tutorialize any particular framework. Those topics are real, but they are prerequisites or companions, not the book’s subject.

Two practical reasons keep them out of the core chapters. First, the book’s value is a narrow thesis: deterministic baseline, evidence, policy, gates. Folding in technique surveys and framework walkthroughs would blur that thesis. Second, technique and framework material ages at a different rate than the engineering discipline here. Frameworks consolidate and rename on short cycles, so this appendix favors primary sources and names them as moving targets. When a link here disagrees with a vendor’s current documentation, trust the vendor.

Assumed Reader Baseline

The README states the reader is highly technical. Made explicit, the book assumes you can already:

read and write Python and reason about software tradeoffs;
use tests, version control, and basic observability;
make a single LLM call and a tool/function call, and read the response.

If any of those is shaky, cover it before Chapter 1. The chapters below assume the LLM and tool-calling baseline in particular.

Track A: Agent Techniques (Concepts and Papers)

Read in this order. One line each on what it gives you and which chapter it back-fills.

Anthropic, Building Effective Agents (Anthropic 2024). The best single on-ramp, and aligned with this book: workflows before agents, simplest thing that works. Back-fills Chapters 1–2.
Weng, LLM Powered Autonomous Agents (Weng 2023). The canonical overview tying planning, memory, and tool use into one mental model. Read it as a map.
ReAct (Yao et al. 2022). Interleaved reasoning and acting; the ancestor of most agent loops. Back-fills the runtime in Chapter 5.
Chain-of-Thought (Wei et al. 2022). The reasoning primitive the later patterns build on.
Tree of Thoughts (Yao et al. 2023). Deliberate search over reasoning steps.
Reflexion (Shinn et al. 2023). Self-critique plus memory; the “agent improves itself” pattern.
RAG (Lewis et al. 2020). Retrieval-augmented generation, the concept the book’s context chapters assume you already understand. Back-fills Chapters 4 and 11.
MemGPT (Packer et al. 2023). How a persistent memory architecture actually works. Fills the long-term memory gap the book deliberately leaves open in Chapter 4.

Track B: Framework Grounding

Go deep on one framework, then know what the others are for. Do not survey all of them. The goal is to pattern-match this book’s abstractions onto code you will actually write.

LangGraph (LangChain 2025). The recommended first framework here because it maps most directly onto the book’s worldview: explicit graph, durable state, traceable nodes, inspectable control flow. Learning it makes the trace/eval/gate abstractions concrete.
Claude Agent SDK (Anthropic 2025a). Strong for Anthropic-native production agents with native tool use and memory.
OpenAI Agents SDK (OpenAI 2025a). Useful for OpenAI-native agent apps when your application owns orchestration, tool execution, approvals, state, tracing, and eval loops.
LlamaIndex (LlamaIndex 2025). Learn the retrieval slice when you hit the RAG gap; strongest when the agent’s main job is reasoning over your indexed data.
Microsoft Agent Framework (Microsoft 2026b). Microsoft positions it as the successor/consolidation of AutoGen and Semantic Kernel; relevant mainly on .NET or enterprise stacks. AutoGen is now in maintenance mode (Microsoft 2026a), so learn the successor rather than the predecessor.
Hugging Face Agents Course (Hugging Face 2025). Guided hands-on if you prefer a course to docs-spelunking.

Model-Specific Behavior

The book stays single-model and deterministic by design, so it does not compare how different models behave as agents. To feel those differences, read the two major providers’ tool-use guides side by side and run the same small task against each:

Anthropic tool use (Anthropic 2025b);
OpenAI tools (OpenAI 2025b).

Instruction-following, tool-call reliability, and context utilization differ enough between models that an agent design tuned for one can underperform on another. Treat that variance as a system property, not a model defect.

One-Paragraph Reading Path

Read Anthropic’s Building Effective Agents, then Weng’s overview, then skim ReAct, Chain-of-Thought, Tree of Thoughts, and Reflexion. Add the RAG paper and MemGPT for context and memory. Build one small thing in LangGraph. Read the Anthropic and OpenAI tool-use docs side by side. Then read this book to learn how to make any of it observable, evaluated, and shippable.

Takeaways

These topics are prerequisites and companions, not gaps in the core thesis.
Learn agent techniques from primary papers and one strong overview, in order.
Go deep on one framework, and prefer current vendor docs over any snapshot.
Treat cross-model behavior as a system property you must measure.

Anthropic. 2024. Building Effective Agents. https://www.anthropic.com/engineering/building-effective-agents.

Anthropic. 2025a. Claude Agent SDK. https://code.claude.com/docs/en/agent-sdk/overview.

Anthropic. 2025b. Tool Use with Claude. https://platform.claude.com/docs/en/agents-and-tools/tool-use/overview.

Hugging Face. 2025. Agents Course. https://huggingface.co/learn/agents-course/en.

LangChain. 2025. LangGraph Documentation. https://docs.langchain.com/oss/python/langgraph.

Lewis, Patrick, Ethan Perez, Aleksandra Piktus, et al. 2020. “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.” arXiv Preprint arXiv:2005.11401. https://arxiv.org/abs/2005.11401.

LlamaIndex. 2025. LlamaIndex Documentation. https://developers.llamaindex.ai/python/framework/.

Microsoft. 2026a. AutoGen. https://github.com/microsoft/autogen.

Microsoft. 2026b. Microsoft Agent Framework Overview. https://learn.microsoft.com/en-us/agent-framework/overview/.

OpenAI. 2025a. Agents SDK. https://developers.openai.com/api/docs/guides/agents.

OpenAI. 2025b. Using Tools. https://developers.openai.com/api/docs/guides/tools.

Packer, Charles, Sarah Wooders, Kevin Lin, et al. 2023. “MemGPT: Towards LLMs as Operating Systems.” arXiv Preprint arXiv:2310.08560. https://arxiv.org/abs/2310.08560.

Shinn, Noah, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, and Shunyu Yao. 2023. “Reflexion: Language Agents with Verbal Reinforcement Learning.” arXiv Preprint arXiv:2303.11366. https://arxiv.org/abs/2303.11366.

Wei, Jason, Xuezhi Wang, Dale Schuurmans, et al. 2022. “Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.” arXiv Preprint arXiv:2201.11903. https://arxiv.org/abs/2201.11903.

Weng, Lilian. 2023. LLM Powered Autonomous Agents. Blog post. https://lilianweng.github.io/posts/2023-06-23-agent/.

Yao, Shunyu, Dian Yu, Jeffrey Zhao, et al. 2023. “Tree of Thoughts: Deliberate Problem Solving with Large Language Models.” arXiv Preprint arXiv:2305.10601. https://arxiv.org/abs/2305.10601.

Yao, Shunyu, Jeffrey Zhao, Dian Yu, et al. 2022. “ReAct: Synergizing Reasoning and Acting in Language Models.” arXiv Preprint arXiv:2210.03629. https://arxiv.org/abs/2210.03629.