A practitioner's reference for AI agent architecture and engineering patterns.
Open models reaching agent parity, task-specific harness engineering, and trace-driven fine-tuning are merging what used to be separate concerns into a single iterative loop — with major implications for how teams build and operate agents.
How sequenced specialist agents with defined handoff contracts and backward feedback loops produce more reliable results than flat swarms or orchestrator/worker splits.
Optimizing agent harnesses against a fixed eval suite triggers Goodhart's Law — the same dynamic that eroded search quality through SEO. How adversarial eval co-evolution can help.
Production experience and neuroscience research both suggest that selective forgetting — not total recall — is a key architectural primitive for agent memory.
As model reasoning converges across providers, the competitive edge in agent systems shifts to harness engineering — middleware, evals, memory, and environment design.
In a single week, sandboxes, subagents, deployment CLIs, and control planes all shipped across major platforms — tracing the shape of a full managed runtime.
Sandboxes, subagents, and deploy CLIs are converging into a recognizable runtime stack for coding agents. A look at how the layers are forming.
Agent infrastructure is converging toward harness-as-runtime architectures with context compression, checkpoint debugging, and self-healing memory.
How the Model Context Protocol enables decomposing monolithic agent frameworks into composable, replaceable services.
How token economics — compound costs from multi-step reasoning, tool loops, and retry cascades — shape architectural decisions as agents move to production.
Graceful degradation patterns — circuit breakers, fallback chains, partial completion — and why they matter more than model capability for production agents.
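The degradation patterns named above compose naturally: a breaker per provider, a fallback chain across providers, and partial completion as the floor. A minimal sketch, assuming nothing about any particular framework (`CircuitBreaker`, `call_with_fallbacks`, and the provider tuples are illustrative names, not a real library's API):

```python
import time

class CircuitBreaker:
    """Trips open after repeated failures; probes again after a cooldown."""
    def __init__(self, max_failures=3, reset_after=30.0):
        self.max_failures = max_failures
        self.reset_after = reset_after
        self.failures = 0
        self.opened_at = None

    def allow(self):
        if self.opened_at is None:
            return True
        if time.monotonic() - self.opened_at >= self.reset_after:
            self.opened_at = None  # half-open: let one probe through
            self.failures = self.max_failures - 1
            return True
        return False

    def record(self, ok):
        if ok:
            self.failures = 0
            self.opened_at = None
        else:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = time.monotonic()

def call_with_fallbacks(task, providers, breakers):
    """Try providers in order, skipping any whose breaker is open;
    degrade to partial completion instead of crashing."""
    for name, fn in providers:
        breaker = breakers[name]
        if not breaker.allow():
            continue
        try:
            result = fn(task)
            breaker.record(True)
            return result
        except Exception:
            breaker.record(False)
    return {"status": "partial", "result": None}
```

The key design choice is that the breaker state lives outside any single request, so one slow or failing provider stops consuming the agent's retry budget across the whole session.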
How making retrieval quality assessment an explicit agent action — rather than an implicit assumption — improves multi-hop reasoning and enables process-level reward shaping for RAG agents.
How to intercept and verify agent actions before they execute, reducing harmful outputs without blocking the agent's operational loop.
A practical guide to choosing between hierarchical, adversarial, and collaborative multi-agent LLM topologies, with engineering tradeoffs drawn from diagnostic accuracy benchmarks.
How sensitive information compounds across agent hops in sequential LLM pipelines, and what engineers can do to measure and control it.
How to expose formal PDDL planning operations as LLM tool calls through MCP, giving agents a structured, verifiable planning substrate for complex multi-step tasks.
How coding agents drift away from explicit system-prompt constraints over time, why value conflicts accelerate that drift, and what engineers can do about it.
How autonomous LLM agent populations develop spontaneous role specialization, communication norms, and coordination patterns without centralized orchestration.
How errors propagate through LLM-based multi-agent pipelines, the vulnerability classes that amplify them, and governance patterns engineers can use to contain the damage.
How a supervisor-worker hierarchy combined with stateful skill graphs and human-in-the-loop checkpoints produces agents that are both flexible and trustworthy in production.
How language models systematically evaluate their own outputs as safer and more correct than identical outputs from users — and what this means for agent self-monitoring.
How to architect multi-agent systems with role-based tool isolation, governed supervisor-worker hierarchies, and composable stateful skill graphs for reliable, auditable task execution.
How meta-reinforcement learning frameworks teach LLM agents to balance trying new strategies against exploiting what already works across multiple interaction episodes.
How to protect shared agent memory from poisoning attacks using Bayesian trust models, local-first storage, and adaptive ranking.
How iterative failure analysis and Pareto-frontier selection can automatically grow and prune an agent's skill library without human curation.
How to architect multi-agent systems with a dedicated Safety Oracle that enforces explicit risk policies independent of the decision-making agent.
How to use lifecycle hooks to inspect, modify, and gate agent behavior at precise points during execution — from tool calls to session boundaries.
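The hook pattern that teaser describes is small enough to sketch directly. A toy illustration, with hypothetical names throughout (`HookedAgent`, `pre_tool_hooks`) rather than any specific framework's hook API:

```python
class HookedAgent:
    """Minimal agent loop with pre/post tool-call hooks that can
    inspect, rewrite, or block an action before it executes."""
    def __init__(self, tools):
        self.tools = tools
        self.pre_tool_hooks = []   # fn(name, args) -> args, or None to block
        self.post_tool_hooks = []  # fn(name, args, result) -> result

    def call_tool(self, name, args):
        for hook in self.pre_tool_hooks:
            args = hook(name, args)
            if args is None:
                return {"blocked": True, "tool": name}
        result = self.tools[name](**args)
        for hook in self.post_tool_hooks:
            result = hook(name, args, result)
        return result
```

A gating hook then becomes a plain function, e.g. one that denies shell commands touching `/etc`, registered with `agent.pre_tool_hooks.append(deny_etc)`.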
How separating environment learning from task execution lets agents replace O(N) step-by-step reasoning with O(1) program synthesis over a persistent state-machine graph.
How integrating Theory of Mind and BDI-style belief structures into multi-agent LLM architectures enables agents to reason about each other's mental states and coordinate more reliably.
How to model multi-step LLM agent pipelines as noisy processes and apply progressive denoising — uncertainty sensing, compute regulation, and root-cause correction — to build more reliable workflows.
A practitioner's guide to designing, scaling, and evolving the tool sets that define what AI agents can do — drawing on production lessons from Claude Code, research on tool scaling limits, and emerging patterns like Tool RAG and progressive disclosure.
How to architect a production deep-research multi-agent system using a planner, parallel task workers, and a context-aware observer — with structured output and progressive content retrieval.
How AI agents are becoming first-class participants on the internet — browsing autonomously, transacting on behalf of users, and communicating with other agents through emerging protocols and standards.
The two-tier architecture for one-person engineering teams: an AI orchestrator with business context managing a fleet of specialized coding agents.
How to design multi-agent research systems using a planner that generates dynamic parallel tasks and an observer that maintains global context across all agents.
How to architect autonomous AI agents that find, validate, and triage security vulnerabilities in real codebases using sandboxed tool access and multi-stage reasoning.
How agents use code execution to filter retrieved web content before it enters the context window, improving accuracy and reducing token costs.
How agents can execute tool calls inside a sandboxed code environment to reduce round-trip latency and token overhead in multi-step workflows.
How coding agents automate the entire LLM fine-tuning workflow from GPU selection to model deployment using natural language instructions.
Beyond simple retrieve-then-generate: intelligent agents that decide when, what, and how to retrieve, then critique and correct their own retrieval.
How AI agents improve over time without retraining: token-space learning from successful trajectories, Reflexion self-critique, and self-evolving architectures.
Patterns and frameworks for coordinating multiple specialized AI agents including supervisor, peer-to-peer, debate, and mixture of experts.
A filesystem-based approach to tool management that achieves 98% token savings by loading tool definitions on-demand rather than sending all tools on every request.
LangChain's coding agent vaulted from outside the Top 30 to the Top 5 on Terminal Bench 2.0 by engineering the scaffolding, not the AI.
How to build AI agents that persist their memory, move across machines, and maintain context regardless of where they execute — covering state serialization, remote sandboxing, and human-in-the-loop approval patterns.
Exploring the idea of putting business requirements, architecture diagrams, and domain models in Git — and how this could enable agentic pipelines from requirement change to deployed code.
Why filesystem-backed, version-controlled memory is replacing traditional memory tools — and what it means for building stateful agents that actually learn.
How performance degrades even within supported context limits, and practical strategies to detect, measure, and mitigate these failure modes.
The discipline of optimizing what enters the context window — a key skill, alongside prompt engineering, for practitioners building reliable agents.
Reduce inference costs by 90% and time-to-first-token by 80% by reusing computed attention states across requests with identical prefixes.
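The idea is easiest to see stripped of the transformer internals: key a cache on the shared prefix so only the suffix costs new work. A toy illustration of the bookkeeping (real prefix caching reuses KV attention states inside the serving stack; the class and method names here are hypothetical):

```python
import hashlib

class PrefixCache:
    """Toy model of prefix reuse: cache 'computed state' keyed by a hash
    of the shared prompt prefix, so repeated requests with the same
    system prompt skip the expensive prefill step."""
    def __init__(self):
        self.store = {}
        self.hits = 0
        self.misses = 0

    def state_for(self, prefix, compute):
        key = hashlib.sha256(prefix.encode()).hexdigest()
        if key in self.store:
            self.hits += 1
        else:
            self.misses += 1
            self.store[key] = compute(prefix)  # expensive only on a miss
        return self.store[key]
```

This is also why prompt layout matters: stable content (system prompt, tool schemas) should come first and volatile content last, or every request hashes to a different prefix and the cache never hits.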
How to evaluate entire agentic systems—framework, model, and orchestration together—rather than treating model choice as the only variable that matters.
How graph-based environment evolution frameworks let you build agent benchmarks that stay challenging and realistic as the underlying world changes.
Reasoning models show much weaker control over their chains of thought than over their final outputs—undermining the assumption that CoT traces reliably reflect what a model is doing.
How to build document-level factuality verification agents for deep research outputs, and why benchmark labels need to be explicitly revisable.
How to treat agent selection as a recommendation problem, and what engineers need to know to build systems that route tasks to the right LLM agent automatically.
How to frame agent selection as a structured recommendation problem, and what a rigorous benchmark for that task looks like.
How to design evaluation suites, run benchmarks, and tune trigger descriptions to keep agent skills working correctly as models and workflows evolve.
A practical framework for isolating whether your memory-augmented agent is failing at retrieval or at using what it retrieves — and what to do about it.
How Social Perception-Driven Data Generation creates more realistic and challenging benchmarks for agentic systems by grounding tasks in actual user needs.
How to apply behavioral fingerprinting and statistical decision procedures to catch workflow regressions in AI agents without burning your token budget.
How to instrument, trace, and evaluate AI agents running in production, where non-deterministic behavior and infinite input spaces make traditional APM tools insufficient.
A framework of twelve metrics across four dimensions — consistency, robustness, predictability, and safety — for evaluating how AI agents actually behave in production.
How to design layered evaluation strategies for long-horizon AI agents using single-step interrupts, full-turn assertions, and multi-turn simulations.
How the Communication-Reasoning Gap exposes a critical failure mode in multi-agent LLM systems — and what engineers can do about it.
How to apply a Summarize–Identify–Report pipeline with specialized sub-agents to compress, diagnose, and act on agentic execution traces at scale.
How to apply automated hierarchical clustering and LLM-driven summarization to production agent traces to surface failure modes, usage patterns, and behavioral trends without manual review.
A look at Context-Bench, Letta's benchmark for measuring how well language models perform context engineering tasks including filesystem traversal and dynamic skill loading.
Measuring agent performance across component accuracy, task completion, trajectory quality, and system-level metrics with benchmarks and LLM-as-judge.
A practical guide to designing agents that return typed, validated structured data using provider-native and tool-calling strategies.
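Whatever strategy produces the output, the payoff comes from validating it against a declared type before downstream code touches it. A minimal stdlib-only sketch (the `TicketTriage` schema and `parse_triage` helper are invented for illustration; libraries like Pydantic do the same job with richer coercion):

```python
import json
from dataclasses import dataclass, fields

@dataclass
class TicketTriage:
    severity: str   # expected: "low" | "medium" | "high"
    component: str
    summary: str

ALLOWED_SEVERITIES = {"low", "medium", "high"}

def parse_triage(raw):
    """Validate a model's JSON reply against the schema; fail loudly on drift."""
    data = json.loads(raw)
    names = {f.name for f in fields(TicketTriage)}
    extra, missing = set(data) - names, names - set(data)
    if extra or missing:
        raise ValueError(f"schema mismatch: extra={extra}, missing={missing}")
    if data["severity"] not in ALLOWED_SEVERITIES:
        raise ValueError(f"bad severity: {data['severity']!r}")
    return TicketTriage(**data)
```

Raising on a schema mismatch, rather than silently passing a dict along, turns model drift into a retryable error at the boundary instead of a corrupted state three steps later.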
How formalizing LLM calls as typed semantic transformations with algebraic composition operators produces more predictable, debuggable, and maintainable agentic data workflows.
How agents maintain context, learn from past interactions, and build persistent knowledge across sessions using layered memory architectures.
Reasoning plus Acting — the foundational loop that enables AI agents to think through problems and take targeted action in the world.
The bridge between language models and real-world actions, enabling agents to query APIs, execute code, and interact with external systems.
How the LLM Delegate Protocol makes agent identity, trust, and provenance first-class primitives in multi-agent communication.
How to build a memory admission layer that uses rule-based feature extraction and LLM utility scoring to decide which observations are worth storing—improving recall precision while cutting latency.
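The two-stage shape of that admission layer — cheap deterministic rules first, a model-assigned utility score only for survivors — can be sketched in a few lines. Everything here is an assumed toy: the feature names, thresholds, and the `utility_score` parameter (which stands in for an LLM judging long-term usefulness):

```python
def rule_features(observation):
    """Cheap, deterministic signals computed before any model call."""
    return {
        "length": len(observation),
        "is_question": observation.rstrip().endswith("?"),
    }

def admit(observation, utility_score, threshold=0.5):
    """Gate with rules first, then fall back to the LLM utility score.
    Rule rejections never pay the latency of a model call."""
    feats = rule_features(observation)
    if feats["length"] < 10:   # too short to be worth storing
        return False
    if feats["is_question"]:   # questions are transient, skip storage
        return False
    return utility_score >= threshold
```

The ordering is the point: the rules prune the bulk of candidate observations for free, so the expensive utility judgment only runs on the ambiguous middle, which is where the recall-precision and latency wins come from.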
How a unified skill ontology and open repository changes the way agents discover, evaluate, and compose capabilities at scale.
How to design a batteries-included agent harness that bundles planning, file I/O, sub-agent delegation, and context management into a reusable, composable substrate.
How a dedicated runtime infrastructure layer can observe, reason over, and intervene in agent behavior to optimize latency, token efficiency, reliability, and safety without touching the model or application code.
A practical guide to the streaming modes available in agent graph frameworks, covering state updates, LLM token streams, tool lifecycle events, and subgraph outputs.
How to build low-overhead jailbreak and harmful-content detectors by repurposing the internal activations of your existing model instead of running a separate classifier.
Google's open protocol enabling AI agents to discover, communicate, and collaborate across organizational boundaries using standardized task exchange.
An official MCP extension enabling tools to return interactive UI components — dashboards, forms, and visualizations — that render directly in conversations.
An open standard from Anthropic that defines how AI agents connect to external tools, data sources, and services through a composable server architecture.
An open industry protocol enabling AI agents to shop across any participating merchant using unified APIs for checkout, identity linking, and order management.
Defense in depth for AI agents: input validation, output filtering, tool sandboxing, guardian agents, and OWASP LLM security risks.