Methodology & Bibliography
How this glossary is sourced, what counts as a primary source, how freshness is managed, and the full 51-entry bibliography.
Data last refreshed: 2026-05-08. Next scheduled refresh: Q3 2026, or sooner on a vendor / regulator / paper trigger (see refresh cadence).
Sourcing rule
Every term in this glossary has a single canonical primary-source citation: a vendor or framework documentation page, a peer-reviewed or arXiv paper, an analyst publication where the analyst is the originator of the framing, or a standards-body specification. Wikipedia and secondary blogs are never a primary citation; if a concept only appears in such surfaces, the term does not yet warrant inclusion.
When more than one canonical source could be cited (e.g., RAG has the Lewis et al. paper and several vendor docs), the citation is chosen by hierarchy: peer-reviewed paper > standards body > vendor primary documentation > analyst publication.
Freshness flags
- Foundational — original peer-reviewed papers that anchor the field (Lewis et al. RAG 2020; Park et al. Generative Agents 2023; Liu et al. Lost in the Middle 2023; Malkov & Yashunin HNSW 2018; Robertson & Zaragoza BM25 2009; Cormack et al. RRF 2009; Jégou et al. Product Quantization 2011; Izacard & Grave Fusion-in-Decoder 2020; Wang et al. Voyager 2023; Shinn et al. Reflexion 2023; Packer et al. MemGPT 2023; Hogan et al. Knowledge Graphs 2021). Refreshed only if substantively revised.
- In force — binding standards or vendor specifications currently in effect (e.g., the W3C SPARQL 1.1 query language; Anthropic prompt-caching documentation as a billed product feature).
- Emerging 2026 — terms that entered mainstream discourse in 2025–2026 and are still stabilising (Mem0, A-Mem, GraphRAG, Agentic RAG, Context engineering, Context graph). Carried with explicit emerging flags so readers know the framing may evolve.
- Contested — terms with named, meaningful disagreement in the field; we carry both positions when they exist.
Refresh cadence
Quarterly term audit (next scheduled: Q3 2026). Trigger-based refresh on any of:
- A canonical-vendor SDK or doc page changes a term's framing (e.g., Anthropic's prompt-caching API, Cohere's rerank model, Pinecone's reranking guide).
- A new arXiv paper introduces a term that has crossed the bar from emerging to load-bearing.
- A vector-store vendor releases a new index family or quantisation technique that displaces a current default.
- The W3C, Neo4j, or another standards body updates a query-language specification cited here.
- A canonical-source URL 404s (citation rot is the fastest-acting decay).
Daily 15-minute news skim flows back into raw/research-notes.md under Open questions / follow-up. Findings accumulate between sprints; the next quarterly refresh sweeps them in.
Coverage scope
This glossary is intentionally scoped to memory architecture for LLM agents. It deliberately excludes general-purpose ML vocabulary (gradient descent, backpropagation, transformer architecture details) that does not bear on agent-memory design decisions. For broader agentic-AI vocabulary see the Agentic Glossary — Quick Reference; for narrative depth on flagship terms (RAG, Memory, Knowledge, Context graph) see the deep Agentic Glossary.
Bibliography (51 unique sources)
- AgentsBooks — Anatomy of a Firm. https://agentsbooks.com/anatomy
- Anthropic — Building Effective Agents. https://www.anthropic.com/research/building-effective-agents
- Anthropic — Claude long-context model card. https://docs.claude.com/en/docs/build-with-claude/context-windows
- Anthropic — Effective context engineering for AI agents. https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents Emerging 2026
- Anthropic — Long context prompting tips. https://www.anthropic.com/news/prompting-long-context
- Anthropic — Prompt caching. https://docs.claude.com/en/docs/build-with-claude/prompt-caching In force
- Asai et al. — Self-RAG (2023). https://arxiv.org/abs/2310.11511
- Cohere — Rerank model documentation. https://docs.cohere.com/docs/rerank-overview
- Cormack et al. — RRF (2009). https://plg.uwaterloo.ca/~gvcormac/cormacksigir09-rrf.pdf Foundational
- Faiss — Approximate nearest neighbour wiki. https://github.com/facebookresearch/faiss/wiki
- Faiss — IVF documentation. https://github.com/facebookresearch/faiss/wiki/Faiss-indexes
- Gao et al. — HyDE (2022). https://arxiv.org/abs/2212.10496
- Gartner — Hype Cycle for Agentic AI. https://www.gartner.com/en/articles/hype-cycle-for-agentic-ai Emerging 2026
- Hogan et al. — Knowledge Graphs (2021). https://arxiv.org/abs/2003.02320 Foundational
- Izacard & Grave — Fusion-in-Decoder (2020). https://arxiv.org/abs/2007.01282 Foundational
- Jégou et al. — Product Quantization (2011). https://lear.inrialpes.fr/pubs/2011/JDS11/jegou_searching_with_quantization.pdf Foundational
- Karpukhin et al. — Dense Passage Retrieval (2020). https://arxiv.org/abs/2004.04906
- LangChain — ConversationSummaryMemory. https://python.langchain.com/docs/versions/migrating_memory/conversation_summary_memory/
- LangChain — Memory in Agents. https://python.langchain.com/docs/how_to/migrate_agent/
- LangChain — Query analysis (decomposition). https://python.langchain.com/docs/how_to/query_multiple_queries/
- LangChain — Semantic chunker. https://python.langchain.com/docs/how_to/semantic-chunker/
- Letta — Documentation. https://docs.letta.com/
- Lewis et al. — Retrieval-Augmented Generation (2020). https://arxiv.org/abs/2005.11401 Foundational
- Liu et al. — Lost in the Middle (2023). https://arxiv.org/abs/2307.03172 Foundational
- Malkov & Yashunin — HNSW (2018). https://arxiv.org/abs/1603.09320 Foundational
- Mem0 — Memory architecture for AI agents. https://docs.mem0.ai/overview
- Microsoft Research — GraphRAG. https://microsoft.github.io/graphrag/ Emerging 2026
- Milvus — Documentation. https://milvus.io/docs
- Neo4j — Cypher Manual. https://neo4j.com/docs/cypher-manual/current/
- Neo4j — Property graph model. https://neo4j.com/docs/getting-started/appendix/graphdb-concepts/
- OpenAI — Embeddings guide. https://platform.openai.com/docs/guides/embeddings
- Packer et al. — MemGPT (2023). https://arxiv.org/abs/2310.08560 Foundational
- Park et al. — Generative Agents (2023). https://arxiv.org/abs/2304.03442 Foundational
- pgvector — GitHub. https://github.com/pgvector/pgvector
- Pinecone — Chunking strategies. https://www.pinecone.io/learn/chunking-strategies/
- Pinecone — Documentation. https://docs.pinecone.io/
- Pinecone — Reranking guide. https://docs.pinecone.io/guides/search/rerank-results
- Pinecone — Similarity search. https://docs.pinecone.io/guides/search/semantic-search
- Pinecone — Vector similarity explained. https://www.pinecone.io/learn/vector-similarity/
- Pinecone — What is a vector database?. https://www.pinecone.io/learn/vector-database/
- Qdrant — Documentation. https://qdrant.tech/documentation/
- Robertson & Zaragoza — BM25 (2009). https://www.staff.city.ac.uk/~sbrp622/papers/foundations_bm25_review.pdf Foundational
- Shinn et al. — Reflexion (2023). https://arxiv.org/abs/2303.11366 Foundational
- W3C — RDF 1.1 Primer. https://www.w3.org/TR/rdf11-primer/
- W3C — RDF Concepts. https://www.w3.org/TR/rdf11-concepts/
- W3C — SPARQL 1.1 Query Language. https://www.w3.org/TR/sparql11-query/ In force
- Wang et al. — Voyager (2023). https://arxiv.org/abs/2305.16291
- Weaviate — Documentation. https://weaviate.io/developers/weaviate
- Weaviate — Hybrid search documentation. https://weaviate.io/developers/weaviate/search/hybrid
- Xu et al. — A-Mem (2025). https://arxiv.org/abs/2502.12110 Emerging 2026
- Zhong et al. — MemoryBank (2023). https://arxiv.org/abs/2305.10250
Privacy and editorial discipline
This site reuses the AgentsBooks portfolio's controlled vocabulary — agent fleet, agentic firm, 8 primitives, complete-not-compete, auditable substrate — and never references private client identifiers. Voice is operator-grade, plural first-person, citation-anchored. Comparison-style claims about vendors quote the vendor's own published documentation; we do not editorialise inside a definition.