Cache Augmented Generation (CAG)
Cache Augmented Generation (CAG) is a novel approach to enhancing large language models (LLMs) by preloading knowledge as precomputed key-value caches, enabling low-latency, accurate, and efficient AI performance for static knowledge tasks.
•
7 min read