MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems
paperium.net·6h·

Overview

This article introduces the Mixtures of scenario-aware document Memories (MoM) framework for Retrieval-Augmented Generation (RAG) systems. MoM replaces passive text chunking with proactive document memory extraction, simulating human cognition. It uses Large Language Models (LLMs) to generate outlines and extract core content, and trains Small Language Models (SLMs) to construct these memories.

A key innovation is its three-layer document memory retrieval mechanism, theoretically grounded in probabilistic modeling. Experiments across three domains demonstrate MoM’s effectiveness, resolving RAG text chunking challenges by providing LLMs with semantically complete document memories and enabling SLMs to achieve human-centri…
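To make the idea of layered retrieval more concrete, here is a minimal Python sketch. It is not the paper's method. The overview names outlines and core content but does not spell out the three layers, so the third layer (`chunk`), the token-overlap similarity, the fixed layer weights, and every name in the code (`DocumentMemory`, `overlap`, `score`, `retrieve`) are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class DocumentMemory:
    """Hypothetical three-layer memory; the layer choice is an assumption."""
    outline: str       # high-level structure of the document
    core_content: str  # condensed key points
    chunk: str         # assumed third layer: the underlying text span


def overlap(query: str, text: str) -> float:
    """Toy similarity: fraction of query tokens that also appear in text."""
    q = set(query.lower().split())
    t = set(text.lower().split())
    return len(q & t) / len(q) if q else 0.0


def score(query: str, memory: DocumentMemory,
          weights: tuple = (0.2, 0.3, 0.5)) -> float:
    """Weighted mixture of per-layer similarities (weights are illustrative)."""
    layers = (memory.outline, memory.core_content, memory.chunk)
    return sum(w * overlap(query, layer) for w, layer in zip(weights, layers))


def retrieve(query: str, memories: list, k: int = 1) -> list:
    """Return the k memories with the highest mixture score."""
    return sorted(memories, key=lambda m: score(query, m), reverse=True)[:k]
```

A real system would presumably swap the token-overlap function for embedding similarity and derive the layer weights from the probabilistic model the article mentions rather than fixing them by hand.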
