QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling
arxiv.org·6h
💻Local LLMs
Preview
Report Post

Computer Science > Machine Learning

arXiv:2512.06582 (cs)

View PDF

Abstract:Recurrent neural architectures such as LSTM and GRU remain widely used in sequence modeling, but they continue to face two core limitations: redundant gate-specific parameters and reduced ability to retain information across long temporal distances. This paper introduces the Quantum-Leap LSTM (QL-LSTM), a recurrent architecture designed to address both challenges through two independent components. The Parameter-Shared Unified Gating mechanism replaces all gate-specific transformations with a single shared weight matrix, reducing parameters by approximately 48 percent while preserving full gating behavior. The Hierarchical Gated Recurrence with Additive Skip …

Similar Posts

Loading similar posts...