A non-diagonal SSM RNN computed in parallel without requiring stabilization
github.com·3w·
Discuss: Hacker News
Flag this post

goom_ssm_rnn

Reference implementation of a deep RNN that captures dependencies with a non-diagonal linear state-space model (SSM) over our implementation of generalized orders of magnitude (GOOMs), allowing recurrent states to fluctuate freely over a greater dynamic range of real values than previously possible, enabling computation of non-diagonal recurrences in parallel, via a prefix scan, without any form of stabilization.

Installing

Clone this repository. 1.

Install the Python dependencies in requirements.txt. 1.

There is no third step.

Instantiating the RNN

The following code instantiates a small RNN for generative language modeling tasks with GPT-2’s vocabulary:

import torch
import...

Similar Posts

Loading similar posts...