Rnj-1: Building Instruments of Intelligence
essential.ai

The long-term advancement and equitable diffusion of AI technologies depend crucially on their development in the open. In the US, a few stalwarts of open-source AI are protecting its future. Today, we are proud to make our first model contribution to the open-source canon with Rnj-1 (an homage to Ramanujan, pronounced “range-1”), a world-class pair of base and instruction-tuned large language models. In this post, we summarize the key capabilities of the models, briefly cover the background behind their development, and share our vision for what lies ahead.


Capabilities

Rnj-1 is an 8B model that roughly follows the open-source Gemma 3 architecture. We employ global self-attention and YaRN to extend the context to 32k. The Rnj-1 Base and Instruct models compare favor…
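For readers unfamiliar with YaRN, the sketch below illustrates the general technique as it is usually described: rotary (RoPE) frequencies that complete many rotations within the original context are left untouched, slowly rotating ones are interpolated by the scale factor, and a ramp blends the two in between. It is not Rnj-1's actual implementation. The head dimension, RoPE base, original context length of 8192, scale factor of 4, and the ramp bounds `alpha` and `beta` are all assumptions chosen for illustration; the post only states that the context is extended to 32k.

```python
import math


def yarn_inv_freq(head_dim, base, orig_ctx, scale, alpha=1.0, beta=32.0):
    """Return YaRN-style rescaled RoPE inverse frequencies (illustrative sketch).

    All arguments are hypothetical example values, not Rnj-1's configuration.
    """
    inv_freq = []
    for i in range(0, head_dim, 2):
        theta = base ** (-i / head_dim)  # standard RoPE frequency
        # Number of full rotations this dimension makes over the original context.
        rotations = orig_ctx * theta / (2 * math.pi)
        # Ramp: 0 -> fully interpolate (divide by scale), 1 -> keep unchanged.
        gamma = min(1.0, max(0.0, (rotations - alpha) / (beta - alpha)))
        inv_freq.append((1 - gamma) * theta / scale + gamma * theta)
    return inv_freq


def yarn_attention_scale(scale):
    """Factor applied to the rotary cos/sin terms (and so to queries and keys)."""
    return 0.1 * math.log(scale) + 1.0


# Assumed example: extend an 8192-token context by 4x to 32768 tokens.
freqs = yarn_inv_freq(head_dim=128, base=10000.0, orig_ctx=8192, scale=4.0)
attn_scale = yarn_attention_scale(4.0)
```

With these assumed values, the fastest-rotating dimension keeps its original frequency, while the slowest is divided by the scale factor of 4.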
