Tapestry Loom
A power-user-focused interface for LLM base models, inspired by the designs of loom, loomsidian, exoloom, logitloom, and wool.
Known issues
- Some documents may cause the text editor to render token boundaries incorrectly
  - This is due to a bug in egui regarding textedit underline rendering
- Tab bars are not read by screen readers
  - This is due to a bug in egui_tiles
If you are experiencing an issue not listed here or in this repository’s active issues, please file an issue so that it can be fixed.
Getting started
Binary releases
Compiled binaries can be found on the releases page.
macOS-specific instructions
Before using the app, you will need to run the following CLI command in the extracted folder:
xattr -d com.apple.quarantine tapestry*
Compiling from source
Requires the Rust Programming Language and a working C compiler to be installed.
git clone --recurse-submodules https://github.com/transkatgirl/Tapestry-Loom.git
cd Tapestry-Loom
cargo build --release
The compiled binary can be found in the ./target/release/ folder.
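If you would rather build and launch in one step, Cargo's standard run command should also work from the repository folder:
cargo run --release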
Updating
Run the following commands in the repository folder:
git pull
git submodule update --init --recursive
cargo build --release
Usage
See Getting Started for more information on how to use the application.
The rest of this README covers the usage of external tools which Tapestry Loom can interface with.
Migrating weaves from other Loom implementations
See migration-assistant for more information on how to migrate weaves from other Loom implementations to Tapestry Loom.
Local inference
llama.cpp’s llama-server is recommended, as it has been confirmed to work properly with all of the features within Tapestry Loom.
Ollama should not be used due to bad sampling settings which cannot be overridden in API requests, along with a lack of available base models.
KoboldCpp is not recommended due to a lack of request queuing and a poor implementation of logprobs (the number of requested logprobs is entirely ignored).
The recommended CLI arguments for llama-server are listed below:
llama-server --models-dir $MODEL_DIRECTORY --models-max 1 --jinja --chat-template "message.content" --ctx-size 4096 --temp 1 --top-k 0 --top-p 1 --min-p 0
Where $MODEL_DIRECTORY is set to the directory where model gguf files are stored.
(Regarding quantization: Benchmarks of how chat models are affected by quantization likely do not generalize to how base models are used. Quantization should be kept as low as reasonably possible, but q8_0 is likely good enough for most use cases.)
Explanation of arguments:
- Only one model is loaded into VRAM at a time; old models are automatically unloaded to make room for new ones.
- The specified chat template passes user input directly to the model without further changes.
- Reducing the maximum context length helps reduce VRAM usage without sacrificing quality.
- The default sampling parameters (those specified by the CLI arguments) should leave the model's output distribution unchanged. Sampling parameter defaults for chat models do not generalize to how base models are used.
- The sampling parameters specified in the CLI arguments will be overridden by any sampling parameters that are specified in a request.
Additional useful arguments (depending on your use case):
- --no-cont-batching - Disabling continuous batching significantly improves response determinism at the expense of performance. Should be used if you plan on analyzing logprobs or using greedy sampling.
If you are running llama-server on the same device as Tapestry Loom (and you are using the default port), you do not need to explicitly specify an endpoint URL when filling out the "OpenAI-style Completions" and "OpenAI-style ChatCompletions" templates.
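As a quick sanity check, you can send a completion request to the default endpoint directly from a terminal. The sketch below assumes llama-server's defaults (listening on 127.0.0.1:8080, with OpenAI-style completions at /v1/completions) and uses a placeholder model name; the sampling parameters in the request body override the CLI defaults described above:
curl http://127.0.0.1:8080/v1/completions -H "Content-Type: application/json" -d '{"model": "MODEL_NAME", "prompt": "Once upon a time", "max_tokens": 32, "temperature": 0.8, "top_p": 0.95}'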
Recommended models
If you are new to working with LLM base models, Trinity-Mini-Base-Pre-Anneal (or Trinity-Nano-Base-Pre-Anneal if you have <32GB of VRAM) is a good first model to try.
Tokenization server (optional)
See tapestry-tokenize for more information on how to configure and use the (optional) tokenization server.
Once a tokenization endpoint is configured for a model, enabling the setting "(Opportunistically) reuse output token IDs" can improve output quality, especially in weaves where nodes are being generated by one model rather than an ensemble of models.
This setting requires the inference backend to support returning token IDs (to check if this is working, hover over generated tokens in the text editor to see if they contain a token identifier). This is a non-standard addition to the OpenAI Completions API which is currently supported by very few inference backends (llama.cpp has been confirmed to work properly with this feature).
If your inference backend returns token IDs in OpenAI-style Completions responses but they do not appear in your weaves, please file an issue.
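If you would like to check from a terminal whether your backend returns token IDs at all, requesting a short completion and pretty-printing the raw JSON response makes any backend-specific token ID fields easy to spot (the exact field names vary by backend, so none are assumed here). The command below assumes llama-server's default address, a placeholder model name, and that jq is installed:
curl -s http://127.0.0.1:8080/v1/completions -H "Content-Type: application/json" -d '{"model": "MODEL_NAME", "prompt": "Hello", "max_tokens": 4}' | jq .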
Plans
At the moment, all major features planned for the initial release have been implemented. Development will slow down for the next few months, as the focus shifts towards fixing bugs and improving documentation.
Development of the next major version of Tapestry Loom will begin in Q1 2026. Please consider donating to help fund further development.
Plans for next major version
- Support for DAG-based Weaves, similar to this unreleased loom implementation
- FIM completions
  - Selected text is used to determine FIM location
- Node copying & moving
- Perform heavy testing of data structures and/or formal verification to prevent bugs that could result in data loss
- Implement node "editing" UI (not actually editing node content, but editing the tree by adding nodes / splitting nodes / merging nodes), similar to inkstream
  - Fully immutable nodes; node splitting is implemented through duplication
    - Prefix-based duplication
- Embedding model support
  - Node ordering by seriation
- Add a plugin API & custom inference API
  - Support the following use cases:
    - LLM research
      - Support adding custom UI elements and editor subviews
    - Autolooms (looms where node choices are picked by a user-determined algorithm)
    - Adaptive looming (node lengths are picked by a user-determined algorithm)
  - Implement an optional inference server using llama.cpp
    - Adaptive looming using token entropy or confidence
    - Context window wrapping
- Allow adjusting proportion of completions from each model
  - When working with multiple models, allow dynamically adjusting proportions based on usage
  - Flatten proportion bias when increasing number of completions, do the inverse when reducing completion count
- Further UI improvements
  - Add ability to manually control refreshing of model tokenization identifier
  - Blind comparison modes
    - (Hide) Models & token probabilities / boundaries
    - (Hide) Generated node text (only showing metadata & probabilities)
  - Improve handling of hovered + omitted/collapsed nodes
  - Better handle valid UTF-8 character split across multiple nodes
  - Improve graph/canvas layout algorithm
  - Add generate buttons (displayed on hover) to canvas
  - Improve clarity of error messages
  - Better file manager
  - Support keyboard shortcuts for all aspects of the UI, not just the weave editor
    - Aim to support navigating the entirety of the UI without a mouse
  - Improve built-in color schemes
  - Node finding
  - Customizable node sorting
    - Time added
    - Alphabetical
    - Semantic sorting
  - Customizable node color coding
    - Probability
    - Confidence
  - Node bulk selection
  - Node custom ordering via drag and drop
    - Support reordering nodes in canvas and graph views as well
  - Keyboard shortcut presets
    - Built-in presets
      - loomsidian-like
      - exoloom-like
      - Tapestry Loom
    - Saving & loading custom presets
    - Importing & exporting custom presets
  - Support touchscreen-only devices
  - Show hovered child of active node in editor, similar to exoloom
  - Add ability to add custom labels to bookmarks/nodes
  - Add ability to add custom attributes to nodes, rather than just bookmarks
- Weave statistical analysis tools
  - Predictability analysis using logprobs
  - Statistical analysis of various metrics (model usage, text length, logprobs, number of branches, etc.)
- Token streaming and display of nodes being generated
- Optimize for performance whenever possible
  - Aim to have acceptable performance on weaves with ~1 million nodes (~200k of them active) and ~10MB of active text on low-end hardware (such as a Raspberry Pi)
  - Implement a special "link" node to allow splitting giant weaves into multiple documents
  - Optimize memory usage to be as low as reasonably possible
- Add support for more weave migrations
  - bonsai (using damask)
  - (subset of) miniloom
- Support Standard Completions (after the specification is finalized)
Note: Tapestry Loom will be entirely focused on base and/or embedding models for the foreseeable future.
There are already good chat looms (such as miniloom) and base model looms which heavily integrate assistant functionality (such as helm); Tapestry Loom will not be one of them.
Speculative ideas
- Support weaves of arbitrarily large size using a database-based format
- Self-contained packaging: All documentation and tools in one app, rather than being spread out over multiple
- Collaborative weave editing
  - Server-client, multi-user WebUI
- Efficiently store full edit history in weave for lossless unbounded undo/redo
- Alternate input devices
  - Talon Voice
  - Controllers / Gamepads
  - USB DDR Pads
See also: the original rewrite plans