I built a tool to stop manually swapping models on my 8GB GPU: it chains a small Prompter and a large Coder into one pipeline with automatic VRAM swapping
Contribute to atharva557/Prompt-Chaining development by creating an account on GitHub.
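
To make the idea in the title concrete, here is a minimal sketch of a two-stage chain that keeps only one model resident at a time. It is not taken from the repository: `ModelSlot`, `run_chain`, `fake_loader`, `fake_unloader`, and the model names `prompter-small` and `coder-large` are all hypothetical, and the loader and unloader are stand-ins for whatever a real backend does to place a model in VRAM and free it again.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

# Hypothetical type: a loaded model is just a function from prompt to text.
Generate = Callable[[str], str]


@dataclass
class ModelSlot:
    """Holds at most one loaded model, standing in for a small VRAM budget."""
    loader: Callable[[str], Generate]   # assumption: loads a model by name
    unloader: Callable[[str], None]     # assumption: frees the named model
    current: Optional[str] = None
    _generate: Optional[Generate] = None
    events: List[str] = field(default_factory=list)  # load/unload log

    def use(self, name: str) -> Generate:
        """Return the generate function for `name`, swapping models if needed."""
        if self.current == name and self._generate is not None:
            return self._generate
        if self.current is not None:
            # Free the resident model before loading the next one.
            self.unloader(self.current)
            self.events.append(f"unload {self.current}")
        self._generate = self.loader(name)
        self.current = name
        self.events.append(f"load {name}")
        return self._generate


def run_chain(slot: ModelSlot, task: str,
              prompter: str = "prompter-small",
              coder: str = "coder-large") -> str:
    """Stage 1 refines the task into a prompt; stage 2 turns it into code."""
    refined = slot.use(prompter)(task)
    return slot.use(coder)(refined)


def fake_loader(name: str) -> Generate:
    """Stand-in loader: returns a function that tags text with the model name."""
    return lambda text: f"[{name}] {text}"


def fake_unloader(name: str) -> None:
    """Stand-in unloader: a real backend would release VRAM here."""
    return None


if __name__ == "__main__":
    slot = ModelSlot(fake_loader, fake_unloader)
    print(run_chain(slot, "write a sort function"))
    print(slot.events)
```

Keeping the swap inside `use` means the chain itself never has to know which model is resident; a real backend would only need to supply the two callbacks.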