Generation at the Speed of Thought: Speculative Decoding
🔵LLM frameworks and AI libraries for TypeScript
TIL: For long-lived LLM sessions, swapping the KV cache to RAM is ~10x faster than recalculating it. Why isn't this a standard feature?
🦙Simple finetuning LLMs
Cycle-accurate 6502 emulator as coroutine in Rust
🔥Svelte
Opportunistically Parallel Lambda Calculus
🔵LLM frameworks and AI libraries for TypeScript
No Cap, This Memory Slaps: Breaking Through the OLTP Memory Wall
🦙Simple finetuning LLMs
How fast can an LLM go?
📊Vector Databases
DGX Spark UMA can trick you
🔥Svelte
Linux/WASM
🧩WASI
From Lossy to Lossless Reasoning
🤖Coding Automation
When Five Dumb AIs Beat One Smart AI: The Case for Multi-Agent Systems
🤖Coding Automation
Learning to program "recycles" preexisting frontoparietal (F-P) population codes of logical algorithms
🔄AI Pipeline design and techniques
I built a lightweight HTTP bridge for AnythingLLM to safely run multiple local MCPs inside Docker (Dummy + Time demo included)
🧩WASI