⚙️ Finetuning LLMs faster with less memory
Scoured 4259 posts in 125.4 ms
llama.cpp guide - Running LLMs locally, on any hardware, from scratch
blog.steelph0enix.dev · 3d
🦙 Simple finetuning LLMs
Show HN: Fighting the War Against Expensive Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app · 1h · Discuss: Hacker News
🔄 AI Pipeline design and techniques
Staying on Top in the Age of LLMs
andrasgerlits.medium.com · 1h · Discuss: Hacker News, r/programming
🦙 Simple finetuning LLMs
GLM 5 is already on huggingface!
huggingface.co · 14h · Discuss: r/LocalLLaMA
🦙 Simple finetuning LLMs
[TUHS] bare m4 (was BTL summmer employees)
tuhs.org · 16h · Discuss: Lobsters
🦀 Rust language vector embeddings
Running Mistral-7B on Intel NPU — 12.6 tokens/s, zero CPU/GPU usage
github.com · 2h · Discuss: r/LocalLLaMA
🧩 WASI
Show HN: Latent-k – Persistent dependency map to reduce AI coding token usage
latentk.org · 19h · Discuss: Hacker News
🤖 Coding Automation
LLM Performance in Astro, React, Tailwind and Cloudflare
10xbench.ai · 1d · Discuss: Hacker News
🦙 Simple finetuning LLMs
GLM-5: From Vibe Coding to Agentic Engineering
simonwillison.net · 13h
🔄 AI Pipeline design and techniques
Show HN: I built an AI executive assistant you use through iMessage
getattache.com · 16h · Discuss: Hacker News
🔵 LLM frameworks and AI libraries for TypeScript
Garnix Blog: Forwardly-evaluated build systems
garnix.io · 20h · Discuss: Lobsters
📚 Monorepo Patterns
EyesOff: Why Some Models Quantize Better Than Others
ym2132.github.io · 9h · Discuss: Hacker News
📊 Vector Databases
Opus 4.6 Reasoning Distill 3k prompts
huggingface.co · 2d · Discuss: r/LocalLLaMA
🦙 Simple finetuning LLMs
Building a Regex Engine with a team of parallel Claudes
lesswrong.com · 1d
📚 Monorepo Patterns
Building Chess in about 350 lines of Clojure
sammystraus.com · 6h · Discuss: Hacker News
🔥 Svelte
Import AI 444: LLM societies; Huawei makes kernels with AI; ChipBench
importai.substack.com · 2d · Discuss: Substack
🔵 LLM frameworks and AI libraries for TypeScript
Cache-aware disaggregated inference for up to 40% faster long-context LLM serving
together.ai · 1d · Discuss: Hacker News, r/LocalLLaMA
🔥 Svelte
Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective
pub.towardsai.net · 3d
🤖 Coding Automation
How We Built the Fastest Kimi K2.5 on Artificial Analysis
baseten.co · 17h · Discuss: Hacker News
🔄 AI Pipeline design and techniques
An async HTTP server in ~80 lines of modern C++ (coroutines)
vixcpp.com · 58m · Discuss: Hacker News
🔥 Svelte