Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚙️ Finetuning LLMs faster with less memory
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4255
posts in
120.6
ms
llama.cpp
guide - Running LLMs
locally
, on any hardware, from scratch
blog.steelph0enix.dev
·
2d
🦙
Simple finetuning LLMs
LLM Performance in
Astro
, React,
Tailwind
and Cloudflare
10xbench.ai
·
22h
·
Discuss:
Hacker News
🦙
Simple finetuning LLMs
The Problem With LLMs
deobald.ca
·
20h
·
Discuss:
Lobsters
🦙
Simple finetuning LLMs
GLM
5 is already on
huggingface
!
huggingface.co
·
2h
·
Discuss:
r/LocalLLaMA
🦙
Simple finetuning LLMs
Show HN: Latent-k –
Persistent
dependency
map to reduce AI coding token usage
latentk.org
·
7h
·
Discuss:
Hacker News
🤖
Coding Automation
datavorous/spheni
: An in-memory vector search library in C++ with Python bindings
github.com
·
6h
·
Discuss:
Hacker News
📊
Vector Databases
[
TUHS
] bare m4 (was BTL
summmer
employees)
tuhs.org
·
4h
·
Discuss:
Lobsters
🦀
Rust language vector embeddings
GLM-5
: From
Vibe
Coding to Agentic Engineering
simonwillison.net
·
1h
🔄
AI Pipeline design and techniques
Building
DamN64
: LLM-Assisted
N64
Development
vieux.fr
·
1h
·
Discuss:
Hacker News
🔥
Svelte
Opus 4.6 Reasoning
Distill
3k
prompts
huggingface.co
·
1d
·
Discuss:
r/LocalLLaMA
🦙
Simple finetuning LLMs
Show HN: I built an AI executive
assistant
you use through
iMessage
getattache.com
·
4h
·
Discuss:
Hacker News
🔵
LLM frameworks and AI libraries for TypeScript
Building a
Regex
Engine with a team of parallel
Claudes
lesswrong.com
·
20h
📚
Monorepo Patterns
Show HN: Model Training Memory
Simulator
czheo.github.io
·
3d
·
Discuss:
Hacker News
🔄
AI Pipeline design and techniques
Garnix
Blog:
Forwardly-evaluated
build systems
garnix.io
·
8h
·
Discuss:
Lobsters
📚
Monorepo Patterns
How We Built the Fastest
Kimi
K2.5
on Artificial Analysis
baseten.co
·
5h
·
Discuss:
Hacker News
🔄
AI Pipeline design and techniques
Concurrent
vs.
Parallel
Execution in LLM API Calls: From an AI Engineer’s Perspective
pub.towardsai.net
·
2d
🤖
Coding Automation
Compare
up to 5 LLMs side-by-side, then
fuse
the best answers
llmcode.ai
·
1d
·
Discuss:
Hacker News
🦙
Simple finetuning LLMs
Show HN:
RTK
– Wrap your CLI
commands
, save 60-90% of tokens in AI coding agents
github.com
·
6h
·
Discuss:
Hacker News
🤖
Coding Automation
C-- Home
cs.tufts.edu
·
1d
·
Discuss:
Lobsters
🔵
LLM frameworks and AI libraries for TypeScript
DFlash
: Block Diffusion for Flash
Speculative
Decoding
z-lab.ai
·
1d
·
Discuss:
Hacker News
🔥
Svelte
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help