Local model deployment, model quantization, inference optimization, edge deployment

Feeds to Scour
SubscribedAll
Scoured 4364 posts in 51.4 ms
The Abstraction Trap: Why Layers Are Lobotomizing Your Model
news.ycombinator.com·5h·
Discuss: Hacker News
💸Affordable LLMs
Preview
Report Post
hyunwoongko/nanoRLHF: nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
github.com·22h·
Discuss: r/LocalLLaMA
💸Affordable LLMs
Preview
Report Post
Customizing LLMs with Your Data: A Progressive Strategy from Prompting to Fine-Tuning
dev.to·3d·
Discuss: DEV
💸Affordable LLMs
Preview
Report Post
Taming P99s in OpenFGA: How We Built a Self-Tuning Strategy Planner
auth0.com·1d·
Discuss: Lobsters
🚀Performance
Preview
Report Post
Beyond Prompts: Context Engineering as Production AI’s Critical Infrastructure Layer
pub.towardsai.net·22h
💬Prompt Engineering
Preview
Report Post
vLLM: An Efficient Inference Engine for Large Language Models
www2.eecs.berkeley.edu·5d·
Discuss: Hacker News
💸Affordable LLMs
Preview
Report Post
The quest for flow with AI coding
joshuavaldez.com·4h
💬AI Code Assistants
Preview
Report Post
Show HN: Constellations – On-the-fly D3 collaboration graphs of history via LLMs
github.com·3h·
Discuss: Hacker News
🗂️Vector Databases
Preview
Report Post
Writing an LLM from scratch, part 29 -- using DistributedDataParallel to train a base model from scratch in the cloud
gilesthomas.com·2d·
💸Affordable LLMs
Preview
Report Post
Cloudspecs: Cloud Hardware Evolution Through the Looking Glass
muratbuffalo.blogspot.com·11h·
🚀Performance
Preview
Report Post
Fragments: January 8
martinfowler.com·1d
💬Prompt Engineering
Preview
Report Post
AI as the Engine of Application State
jonwoodlief.com·1h·
Discuss: Hacker News
💬AI Code Assistants
Preview
Report Post
Engineering an LLM-Based Data Classifier
getnumberseven.com·3d·
Discuss: Hacker News
🦙Ollama
Preview
Report Post
AI’s Memorization Crisis
theatlantic.com
·5h·
Discuss: Hacker News
🔍RAG
Preview
Report Post
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs
sakana.ai·1d·
Discuss: Hacker News
🛡️AI Security
Preview
Report Post
HW-Accelerated Physical AI Framework For Resource-Constrained Edge Devices (ASU)
semiengineering.com·2d
📱Edge AI
Preview
Report Post
DeepSeek To Release Next Flagship AI Model With Strong Coding Ability
theinformation.com
·14h·
📱Edge AI
Preview
Report Post
How AI Agents is Revolutionizing Open Source Software
oneuptime.com·1d·
🤖spec-driven ai-assisted development
Preview
Report Post
Per-query energy consumption of LLMs
muxup.com·2d·
🚀Performance
Preview
Report Post
LLMs contain a LOT of parameters. But what’s a parameter?
technologyreview.com·2d
💸Affordable LLMs
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help