Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

PRIMAL: Processing-In-Memory Based Low-Rank Adaptation for LLM Inference Accelerator
arxiv.org·1d
📊Quantization
No Libraries No Shortcuts: Reasoning LLMs from Scratch with PyTorch — Part 2
pub.towardsai.net·2d
📏Linear Logic
From 75% to 99.6%: The Math of LLM Ensembles
shibaprasadb.com·1d·
Discuss: Hacker News
🧮Kolmogorov Bounds
Privacy-Preserving Active Learning for heritage language revitalization programs with zero-trust governance guarantees
dev.to·22h·
Discuss: DEV
🔒Privacy Archives
Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)
arxiv.org·3h
📝ABNF Extensions
The three types of LLM workloads and how to serve them
modal.com·16h·
Discuss: Hacker News
⚙️Batch Processing
IPAB Workshop - 22/1/26 | IPAB
informatics.ed.ac.uk·22h
🎼Audio Lambda Calculus
Decoupling the AI Stack: How to Architect a Production-Grade Local LLM System
dev.to·4h·
Discuss: DEV
📏Linear Logic
The Silent AI Breach: How Data Escapes in Fragments
hackernoon.com·12h
🔓Hacking
MIT’s new ‘recursive’ framework lets LLMs process 10 million tokens without context rot
venturebeat.com·1d
🌀Brotli Internals
Using Local LLMs to Discover High-Performance Algorithms
towardsdatascience.com·2d
🧮SMT Solvers
Quiz: How to Integrate Local LLMs With Ollama and Python
realpython.com·20h
⚙️WASM Runtime
featurestorebook/mlfs-book: O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs
github.com·7h·
Discuss: Hacker News
🧠Machine Learning
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
🧠Learned Compression
A Visual Guide to Quantization
newsletter.maartengrootendorst.com·2d
📊Quantization
AI researchers map models to banish 'demon' persona
theregister.com·1d
🔍Vector Forensics
Redacting Faces, People, Vehicles, and Plates with Amped Replay Assisted Redaction
blog.ampedsoftware.com·17h
🧪Archive Fuzzing
Evolution of LLMs use by a programmer
asfaload.com·15h·
Discuss: Hacker News
🧩WASM Components
As Strong As Your Weakest Parameter: An AI Authorization Bypass
praetorian.com·15h
🎯Threat Hunting
Ensemble Listening Model (ELM): State-of-the-Art Foundation Model Accuracy. A Fraction of the Cost.
ensemblelisteningmodel.com·1d·
Discuss: Hacker News
🎵Audio ML
