Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 LLM Inference
Quantization, Attention Mechanisms, Batch Processing, KV Caching
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
26871
posts in
4.63
s
VSORA
Board Chair Sandra
Rivera
on Solutions for AI Inference and LLM Processing
semiwiki.com
·
1d
🏗️
LLM Infrastructure
Alibaba launches open source AI model
RynnBrain
for
robotics
techzine.eu
·
22h
🆕
New AI
LookML
: An Alternative Semantic Layer Approach to build a Reliable AI Analytics Agent with
BigQuery
pub.towardsai.net
·
2d
🦙
Ollama
Securing
GenAI
: Vol 4 — Fundamentals of AI model security
pub.towardsai.net
·
1d
🛡️
AI Security
SAE
Feature
Matchmaking
(Layer-to-Layer)
lesswrong.com
·
1d
🔗
ONNX
Colab
marketplace.visualstudio.com
·
19h
🎨
ChromaDB
How
Yelp
Built “
Yelp
Assistant
”
blog.bytebytego.com
·
1d
💳
Content Monetization
Gemini
thinking
| Gemini API | Google AI for
Developers
ai.google.dev
·
1d
✨
Gemini
Reinforcement
Inference
: Leveraging Uncertainty for
Self-Correcting
Language Model Reasoning
arxiv.org
·
1d
🏗️
LLM Infrastructure
Tokens
of AI
Bias
chinamediaproject.org
·
2d
🛡️
AI Security
What do LLMs think when you don't
tell
them what to think about?
together.ai
·
5d
🏗️
LLM Infrastructure
HQP
: Sensitivity-Aware Hybrid Quantization and
Pruning
for Ultra-Low-Latency Edge AI Inference
arxiv.org
·
2d
📱
Edge AI Optimization
Designing and Using
Combinators
: The
Essence
of Functional Programming
cse.chalmers.se
·
1d
·
Discuss:
Hacker News
💻
Programming languages
NotebookLM
: The AI that only
learns
from you
byandrev.dev
·
3d
·
Discuss:
Hacker News
👨💻
AI Coding
Last30Days
: A
Recency-Aware
Research API for X, Reddit, and the Web
lumify.ai
·
21h
·
Discuss:
Hacker News
📊
Feed Optimization
Show HN: 289x
speedup
over
MLP
using Spectral Graphs
zenodo.org
·
3d
·
Discuss:
Hacker News
🎨
ChromaDB
The control
layer
for AI
blog.dottxt.ai
·
4d
·
Discuss:
Hacker News
🛡️
AI Security
Data Modeling for the Agentic Era:
Semantics
, Speed, and
Stewardship
rilldata.com
·
1d
·
Discuss:
Hacker News
🔄
Incremental Computation
Ask HN: Are past LLM models getting
dumber
?
news.ycombinator.com
·
21h
·
Discuss:
Hacker News
🏆
LLM Benchmarking
Reliability of LLMs as medical assistants for the general public: a
randomized
preregistered
study
nature.com
·
1d
·
Discuss:
Hacker News
🏆
LLM Benchmarking
Loading...
Loading more...
« Page 9
•
Page 11 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help