AI Engineering
LLM, generative AI, AI systems, model deployment
BacteReason: A Reasoning Model for Antimicrobial Resistance Prediction
Machine Learning · Content type: Academic · biorxiv.org · 3 days ago
Your AI agent reads the fine print: building a RAG pipeline over EU regulations with Elasticsearch and OGX
Backend Engineering · Content type: Blog · elastic.co · 1 day ago
huawei-csl/KVarN: KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.
LLMs · Content type: Code · github.com · 6 days ago · Hacker News
LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization
LLMs · Content type: Academic · arxiv.org · 21 hours ago
Quiz: Embeddings and Vector Databases With ChromaDB
Machine Learning · realpython.com · 1 day ago
New comment by jasonlayton4323 in "Ask HN: Who wants to be hired? (June 2026)"
Cloud Computing · drive.google.com · 6 days ago · Hacker News
How to Defend Against Prompt Injection in Production
Backend Engineering · Content type: Reference · leanpub.com · 1 day ago · DEV
Fine tuning classification in Elixir
Machine Learning · elixirstatus.com · 2 days ago
MongoDB as a Vector Database for AI Agents-MongoDB
Query Engines · foojay.io · 6 days ago
The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring
LLMs · Content type: Academic · arxiv.org · 21 hours ago
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
LLMs · vettedconsumer.com · 4 days ago · Hacker News
ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.
LLMs · Content type: Code · github.com · 1 day ago · Hacker News
Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
LLMs · aermia.com · 4 days ago · Hacker News
Agentic AI vs Generative AI: Why one without the other hits a ceiling
LLMs · Content type: Blog · udacity.com · 6 days ago
UniSVQ: 2-bit Unified Scalar-Vector Quantization
Machine Learning · Content type: Academic · arxiv.org · 21 hours ago
Unlocking dependable responses with Gemini Enterprise Agent Platform's Agentic RAG
LLMs · Content type: Blog · research.google · 5 days ago
How we fight GPU scarcity without compromise
LLMs · Content type: Blog · equixly.com · 5 days ago · Hacker News
Context Engineering Is the Skill That Actually Ships Reliable AI Agents
LLMs · haloproject.gumroad.com · 4 days ago · DEV
Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning
LLMs · Content type: Academic · arxiv.org · 21 hours ago
What Is Generative AI?
LLMs · Content type: Academic · excelsior.edu · 6 days ago