Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Engineering
🤖 AI Engineering
LLM, AI systems, machine learning ops, model deployment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
3205
posts in
17.9
ms
TileFuse: A Fused Mixed-Precision Kernel Library for Efficient
Quantized
LLM
Inference
on AMD NPUs
🚀
Inference
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for TileFuse: A Fused Mixed-Precision Kernel Library for Efficient Quantized LLM Inference on AMD NPUs
shoo99/paper-rag
: A private, fully-local
RAG
over your own PDFs: BGE-M3 + embedded Qdrant + a local
LLM
via
Ollama
. ~150 lines, nothing leaves your machine.
🗄️
Vector Databases
Content type:
Code
github.com
·
5d
5 days ago
·
DEV
Actions for shoo99/paper-rag: A private, fully-local RAG over your own PDFs: BGE-M3 + embedded Qdrant + a local LLM via Ollama. ~150 lines, nothing leaves your machine.
ClinicalBench: Can
LLMs
Beat Traditional
ML
Models
in Clinical Prediction?
🔧
MLOps
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?
Mind your key: An Empirical Study of
LLM
API Credential Leakage in iOS Apps
🧠
LLMs
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for Mind your key: An Empirical Study of LLM API Credential Leakage in iOS Apps
Nvidia DGX Spark GB10 –
AI
Models
and Guide with
vLLM
and Autonomous Script
🚀
Inference
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
Rosetta Memory: Adaptive Memory for
Cross-LLM
Agents
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Rosetta Memory: Adaptive Memory for Cross-LLM Agents
Impacts of Histories and
Models
on
LLM
Grading: A Study in Advanced Software
Engineering
Courses
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Impacts of Histories and Models on LLM Grading: A Study in Advanced Software Engineering Courses
heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient
LLM
inference
.
🚀
Inference
Content type:
Code
github.com
·
4d
4 days ago
·
r/LocalLLaMA
Actions for heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
Benchmarking and Exploring the Capabilities of
LLMs
for Attack Investigations
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Benchmarking and Exploring the Capabilities of LLMs for Attack Investigations
Show HN: TuringLLM – a
LLM-powered
Universal Turing
machine
🧠
LLMs
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for Show HN: TuringLLM – a LLM-powered Universal Turing machine
"I understand your perspective":
LLM
Persuasion and Sycophancy through the Lens of Communicative Action Theory
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for "I understand your perspective": LLM Persuasion and Sycophancy through the Lens of Communicative Action Theory
LC-QAT:
Data-Efficient
2-Bit QAT for
LLMs
via Linear-Constrained
Vector
Quantization
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization
KJLdefeated/RL.cu: RLVR training for
LLM
in CUDA/C++
🚀
Inference
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
When
LLMs
Invent Rust Crates: An Empirical Study of Hallucination Patterns and Mitigation
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for When LLMs Invent Rust Crates: An Empirical Study of Hallucination Patterns and Mitigation
Show HN: CLI for scoring OpenAPI for
LLM
legibility
🧠
LLMs
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for Show HN: CLI for scoring OpenAPI for LLM legibility
AuRA: Internalizing Audio Understanding into
LLMs
as LoRA
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for AuRA: Internalizing Audio Understanding into LLMs as LoRA
Guts444/argus-agent: Windows-native local-first
AI
agent command center with durable memory, connected context, tools, private web research, and Telegram support.
🧠
LLMs
Content type:
Code
github.com
·
4d
4 days ago
·
r/vibecoding
Actions for Guts444/argus-agent: Windows-native local-first AI agent command center with durable memory, connected context, tools, private web research, and Telegram support.
Dep-LLM
: Training-Free Depression Diagnosis via Evidence-Guided Structured
Multi-factor
with Reliable
LLM
Reasoning
🚀
Inference
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning
Evaluating Hallucinations in Domain-Adapted Large Language
Models
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Evaluating Hallucinations in Domain-Adapted Large Language Models
feat(parallel): add free Parallel Search MCP as the zero-config defau… · openclaw/openclaw@983b65b
🧠
LLMs
Content type:
Code
github.com
·
4d
4 days ago
Actions for feat(parallel): add free Parallel Search MCP as the zero-config defau… · openclaw/openclaw@983b65b
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help