💬 NLP
natural language processing, text analysis, language models, NLU
Scoured 131 posts in 12.6 ms

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
✨ LLMs
Content type: Code
github.com · 3 days ago · Hacker News

A system programmer’s guide to LLM inference
🤖 AI
Content type: Blog
blog.xiangpeng.systems · 2 days ago · Hacker News

Attention Expansion: Enhancing Keyphrase Extraction from Long Documents with Attention-Augmented Contextualized Embeddings
🔍 Information Retrieval
Content type: Academic
arxiv.org · 8 hours ago

Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
🔧 Agent Tooling
Content type: Blog
socket.dev · 1 day ago · Hacker News

Large companies can add a local LLM filter layer to considerably reducing their AI costs
🤖 AI
umrashrf.github.io · 4 days ago · Hacker News

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.
👨‍💻 AI Coding
Content type: Code
github.com · 19 hours ago · Hacker News

I built a sentiment analyzer for Hacker News (as an MCP server)
🔧 Agent Tooling
mcpize.com · 2 days ago · Hacker News

Agentic AI frameworks compared: LangChain, LangGraph, AutoGen
🔧 Agent Tooling
Content type: Blog
udacity.com · 4 days ago

Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech
ℹ️ Information Theory
Content type: Academic
arxiv.org · 8 hours ago

Vibe Diaries: Training Nanochat
🧠 Machine Learning
vibediary.dev · 1 day ago · Hacker News

StereoTales: Multilingual Open-Ended Stereotype Discovery in LLMs
🎭 Claude
Content type: Blog
research.giskard.ai · 6 days ago · Hacker News

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
🤖 AI
Content type: Code
github.com · 11 hours ago · Hacker News

Auditing Training Data in Domain-adapted LLMs: LoRA-MINT
✨ LLMs
Content type: Academic
arxiv.org · 2 days ago

What Are Tokens in LLMs?
🤖 LLM
Content type: Blog
bearisland.dev · 3 days ago · Hacker News

How LLMs Actually Work: A Friendly Map for Humans • oreoro
🔧 Agent Tooling
oreoro.github.io · 4 days ago · Hacker News

Causal Semantic Alignment for LLM-based Time Series Forecasting
✨ LLMs
Content type: Academic
arxiv.org · 1 day ago

How to Measure Time To First Token (TTFT) in AI Systems
🤖 AI
qainsights.com · 3 days ago · Hacker News

Phase transition in large language models and the criticality of natural languages
🧠 Machine Learning
Content type: Academic
arxiv.org · 1 day ago

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
🤖 AI
deemwar-products.github.io · 5 days ago · Hacker News

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios
🧠 Machine Learning
Content type: Blog
huggingface.co · 6 days ago