Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Web Datasets
🗄️ Web Datasets
Common Crawl, Corpus, Training data, Web scraping
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
66
posts in
47.4
ms
Publishers push
Common
Crawl
to stop collecting content for AI
training
🔗
Interoperability
searchengineland.com
·
20h
20 hours ago
Actions for Publishers push Common Crawl to stop collecting content for AI training
US publishers tell
Common
Crawl
to stop
scraping
and delete archive
🔗
Interoperability
pressgazette.co.uk
·
1d
1 day ago
·
Hacker News
Actions for US publishers tell Common Crawl to stop scraping and delete archive
Pythia 1.4B reproduces 3.6% of
training
samples verbatim given 950-token prompts
⚡
Fast AI Inference
Content type:
Blog
ret2libc.com
·
3d
3 days ago
·
Hacker News
Actions for Pythia 1.4B reproduces 3.6% of training samples verbatim given 950-token prompts
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
·
Hacker News
Actions for AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...
⚡
Fast AI Inference
digg.com
·
6d
6 days ago
Actions for NVIDIA releases Nemotron 3 Ultra, claiming five times the speed and 30 percent lower costs than prior modelsThe model delivers 300 tokens per second on benchmar...
Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.
🔧
Agent Tooling
Content type:
Code
github.com
·
1h
1 hour ago
·
Hacker News
Actions for Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.
Google’s DiffusionGemma is 4x faster than its other Gemma models
🤖
AI
thenewstack.io
·
2h
2 hours ago
Actions for Google’s DiffusionGemma is 4x faster than its other Gemma models
LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…
📊
Embeddings
pub.towardsai.net
·
1d
1 day ago
Actions for LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…
nex-agi/Nex-N2-mini •
Huggingface
🏗️
LLM Infrastructure
huggingface.co
·
6d
6 days ago
·
r/LocalLLaMA
Actions for nex-agi/Nex-N2-mini • Huggingface
My life as a human pincushion continues (Day 17, post-surgery)
🎆
Year End
creolened.com
·
1d
1 day ago
Actions for My life as a human pincushion continues (Day 17, post-surgery)
Stack Overflow didn't just help AI learn to code
🤖
AI
zozo123.github.io
·
3d
3 days ago
·
Hacker News
Actions for Stack Overflow didn't just help AI learn to code
Less-relevant results
Testing MiniMax M3 on real tasks: repo refactor, screenshot debugging, and Spotify recommendations
🆕
New AI
Content type:
Blog
andlukyane.com
·
20h
20 hours ago
·
Hacker News
Actions for Testing MiniMax M3 on real tasks: repo refactor, screenshot debugging, and Spotify recommendations
OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for
training
agents.
🔗
Interoperability
Content type:
Blog
huggingface.co
·
2d
2 days ago
·
Hacker News
,
r/LocalLLaMA
Actions for OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.
Multiple Runtimes, Reducing Your Claude Code Bill, and Your Doctors
Database
💻
Claude Code
thereactnativerewind.com
·
4h
4 hours ago
·
r/javascript
,
r/reactjs
Actions for Multiple Runtimes, Reducing Your Claude Code Bill, and Your Doctors Database
I Asked 50 Developers How They Manage Browser Tabs (And the Results Are Wild)
🎨
Graphic Design
gopeek-lovat.vercel.app
·
8h
8 hours ago
·
Hacker News
Actions for I Asked 50 Developers How They Manage Browser Tabs (And the Results Are Wild)
What Does Abliteration Actually Cost?
🤖
AI
lesswrong.com
·
5d
5 days ago
Actions for What Does Abliteration Actually Cost?
SafeRun: Enabling Determinism in
LLM
Planning for Running
🏆
LLM Benchmarking
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for SafeRun: Enabling Determinism in LLM Planning for Running
Job Searcher
🤖
AI
Content type:
Blog
huggingface.co
·
4d
4 days ago
Actions for Job Searcher
Purpose-built local AI agents
🤖
AI
Content type:
Blog
samihonkonen.com
·
1d
1 day ago
·
Hacker News
Actions for Purpose-built local AI agents
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting Gemma 4 and Qwen 3.6 MTP modes
🤖
AI
Content type:
Code
github.com
·
19h
19 hours ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help