📈 LLM Scaling
Specific: scaling laws, inference scaling, compute, throughput
Scoured 42 posts in 18.2 ms
🖥️ GPU Computing · morphllm.com · 4 days ago
Optimizing Models to Be Fast at Codegen
Covers: KernelBench: Can LLMs Write Efficient GPU Kernels?
Covered by: tldr.tech
Discussed on: Hacker News
📊 Machine Learning · arXiv · 15 hours ago
Solve for the Hyperparameter, Skip the Search: Kolmogorov-Optimal Scaling Laws for Spline Regression
🔐 Cybersecurity · blog.r-lopes.com · 5 days ago
The Line Vibe Coding Can't Cross
Covers: AI writes code faster. Your job is still to prove it works.
Discussed on: Hacker News
🏗️ Systems Design · arXiv · 15 hours ago
The Energy Consumption of Transformer Fine-Tuning: A Roofline-Inspired Scaling Model
📊 Machine Learning · sequenceanddestroy.substack.com · 6 days ago
Issue № 80 // Stable Points, Sensors, & Strange Attractors
Discussed on: Substack
🛡️ AI Safety · Lawfare · 6 days ago
Today on Lawfare: June 16, 2026
Discussed on: Substack
🏗️ Data Engineering · Jakob Nielsen on UX · 5 days ago
From AGI to ASI: DeepMind’s Roadmap as a Comic Book
Discussed on: Substack
🧠 LLMs · arXiv · 15 hours ago
L20-Edu-135M: An Auditable Single-GPU Study of Data-Efficient Small Language Modeling
🛡️ AI Safety · venturebeat.com · 6 days ago
Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again
Covers: 6 stories, including Anthropic/Claude AI is down
Covered by: 3 sources, including Bug, tldr.tech
Discussed on: Hacker News
🧠 Transformer Architecture · arXiv · 15 hours ago
Circuit Synchronization Precedes Generalization: A Causal Precursor to Grokking
🧠 Transformer Architecture · medium.com · 6 days ago
What Is Reflective Memory, and Why Does Your AI Agent Need It?
🧠 Transformer Architecture · arXiv · 6 days ago
Recursive Scaling in Masked Diffusion Models
⚙️ MLOps · arXiv · 4 days ago
Towards Engineering Scaling Laws with Pretraining Data Composition
📚 RAG · Lawfare · 3 days ago
The Week That Was 04e
Discussed on: Substack
🔐 Cybersecurity · arXiv · 6 days ago
How Inference Compute Shapes Frontier LLM Evaluation
🧠 LLMs · arXiv · 4 days ago
How LLMs Fail and Generalize in RTL Coding for Hardware Design?
🔬 Deep Learning · arXiv · 4 days ago
Statistical Properties of Training & Generalization
🧠 LLMs · arXiv · 4 days ago
Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe
🧠 Transformer Architecture · arXiv · 6 days ago
Adaptive Volumetric Mechanical Property Fields Invariant to Resolution
🖧 Distributed Systems · arXiv · 6 days ago
Universal scaling and relaxation in decaying turbulence of Bose gases