🎭 Mixture of Experts
MoE Architecture, Sparse Models, Gating Networks, Model Scaling
DiffusionGemma is Google’s fastest AI yet, but it comes with a big trade-off
💾
KV Cache
androidauthority.com
·
5h
5 hours ago
Harnessing Routing Foresight for Micro-step-level MoE load balancing in RL Post-training
🤖
agentic system
Content type:
Academic
arxiv.org
·
7h
7 hours ago
Google’s Sergey Brin Sees A Path To AGI But Not Beyond It via @sejournal, @martinibuster
🔄
Transformers
searchenginejournal.com
·
5d
5 days ago
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
💾
KV Cache
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
AI Week in Review 26.06.06
🤖
agentic system
Content type:
News, Blog
patmcguinness.substack.com
·
4d
4 days ago
·
Substack
Google's new open model DiffusionGemma generates text from noise instead of word by word
🔄
Transformers
the-decoder.com
·
16h
16 hours ago
TENP: Trapezoidal Expert Neuron Pruning For Mixture-of-Experts
↩️
Backpropagation
Content type:
Academic
arxiv.org
·
1d
1 day ago
Introducing the Third Generation of Apple’s Foundation Models
🔄
Transformers
machinelearning.apple.com
·
3d
3 days ago
·
Hacker News, r/apple
Qualcomm Announces On-Device AI Claw Ecosystem Plan
🤖
agentic system
autonews.gasgoo.com
·
3d
3 days ago
From Observation to Intervention: A Causal Audit of Expert Importance in Mixture-of-Experts Models
📊
LLM Evaluation
Content type:
Academic
arxiv.org
·
1d
1 day ago
Dnotitia Releases DNA 3.0, an Enterprise-Ready AI Language Model Family - HPCwire
🔄
Transformers
hpcwire.com
·
5d
5 days ago
PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning
⚡
Inference Optimization
Content type:
Academic
arxiv.org
·
1d
1 day ago
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
⚡
Inference Optimization
Content type:
Blog
towardsai.net
·
3d
3 days ago
Startup Ricursive to Create an End-to-End AI Model for Chip Design
🔲
TPU Architecture
Content type:
News
eetimes.com
·
19h
19 hours ago
Enhancing Multilingual LLM-based ASR with Mixture of Experts and Dynamic Downsampling
🎛️
Fine-Tuning
Content type:
Academic
arxiv.org
·
1d
1 day ago
MosaicIMU: Composing Carrier Experts for Generalizable Neural Inertial Odometry
⚡
FlashAttention
Content type:
Academic
arxiv.org
·
2d
2 days ago
Sakana AI's Recursive Self-Improvement (RSI) Lab
🤖
agentic system
sakana.ai
·
5d
5 days ago
·
Hacker News
FAME: Forecastability-Aware Mixture of Experts for Heterogeneous Time Series Forecasting
⚡
Inference Optimization
Content type:
Academic
arxiv.org
·
2d
2 days ago
MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
⚡
Inference Optimization
Content type:
News, Blog
kaitchup.substack.com
·
5d
5 days ago
·
r/LocalLLaMA
NGram-MoSE: Efficient Remote Sensing Super-Resolution via N-Gram Context and Mixture-of-Experts
🔄
Transformers
Content type:
Academic
arxiv.org
·
2d
2 days ago