Model Training
pretraining, fine-tuning, training run, compute, loss curve
BacteReason: A Reasoning Model for Antimicrobial Resistance Prediction
📐 Scaling Laws · Content type: Academic · biorxiv.org · 4 days ago
Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 12 hours ago
Parameter-Efficient Adapter Tuning for Tabular-Image Multimodal Learning
🧠 AI Research · Content type: Academic · arxiv.org · 12 hours ago
ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.
📐 Scaling Laws · Content type: Code · github.com · 1 day ago · Hacker News
When Probing Accuracy Saturates, Fragility Resolves: A Complementary Metric for LLM Pre-Training Analysis
💬 LLMs · Content type: Academic · arxiv.org · 12 hours ago
(Mis)generalization of Helpful-Only Fine-tuning
🎮 Reinforcement Learning · lesswrong.com · 6 days ago
ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning
💬 LLMs · Content type: Academic · arxiv.org · 12 hours ago
mirkolenz/llmhop: Tiny, stateless Go router that dispatches OpenAI-compatible requests to single-model vLLM and sglang backends with zero external dependencies
💬 LLMs · Content type: Code · github.com · 6 days ago · Hacker News
When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 1 day ago
Corpus Augmentation for Sign Language Translation via LLM-Guided Video Stitching
💬 LLMs · Content type: Academic · arxiv.org · 12 hours ago
heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
💬 LLMs · Content type: Code · github.com · 4 days ago · r/LocalLLaMA
A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 1 day ago
If Claude Fable stops helping you, you'll never know
💬 LLMs · Content type: Blog · jonready.com · 1 day ago · Lobsters, Hacker News
Training Deliberative Monitors for Black-Box Scheming Detection
🎮 Reinforcement Learning · lesswrong.com · 6 days ago
Simplicity Suffices for Parameter Noise Injection in Stochastic Gradient Descent
📉 Deep Learning · Content type: Academic · arxiv.org · 12 hours ago
Does anyone know what PCIe mode was used for these benchmarks?
💬 LLMs · Content type: Code · github.com · 4 days ago · r/LocalLLaMA
Harness In-Context Operator Learning with Chain of Operators
💬 LLMs · Content type: Academic · arxiv.org · 12 hours ago
PriFT: Prior-Support Guided Supervised Fine-Tuning
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 2 days ago
SlideCheck: Guiding Self-Supervised Pretraining of Pathology Foundation Models via Dataset Distributions
💬 LLMs · Content type: Academic · arxiv.org · 2 days ago
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 12 hours ago