Scour
🎯 Post-Training
RLHF, fine-tuning, alignment, instruction tuning
Scoured 157 posts in 4.3 ms
How Will the Multimodal AI Market Grow Through 2034 Amid Emerging Trends and Business Strategies?
👁️ Multimodal LLMs · Content type: Blog · semiconinsights.wordpress.com · 5 days ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
🤖 LLM Inference · Content type: Academic · arxiv.org · 1 day ago
(Mis)generalization of Helpful-Only Fine-tuning
🤖 LLM Inference · lesswrong.com · 6 days ago
RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning
🤖 LLM Inference · Content type: Academic · arxiv.org · 2 days ago
PSA: Convoy offers SFT-70 4000K, CRI 90 (pre-production)
🤖 LLM Inference · convoylight.com · 5 days ago · r/flashlight
I built a machine that turns AI papers into interactive explainers
🔍 Retrieval-Augmented Generation · Content type: Blog · blog.skz.dev · 5 days ago
SLUUG Talk: Demystifying Large Language Models on Linux
🤖 LLM Inference · Content type: Code · github.com · 3 days ago · DEV
PriFT: Prior-Support Guided Supervised Fine-Tuning
🤖 LLM Inference · Content type: Academic · arxiv.org · 1 day ago
The Substitution Wave in AI
⚙️ AI Infrastructure · tomtunguz.com · 3 days ago
X-VPN proves its privacy credentials with new independent no-logs audit
🤖 LLM Inference · Content type: News · techradar.com · 2 days ago
Can You Hide From a Natural Language Autoencoder?
🤖 LLM Inference · Content type: Blog · yogesh.bearblog.dev · 7 hours ago
You Can Catch Sleeper Agents by Teaching Another Model to Imitate Them
🤖 LLM Inference · lesswrong.com · 8 hours ago
On the Geometry of On-Policy Distillation
🤖 LLM Inference · Content type: Academic · arxiv.org · 2 days ago
Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization
🤖 LLM Inference · Content type: Academic · arxiv.org · 19 hours ago
AWS Destroyed the Value Proposition for Bedrock
⚙️ AI Infrastructure · Content type: Blog · securosis.com · 12 hours ago
Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families
🤖 LLM Inference · Content type: Academic · arxiv.org · 1 day ago
Optimisation over non-stationary distributions creates weirder minds
🤖 LLM Inference · lesswrong.com · 4 days ago
umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.
⚙️ AI Infrastructure · Content type: Code · github.com · 5 days ago · r/SideProject
Stage-1 Controls the Entropy Regime, Not the Outcome
👁️ Multimodal LLMs · Content type: Academic · arxiv.org · 1 day ago
Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output
🤖 LLM Inference · Content type: Academic · arxiv.org · 19 hours ago