Post-Training

Feeds to Scour
SubscribedAll
Scoured 157 posts in 4.3 ms

How Will the Multimodal AI Market Grow Through 2034 Amid Emerging Trends and Business Strategies?

 👁️Multimodal LLMs  Content type: Blog

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

 🤖LLM Inference  Content type: Academic
arxiv.org·

(Mis)generalization of Helpful-Only Fine-tuning

 🤖LLM Inference
lesswrong.com·

RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning

 🤖LLM Inference  Content type: Academic
arxiv.org·

PSA: Convoy offers SFT-70 4000K, CRI 90 (pre-production)

 🤖LLM Inference

I built a machine that turns AI papers into interactive explainers

 🔍Retrieval-Augmented Generation  Content type: Blog
blog.skz.dev·

SLUUG Talk: Demystifying Large Language Models on Linux

 🤖LLM Inference  Content type: Code
github.com··DEV

PriFT: Prior-Support Guided Supervised Fine-Tuning

 🤖LLM Inference  Content type: Academic
arxiv.org·

The Substitution Wave in AI

 ⚙️AI Infrastructure
tomtunguz.com·

X-VPN proves its privacy credentials with new independent no-logs audit

 🤖LLM Inference  Content type: News
techradar.com
·

Can You Hide From a Natural Language Autoencoder?

 🤖LLM Inference  Content type: Blog
yogesh.bearblog.dev·

You Can Catch Sleeper Agents by Teaching Another Model to Imitate Them

 🤖LLM Inference
lesswrong.com·

On the Geometry of On-Policy Distillation

 🤖LLM Inference  Content type: Academic
arxiv.org·

Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization

 🤖LLM Inference  Content type: Academic
arxiv.org·

AWS Destroyed the Value Proposition for Bedrock

 ⚙️AI Infrastructure  Content type: Blog
securosis.com·

Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families

 🤖LLM Inference  Content type: Academic
arxiv.org·

Optimisation over non-stationary distributions creates weirder minds

 🤖LLM Inference
lesswrong.com·

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

 ⚙️AI Infrastructure  Content type: Code
github.com··r/SideProject

Stage-1 Controls the Entropy Regime, Not the Outcome

 👁️Multimodal LLMs  Content type: Academic
arxiv.org·

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

 🤖LLM Inference  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help