🎯 Post-Training - touyou · Scour

How Will the Multimodal AI Market Grow Through 2034 Amid Emerging Trends and Business Strategies?

👁️Multimodal LLMs Blog

semiconinsights.wordpress.com·

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

🤖LLM Inference Academic

(Mis)generalization of Helpful-Only Fine-tuning

🤖LLM Inference

lesswrong.com·

RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning

🤖LLM Inference Academic

PSA: Convoy offers SFT-70 4000K, CRI 90 (pre-production)

🤖LLM Inference

convoylight.com··r/flashlight

I built a machine that turns AI papers into interactive explainers

🔍Retrieval-Augmented Generation Blog

SLUUG Talk: Demystifying Large Language Models on Linux

🤖LLM Inference Code

github.com··DEV

PriFT: Prior-Support Guided Supervised Fine-Tuning

🤖LLM Inference Academic

The Substitution Wave in AI

⚙️AI Infrastructure

tomtunguz.com·

X-VPN proves its privacy credentials with new independent no-logs audit

🤖LLM Inference News

·

Can You Hide From a Natural Language Autoencoder?

🤖LLM Inference Blog

yogesh.bearblog.dev·

You Can Catch Sleeper Agents by Teaching Another Model to Imitate Them

🤖LLM Inference

lesswrong.com·

On the Geometry of On-Policy Distillation

🤖LLM Inference Academic

Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization

🤖LLM Inference Academic

AWS Destroyed the Value Proposition for Bedrock

⚙️AI Infrastructure Blog

securosis.com·

Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families

🤖LLM Inference Academic

Optimisation over non-stationary distributions creates weirder minds

🤖LLM Inference

lesswrong.com·

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

⚙️AI Infrastructure Code

github.com··r/SideProject

Stage-1 Controls the Entropy Regime, Not the Outcome

👁️Multimodal LLMs Academic

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

🤖LLM Inference Academic

Sign up or log in to see more results

Log in to enable infinite scrolling