🎯 Post-training - samveed · Scour

My research agenda and work

lesswrong.com·

Alignment Defends LLMs from Property Inference Attacks

🧠AI Academic

Less-relevant results

How Will the Multimodal AI Market Grow Through 2034 Amid Emerging Trends and Business Strategies?

💬LLMs Blog

semiconinsights.wordpress.com·

TAHOE: Text-to-SQL with Automated Hint Optimization from Experience

💬LLMs Academic

The sample efficiency black hole

🏋️Pretraining News

dwarkesh.com··Hacker News

(Mis)generalization of Helpful-Only Fine-tuning

🌐World Models

lesswrong.com·

Beyond the Golden Teacher: Enhancing Graph Learning through LLM-GNN Co-teaching

💬LLMs Academic

EDPB meets with EU Commissioner McGrath and adopts common data breach notification template

💻Software Engineering

edpb.europa.eu·

magenta/magenta-realtime: Magenta RealTime 2: An Open-Weights Live Music Model

💬LLMs Code

Can You Hide From a Natural Language Autoencoder?

🏋️Pretraining Blog

yogesh.bearblog.dev·

A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design

🎮RL Academic

Raize Orion Multi-framework GRC with anchored NIS2 reporting clocks

raizehq.dev··Hacker News

PriFT: Prior-Support Guided Supervised Fine-Tuning

🌐World Models Academic

Training Deliberative Monitors for Black-Box Scheming Detection

🌐World Models

lesswrong.com·

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

💬LLMs Academic

We Should Take Text Optimization More Seriously

💬LLMs Blog

yoonholee.com··Hacker News

Multilingual Refusal Alignment for Safer Large Language Models

💬LLMs Academic

Optimisation over non-stationary distributions creates weirder minds

🌐World Models

lesswrong.com·

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

🧠AI Academic

Job Searcher

💬LLMs Blog

huggingface.co·

Sign up or log in to see more results

Log in to enable infinite scrolling