🎯 RLHF - ibrahimsharaf · Scour

Goal-Conditioned Supervised Learning for LLM Fine-Tuning 🎯LLM Finetuning

NLA Verbalizations on AuditBench: Llama 70B 📊LLM Evaluation

lesswrong.com·4d

LLM Inference 🚀LLM Deployment

iop.systems·2h

The Safety Paradox: How RLHF Creates the AI Psychosis Problem It’s Meant to Prevent ⚙️Transformers

promptinjection.net·2d·Hacker News

I Tried Offline RL With Logs 🎯LLM Finetuning

·49m

Liquid Harness — from zero to a fine-tuned LFM in under an hour 🎯LLM Finetuning

lqh.ai·5d·Hacker News

Show HN: Marlin-2B: a tiny VLM to extract structured information from videos ⚡Quantization

huggingface.co·2d·Hacker News

Putin and Xi deepen anti-West axis 🗣️NLP

intellinews.com·20h

MegaTrain Full Precision Training of 100B+ Parameter LLMs on a Single GPU 🚀LLM Deployment

github.com·3d·Hacker News

Fixing LLM Writing with Distribution Fine Tuning 🎯LLM Finetuning

rosmine.ai·2d·Hacker News

PM Modi Leaves For Italy After Norway Visit, India-Nordic Summit 🚀LLM Deployment

The Rise of the Resident Eccentric in Tech 🏢LLM Adoption

mrmarket.bearblog.dev·6d

Self-Improving Reward Models 🛡️AI Safety

canvas.inc·1d·Hacker News

Document-tuning instills durable animal compassion in LLMs (and generalizes to humans) 🏢LLM Adoption

lesswrong.com·1h

Distributed Direct Preference Optimization 📐Vector Search

India in green tech accord with Nordics 🗣️NLP

newindianexpress.com·1d

Finland on Alert as Suspected Drones Trigger Early-Morning Lockdown in Uusimaa; The Temporary Airspace Ban Is Now Lifted | Finland Today | News in English 🛡️AI Safety

finlandtoday.fi·5d

Report: Upper secondary student numbers to halve in parts of Finland 📊LLM Evaluation

helsinkitimes.fi·2d

Ethereum considers staking reward model change to boost ETH price outlook 🕸️Knowledge Graphs

cryptobriefing.com·6d

The Last Thing Mark Carney Needs Is Trudeau-Era Rhetoric 📊LLM Evaluation

thewalrus.ca·2d

Log in to enable infinite scrolling