🎯 Post-training - amy_yunduo · Scour

🧠LLMs fareedkhan-dev.github.io·

Train LLM from Scratch

Discussed on Hacker News

📊LLM Evaluation arXiv·

Weight-Space Geometry of Offline Reasoning Training

Less-relevant results

🏗️AI Infra Liquid AI·

LFM2.5-230M: Built to Run Anywhere

Covered by VentureBeat

🛡️AI Safety Pangeanic Blog·

From Fine-Tuning to Red Teaming: The Data Operations Behind Reliable AI Models

Covers AI Risk Management Framework

🧠LLMs Bloomberg·

Tech Disruptors: Invisible Technologies on RLHF and LLM Training

🧠LLMs zentara.co·

LLM Refusal Behavior on Open-Weight Model

Discussed on Hacker News

📚RAG Hacker News·

Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

Discussed on Hacker News

🧠LLMs GitHub·

Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch

Discussed on Hacker News

🧠LLMs Digital Trends·

As Hollywood jobs dry up, workers are quietly training AI models to survive

Covers I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI

🛡️AI Safety arXiv·

Paved with True Intents: Intent-Aware Training Improves LLM Safety Classification Across Training Regimes

📊LLM Evaluation Helsinki Times·

Orpo intervenes in NGO funding dispute as Soste faces major job cuts

🏗️AI Infra lemmy.ml·

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

🗄️Vector Databases Nature·

Patterns of Edtech use and mastery among university students: an exploratory socio-cognitive analysis

🤖AI Agents chapterpal.com·

Sakana Fugu Technical Report

Discussed on Hacker News

🧠LLMs biorxiv.org·

CellTosg2Sequence: A Unified Text-Omics-Signaling-Graph Large Language Model for Single-Cell Analysis

🧠LLMs arXiv·

Reasoning Quality Emerges Early: Data Curation for Reasoning Models

✍️Prompt Engineering Hugging Face·

Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments

Covers 2 stories including vllm-project/vllm

Covered by 3 sources including GitHub, indiehacker.news

Discussed on r/LocalLLaMA

✍️Prompt Engineering MicroScope·

Met Palantir pilot: The DPIA that raises more questions than answers

🛡️AI Safety gdpredirect.com·

Become EU compliant in one line of code (satire)

Discussed on Hacker News

✍️Prompt Engineering fig.inc·

Breaking Browser-Use Models Using Domain Randomization

Covers Kimi K2.5: Visual Agentic Intelligence

Discussed on Hacker News
