Fine-tuning

Feeds to Scour
SubscribedAll
Scoured 215 posts in 8.0 ms

Deep Learning Weekly: Issue 458

 🤖AI Agents

When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·
Less-relevant results

Google Colab CLI opens runtimes to Claude Code and Codex

 🗄️Vector Databases

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents

 🤖AI Agents  Content type: Blog

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

 🤖AI Agents
saintlex.sbs··DEV

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

 🔁Spaced Repetition  Content type: Blog
huggingface.co·

Robust Multi-Mutant Protein Stability Prediction from a Fine-Tuned Evolutionary Scale Model

 Inference  Content type: Academic
biorxiv.org·

DiffusionGemma: The Developer Guide

 Inference  Content type: Blog

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

 🧠LLMs

Introducing Granite Libraries and Project Granite Switch

 🔍RAG  Content type: Blog

Some Interesting Papers on RLVR

 🎯Reinforcement Learning
lesswrong.com·

Can You Hide From a Natural Language Autoencoder?

 Inference  Content type: Blog
yogesh.bearblog.dev·

fc2

 📝Obsidian
yog.ink·

Introducing the Google Colab CLI

 🗄️Vector Databases  Content type: Blog

A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design

 🌐World Models  Content type: Academic
arxiv.org·

Replicate vs Gemini API: An Honest Cost Breakdown of Photo Generation (Real Production Numbers)

 🧪Synthetic Data  Content type: Blog
medium.com·

Latest technical articles & videos.

 🔌MCP
certdepot.net·

Hacker News Cohort Collectively Dismisses Anthropic and Champions Chinese Models over Fable's Fumble

 🔬AI Research  Content type: Discussion

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 Inference  Content type: Code
github.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help