LLM Finetuning

Feeds to Scour
SubscribedAll
Scoured 686 posts in 23.9 ms

Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI

 🎯RLHF  Content type: Blog
aws.amazon.com·

brunokeymolen/lora: LoRa (Long Range) communication related projects

 Speculative Decoding  Content type: Code
github.com··Hacker News

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

 🗣️NLP  Content type: Academic
arxiv.org·

Tracing Eval-Awareness Emergence Through Training of OLMo 3

 🎯RLHF
lesswrong.com·

In Mexico City, axolotl salamanders are everywhere before the World Cup — except in the wild

 🔓Open Source AI  Content type: News
yahoo.com·

Mexico’s unofficial World Cup mascot might already be extinct in the wild

 🔓Open Source AI  Content type: News
the-independent.com·

local llm on laptop 780M GPU using llama + gemma 4 qat

 💻Local AI  Content type: Blog
alper.bearblog.dev·

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

 🔓Open Source AI  Content type: Blog
huggingface.co·

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

 🎯RLHF
turingpost.com·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

 🔓Open Source AI  Content type: Blog
towardsai.net·
Less-relevant results

New comment by bkjlblh in "Claude Fable 5"

 💬Prompt Engineering  Content type: Discussion

TRL: GIVE EVERYBODY IN SCOTLAND A SHOVEL

 🎯RLHF  Content type: Blog
channel-6.ghost.io
·

SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption

 Speculative Decoding
eprint.iacr.org·

Unsloth Gemma 4 QAT

 💻Local AI
unsloth.ai·

Google Colab CLI opens runtimes to Claude Code and Codex

 🔓Open Source AI

If Claude Fable stops helping you, you'll never know

 🛡️AI Safety  Content type: Blog
jonready.com··Lobsters, Hacker News

Model predictive task sampling for efficient and robust adaptation

 Continuous Batching  Content type: Academic
nature.com·

Finetuning masking challenges narrow-task evaluation of cell foundation models

 Continuous Batching  Content type: Academic
biorxiv.org·

The Non Profit Association Delivering Future Collaborative Opensource Tools for Energy System Simulation

 🧪Synthetic Data
cresym.eu·

Fine tuning classification in Elixir

 📐Vector Search
elixirstatus.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help