🎯 LLM Finetuning - ibrahimsharaf · Scour

Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI

🎯RLHF Blog

aws.amazon.com·

brunokeymolen/lora: LoRa (Long Range) communication related projects

⚡Speculative Decoding Code

github.com··Hacker News

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

🗣️NLP Academic

Tracing Eval-Awareness Emergence Through Training of OLMo 3

lesswrong.com·

In Mexico City, axolotl salamanders are everywhere before the World Cup — except in the wild

🔓Open Source AI News

Mexico’s unofficial World Cup mascot might already be extinct in the wild

🔓Open Source AI News

the-independent.com·

local llm on laptop 780M GPU using llama + gemma 4 qat

💻Local AI Blog

alper.bearblog.dev·

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

🔓Open Source AI Blog

huggingface.co·

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

turingpost.com·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🔓Open Source AI Blog

towardsai.net·

Less-relevant results

New comment by bkjlblh in "Claude Fable 5"

💬Prompt Engineering Discussion

news.ycombinator.com··Hacker News

TRL: GIVE EVERYBODY IN SCOTLAND A SHOVEL

🎯RLHF Blog

channel-6.ghost.io·

SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption

⚡Speculative Decoding

eprint.iacr.org·

Unsloth Gemma 4 QAT

Google Colab CLI opens runtimes to Claude Code and Codex

🔓Open Source AI

helpnetsecurity.com··r/ClaudeAI

If Claude Fable stops helping you, you'll never know

🛡️AI Safety Blog

jonready.com··Lobsters, Hacker News

Model predictive task sampling for efficient and robust adaptation

⚡Continuous Batching Academic

Finetuning masking challenges narrow-task evaluation of cell foundation models

⚡Continuous Batching Academic

The Non Profit Association Delivering Future Collaborative Opensource Tools for Energy System Simulation

🧪Synthetic Data

Fine tuning classification in Elixir

📐Vector Search

elixirstatus.com·
