🎯 Fine-tuning - saeedesmaili · Scour

Which LoRA? An Empirical Study on the Effectiveness of LoRA Techniques During Multilingual Instruction Tuning

🧠LLMs Academic

brunokeymolen/lora: LoRa (Long Range) communication related projects

📶ESP32 Code

github.com··Hacker News

Orchestrate your LLM pipeline. Locally

💬Natural Language Processing

llmforge.app··Hacker News

Fine-tuning Large Language Models (LLMs) using PEFT

🔬Deep Learning Blog

·

Tracing Eval-Awareness Emergence Through Training of OLMo 3

lesswrong.com·

[NEW MODEL] SupraLabs just released Supra1.5-50M Base (Experimental)!

🔤Tokenization

huggingface.co··r/LocalLLaMA

Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond

🎮Reinforcement Learning

turingpost.com·

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

Don't let the LLM speak, just probe it (8 minute read)

🤖LLM Blog

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

🤖Data science News

Malicious Hugging Face Models Could Trigger Remote Code Execution

🤖Data science

techrepublic.com·

Meshcore and Haiku: a Match Apparently Made in Italy

SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption

🤖Data science

eprint.iacr.org·

Less-relevant results

Google open-sources speedy DiffusionGemma text diffusion model

💬Natural Language Processing

siliconangle.com·

Hugging Face Transformers flaw enables RCE via malicious model configs

🤖Data science

Google's new open-weights model brings image-generation tricks to AI text generation

🤖Data science News

theregister.com·

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

🧠LLMs Academic

New comment by bkjlblh in "Claude Fable 5"

🤖LLM Discussion

news.ycombinator.com··Hacker News

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

🪟Context Windows

aermia.com··Hacker News

luciobaiocchi/heard: Offline group-safety mesh for hikers: ESP32 + GPS + LoRa, with a firmware-in-the-loop simulator and 3D replay viewer

📶ESP32 Code

github.com··Hacker News

Log in to enable infinite scrolling