Fine-tuning

brunokeymolen/lora: LoRa (Long Range) communication related projects

📶 ESP32 | Content type: Code
github.com | Hacker News

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

馃LLMsContent type: Academic
arxiv.org

Orchestrate your LLM pipeline. Locally

💬 Natural Language Processing
llmforge.app | Hacker News

Introducing North Mini Code: Cohere’s First Model For Developers

Data science | Content type: Blog
huggingface.co | Hacker News

Why LLMs (still) lack taste

馃LLM

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

🪟 Context Windows
aermia.com | Hacker News

Less-relevant results

Alleged Fable sabotage of an ML project

馃Data science
xcancel.comHacker News

Anthropic's Fable 5 Silent Sabotage Mode

馃Obsidian

If Claude Fable stops helping you, you’ll never know

AI Agents

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

馃LLM Inference

DiffusionGemma: The Developer Guide - Google Developers Blog

LLM Inference | Content type: Blog

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

馃Data science
smolhub.comr/LocalLLaMA

Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning

馃TransformersContent type: Academic
arxiv.org

Vibe Diaries: Training Nanochat

🔤 Tokenization
vibediary.dev | Hacker News

[NEW MODEL] SupraLabs just released Supra1.5-50M Base (Experimental)!

🔤 Tokenization
huggingface.co | r/LocalLLaMA

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

馃LLM Inference

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

🎮 Reinforcement Learning
venturebeat.com | Hacker News

Apple WWDC On-Device AI Deep Dive - Google Docs

馃LLMs
gist.isHacker News

DiffusionGemma: 4x Faster Text Generation

馃Data scienceContent type: NewsContent type: Blog

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

🔬 Deep Learning | Content type: Code
github.com | Hacker News
