Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 LLM Finetuning
Specific
fine-tuning, LoRA, PEFT, instruction tuning, RLHF
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
150778
posts in
9.8
ms
The Art of (Mis)alignment: How Fine-Tuning Methods Effectively
Misalign
and
Realign
LLMs in Post-Training
🧠
LLMs
arxiv.org
·
11h
Gemma
4
Fine-Tuning
Guide
🔓
Open Source AI
unsloth.ai
·
1d
·
Hacker News
Show HN:
ECX
a 'Jail-Fix' for
RLHF
Neutrality Loops in LLMs
💻
Local AI
zenodo.org
·
6d
·
Hacker News
Fine-tuning
vs RAG vs
prompting
🧠
LLMs
akanksharaghav.medium.com
·
6h
You
Fine-Tuned
Your Model. Now It’s Worse. Here’s the Concept You Were Never
Taught
.
💻
Local AI
pub.towardsai.net
·
1d
Fine-Tuning
Gemma
4 with Cloud Run Jobs: Serverless GPUs (NVIDIA RTX 6000 Pro) for pet
breed
…
🔓
Open Source AI
medium.com
·
5h
Show HN: Pre-training,
fine-tuning
, and
evals
platform
🚀
LLM Deployment
oumi.ai
·
6d
·
Hacker News
How I Built a
Fine-Tuned
Medical AI App and
Deployed
It End-to-End on AWS
🚀
LLM Deployment
medium.com
·
14h
wuwangzhang1216/abliterix
: Fully automatic censorship removal for language models. LoRA abliteration + Optuna TPE optimization.
🚀
LLM Deployment
github.com
·
2d
·
r/LocalLLaMA
When Should You Use RAG vs
Fine-Tuning
in Microsoft
Foundry
?
🚀
LLM Deployment
techcommunity.microsoft.com
·
13h
Cseti/LTX2.3-22B
_IC-LoRA-Cameraman_v1
💻
Local AI
huggingface.co
·
2d
Fine-tuning
Whisper
to my speech: 27% to 6.5%
WER
🔬
Small LMs
vivekkairi.com
·
5d
·
Hacker News
RAG
vs
Fine-Tuning
: What I Learned Building a Real AI Product
🏢
LLM Adoption
medium.com
·
3d
Unlocking
LoRA
Moe
RL for Qwen3.5
🚀
LLM Deployment
osmosis.ai
·
6d
·
Hacker News
Large Language Model Post-Training: A
Unified
View of Off-Policy and On-Policy Learning
🧠
LLMs
arxiv.org
·
11h
Model
Packaging
Tools Every
MLOps
Engineer Should Know
🚀
LLM Deployment
freecodecamp.org
·
4d
RLHF
Pipelines Have a
QC
Blind Spot
🛡️
AI Safety
medium.com
·
3d
Controlling
Distributional
Bias in Multi-Round LLM Generation via
KL-Optimized
Fine-Tuning
🚀
LLM Deployment
arxiv.org
·
2d
Reinforcement
Learning From Human Feedback (
RLHF
) in Large Language Models(LLMs)
🚀
LLM Deployment
pub.towardsai.net
·
6d
Show HN:
LunaLora
: Multi-LoRA System to Combat Catastrophic
Forgetting
💻
Local AI
github.com
·
6d
·
Hacker News
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help