🎛️ Fine-tuning
LoRA, Model Training, LLM Adaptation
Scoured 317 posts in 6.9 ms
Parameter-Efficient Fine-Tuning with Learnable Rank
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 6 days ago
Why LLMs (still) lack taste
💬 LLMs · beyondtheprior.com · 1 day ago · Hacker News
Tracing Eval-Awareness Emergence Through Training of OLMo 3
🎯 RLHF · lesswrong.com · 11 hours ago
brunokeymolen/lora: LoRa (Long Range) communication related projects
🎯 Fine-Tuning · Content type: Code · github.com · 2 days ago · Hacker News
Fine-tune FLUX.2 [Klein] with a LoRA under 60 minutes
🎯 Fine-Tuning · Content type: Blog · huggingface.co · 6 days ago · Hacker News
Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
🎯 RLHF · turingpost.com · 3 days ago
Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
🤖 AI · aermia.com · 4 days ago · Hacker News
Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 18 hours ago
Some Interesting Papers on RLVR
🎮 Reinforcement Learning · lesswrong.com · 1 day ago
Which LoRA? An Empirical Study on the Effectiveness of LoRA Techniques During Multilingual Instruction Tuning
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 18 hours ago
How to reduce capability degradation from off-model SFT
✍️ Prompt Engineering · lesswrong.com · 2 days ago
Sen-sou/Bobs-Lora-Loader-Anima: A custom LoRA loader node for ComfyUI with advanced block-weighting controls for SDXL, FLUX and Anima models. Features presets for common use-cases like 'Character' and 'Style', and a 'Custom' mode for fine-grained control over individual model blocks.
🎨 Generative AI · Content type: Code · github.com · 3 days ago · r/StableDiffusion
When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff
🎯 RLHF · Content type: Academic · arxiv.org · 18 hours ago
A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design
🎯 RLHF · Content type: Academic · arxiv.org · 18 hours ago
RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning
🎯 RLHF · Content type: Academic · arxiv.org · 2 days ago
Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 18 hours ago
The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 2 days ago
Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It
✍️ Prompt Engineering · Content type: Academic · arxiv.org · 18 hours ago
Emergence of Context Characteristics Sensitivity in Large Language Models
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 1 day ago
Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems
🎯 RLHF · Content type: Academic · arxiv.org · 18 hours ago