Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Fine-tuning
🎛️ Fine-tuning
LoRA, Model Training, LLM Adaptation
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
202
posts in
8.0
ms
Parameter-Efficient
Fine-Tuning
with Learnable Rank
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Parameter-Efficient Fine-Tuning with Learnable Rank
Measuring Embedding Drift: Why Hybrid Search Saves Stale
Models
.
🎯
Fine-Tuning
pub.towardsai.net
·
15h
15 hours ago
Actions for Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.
Some Interesting Papers on RLVR
🎮
Reinforcement Learning
lesswrong.com
·
1d
1 day ago
Actions for Some Interesting Papers on RLVR
A Deep Dive into Calibration of
Language
Models
: Platt Scaling, Isotonic Regression, Temperature Scaling
✍️
Prompt Engineering
kdnuggets.com
·
5d
5 days ago
Actions for A Deep Dive into Calibration of Language Models: Platt Scaling, Isotonic Regression, Temperature Scaling
Five Ways to
Fine-Tune
Chronos-2, the Time Series Foundation
Model
🎯
Fine-Tuning
towardsdatascience.com
·
6d
6 days ago
Actions for Five Ways to Fine-Tune Chronos-2, the Time Series Foundation Model
Tracing Eval-Awareness Emergence Through
Training
of OLMo 3
🎯
RLHF
lesswrong.com
·
10h
10 hours ago
Actions for Tracing Eval-Awareness Emergence Through Training of OLMo 3
RASFT: Rollout-Adaptive
Supervised
Fine-Tuning
for Reasoning
🎯
RLHF
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning
Small Data, Big Noise: Adversarial
Training
for Robust
Parameter-Efficient
Fine-Tuning
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Small Data, Big Noise: Adversarial Training for Robust Parameter-Efficient Fine-Tuning
How to reduce capability degradation from
off-model
SFT
✍️
Prompt Engineering
lesswrong.com
·
2d
2 days ago
Actions for How to reduce capability degradation from off-model SFT
Fine-tuning
vs RAG vs MeMo: Where should
LLM
Knowledge Live?
🎯
Fine-Tuning
pub.towardsai.net
·
4d
4 days ago
Actions for Fine-tuning vs RAG vs MeMo: Where should LLM Knowledge Live?
Which
LoRA
? An Empirical Study on the Effectiveness of
LoRA
Techniques During Multilingual Instruction
Tuning
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Which LoRA? An Empirical Study on the Effectiveness of LoRA Techniques During Multilingual Instruction Tuning
The
Fine-Tuning
Trap
: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning
When
RL
Fails after
SFT
: Rejuvenating
Model
Plasticity for Robust
SFT-to-RL
Handoff
🎯
RLHF
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for When RL Fails after SFT: Rejuvenating Model Plasticity for Robust SFT-to-RL Handoff
Emergence of Context Characteristics Sensitivity in
Large
Language
Models
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Emergence of Context Characteristics Sensitivity in Large Language Models
A Unifying Lens on
Supervised
Fine-Tuning
Through Target Distribution Design
🎯
RLHF
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for A Unifying Lens on Supervised Fine-Tuning Through Target Distribution Design
Instruction
Finetuning
DeepSeek-R1-8B
Model
Using
LoRA
and NEFTune
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune
PriFT: Prior-Support Guided
Supervised
Fine-Tuning
🎯
RLHF
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for PriFT: Prior-Support Guided Supervised Fine-Tuning
Sequential Data Poisoning in
LLM
Post-Training
🎯
RLHF
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Sequential Data Poisoning in LLM Post-Training
Auditing
Training
Data in
Domain-adapted
LLMs:
LoRA-MINT
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Auditing Training Data in Domain-adapted LLMs: LoRA-MINT
Attention Amnesia in Hybrid LLMs: When CoT
Fine-Tuning
Breaks Long-Range Recall, and How to Fix It
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help