Fine-tuning
LoRA, Model Training, LLM Adaptation
Scoured 316 posts in 8.4 ms
Rethinking LoRA Memory Through the Lens of KV Cache Compression
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 5 days ago
Recover-LoRA for Aggressive Quantization: Reclaiming Accuracy in 2-Bit Language Models via Low-Rank Adaptation with Knowledge Distillation on Synthetic Data
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 6 days ago
EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms
🎯 Fine-Tuning · Content type: Academic · arxiv.org · 6 days ago
Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair
🤖 AI · Content type: Academic · arxiv.org · 6 days ago
BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization
🎯 RLHF · Content type: Academic · arxiv.org · 6 days ago
Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs
💬 LLMs · Content type: Academic · arxiv.org · 5 days ago