Fine-Tuning

Transfer Learning, LoRA, Model Adaptation, Training Data, Hyperparameters

Fisher-Guided Progressive Parameter Selection for Adaptive Fine-Tuning

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Phantom transitions in language model fine-tuning

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Mechanistic Analysis of Alignment Algorithms in Language Models

 🎯RLHF  Content type: Academic
arxiv.org·

High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Whose Norms? Disentangling Cultural and Personal Alignment in Large Language Models

 🎯RLHF  Content type: Academic
arxiv.org·

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

 💬LLMs  Content type: Academic
arxiv.org·

Rethinking LoRA Memory Through the Lens of KV Cache Compression

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Emergence of Context Characteristics Sensitivity in Large Language Models

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Null-Space Constrained Low-Rank Adaptation for Response-Specified Large Language Model Unlearning

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Customization under Fire: Plugin Poisoning in Text-to-Image Ecosystem

 🎨Generative AI  Content type: Academic
arxiv.org·

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

 🎯RLHF  Content type: Academic
arxiv.org·

Lost in the Flow with Code Talkers: Unveiling the Instruction-Tuning Tax of Large Language Models in Code Tasks

 Code Generation  Content type: Academic
arxiv.org·

Subtitle-Aligned Fine-Tuning of Whisper for Swiss German ASR: Benchmark Contamination, Convention Mismatch, and an Honest Baseline at 25.6% WER (13.8% cWER)

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs

 💬LLMs  Content type: Academic
arxiv.org·

Data Synthesis and Parameter-Efficient Fine-Tuning for Low-Resource NMT: A Case Study on Q'eqchi' Mayan

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Recover-LoRA for Aggressive Quantization: Reclaiming Accuracy in 2-Bit Language Models via Low-Rank Adaptation with Knowledge Distillation on Synthetic Data

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Distilling Safe LLM Systems via Soft Prompts for On Device Settings

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

MailoHLS: Multi-Adapter Structure-Aware Learning for Pareto-Driven HLS Pragma Optimization

 🎛️Fine-tuning  Content type: Academic
arxiv.org·
