Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Fine-tuning
🎛️ Fine-tuning
LoRA, Model Training, LLM Adaptation
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
319
posts in
7.1
ms
Fisher-Guided Progressive
Parameter
Selection for Adaptive
Fine-Tuning
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Fisher-Guided Progressive Parameter Selection for Adaptive Fine-Tuning
PriFT: Prior-Support Guided
Supervised
Fine-Tuning
🎯
RLHF
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for PriFT: Prior-Support Guided Supervised Fine-Tuning
Sequential Data Poisoning in
LLM
Post-Training
🎯
RLHF
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Sequential Data Poisoning in LLM Post-Training
Auditing
Training
Data in
Domain-adapted
LLMs:
LoRA-MINT
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Auditing Training Data in Domain-adapted LLMs: LoRA-MINT
AuRA: Internalizing Audio Understanding into LLMs as
LoRA
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for AuRA: Internalizing Audio Understanding into LLMs as LoRA
The Neutral Mask: How
RLHF
Provides Shallow Alignment while Leaving Partisan Structure Intact in a
Large
Language
Model
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
Supervised
Fine-tuning
with Synthetic Rationale Data Hurts Real-World Disease Prediction
🎯
RLHF
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Supervised Fine-tuning with Synthetic Rationale Data Hurts Real-World Disease Prediction
RL
Excursions during
Pre-Training
: Re-examining Policy Optimization for LLM
training
🎯
RLHF
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for RL Excursions during Pre-Training: Re-examining Policy Optimization for LLM training
Alignment Defends LLMs from Property Inference Attacks
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Alignment Defends LLMs from Property Inference Attacks
Distilling Safe
LLM
Systems via Soft Prompts for On Device Settings
💬
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Distilling Safe LLM Systems via Soft Prompts for On Device Settings
PEFT
of SLM for Telecommunications Customer Support: A Comparative Study of
LoRA
Configurations with Energy Consumption Analysis
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis
Training
LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization
On the Geometry of On-Policy Distillation
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for On the Geometry of On-Policy Distillation
Benchmarking Empirical Privacy Protection for
Adaptations
of
Large
Language
Models
💬
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Benchmarking Empirical Privacy Protection for Adaptations of Large Language Models
High-Dimensional Theory of
LoRA
Fine-Tuning
in a Solvable Attention Model
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model
Breaking the Tokenizer Barrier: On-Policy Distillation across
Model
Families
💬
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families
Data Synthesis and
Parameter-Efficient
Fine-Tuning
for Low-Resource NMT: A Case Study on Q'eqchi' Mayan
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Data Synthesis and Parameter-Efficient Fine-Tuning for Low-Resource NMT: A Case Study on Q'eqchi' Mayan
Domain-Adapted
Small
Language
Models
with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data
🎯
Fine-Tuning
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data
TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank
Adaptation
🎯
RLHF
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation
Better Literary Translation: A Multi-Aspect Data Generation and
LLM
Training
Approach
🎯
RLHF
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Better Literary Translation: A Multi-Aspect Data Generation and LLM Training Approach
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help