🎛️ Fine-tuning
LoRA, RLHF, model fine-tuning, instruction tuning, SFT
Sequential Data Poisoning in LLM Post-Training
🔥 Burn · Academic · arxiv.org · 6 days ago
SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption
🔐 Cryptography · eprint.iacr.org · 1 day ago
Less-relevant results
If Claude Fable stops helping you, you'll never know
🧭 Content Discovery · Blog · jonready.com · 22 hours ago · Lobsters, Hacker News
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
🗄️ Database Internals · vettedconsumer.com · 4 days ago · Hacker News
The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning
🎯 BM25 · Academic · arxiv.org · 2 days ago
Alignment Defends LLMs from Property Inference Attacks
🌲 LSM Trees · Academic · arxiv.org · 16 hours ago
Training LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference Optimization
🌲 LSM Trees · Academic · arxiv.org · 16 hours ago
Auditing Training Data in Domain-adapted LLMs: LoRA-MINT
💬 Natural Language Processing · Academic · arxiv.org · 2 days ago
Parameter-Efficient Fine-Tuning with Learnable Rank
💬 Natural Language Processing · Academic · arxiv.org · 6 days ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
📐 Embeddings · Academic · arxiv.org · 1 day ago
Emergence of Context Characteristics Sensitivity in Large Language Models
📐 Embeddings · Academic · arxiv.org · 1 day ago
RL Excursions during Pre-Training: Re-examining Policy Optimization for LLM training
🎲 Procedural Generation · Academic · arxiv.org · 6 days ago
RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning
🌲 LSM Trees · Academic · arxiv.org · 2 days ago
PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis
🧠 Query Planners · Academic · arxiv.org · 5 days ago
PriFT: Prior-Support Guided Supervised Fine-Tuning
⚙️ Compilers · Academic · arxiv.org · 1 day ago
Defending Against Malicious Finetuning by Scaling Train-time Adversarial Attacks
🔥 Burn · Academic · arxiv.org · 1 day ago
High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model
👤 Search Personalization · Academic · arxiv.org · 5 days ago
Emergent alignment and the projectability of ethical personas
📰 Content Curation · Academic · arxiv.org · 1 day ago
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
💬 Natural Language Processing · Academic · arxiv.org · 1 day ago
BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization
🎯 Recommendation Algorithms · Academic · arxiv.org · 6 days ago