Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 Language Models
transformer architecture, fine-tuning, pretraining, RLHF
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
149952
posts in
13.2
ms
Large Language Model Post-Training: A
Unified
View of Off-Policy and On-Policy Learning
🧠
LLMs
arxiv.org
·
8h
Fine-tuning
Whisper
to my speech: 27% to 6.5%
WER
🧠
LLMs
vivekkairi.com
·
4d
·
Hacker News
LaCy
: What Small Language Models Can and Should Learn is Not Just a Question of Loss
🧠
LLMs
machinelearning.apple.com
·
1d
Recursive
Language Models - A
Systematic
Approach to Large-Scale Document Analysis – Part I.
🧠
LLMs
constitutionaldiscourse.com
·
3d
Reinforcement
Learning From Human Feedback (
RLHF
) in Large Language Models(LLMs)
🧠
LLMs
pub.towardsai.net
·
6d
Exploring
Continual
Fine-Tuning for Enhancing Language
Ability
in Large Language Model
🧠
LLMs
arxiv.org
·
2d
Linear
Representations
of Hierarchical
Concepts
in Language Models
🧠
LLMs
arxiv.org
·
8h
GRASS
: Gradient-based Adaptive
Layer-wise
Importance Sampling for Memory-efficient Large Language Model Fine-tuning
🧠
LLMs
arxiv.org
·
8h
In-Context Learning in Speech Language Models: Analyzing the Role of Acoustic Features,
Linguistic
Structure, and
Induction
Heads
🧠
LLMs
arxiv.org
·
1d
Dead
Weights
, Live Signals:
Feedforward
Graphs of Frozen Language Models
🧠
LLMs
arxiv.org
·
8h
Short Data, Long Context:
Distilling
Positional
Knowledge in Transformers
🧠
LLMs
arxiv.org
·
2d
What do Language Models Learn and When? The Implicit
Curriculum
Hypothesis
🧠
LLMs
arxiv.org
·
8h
A Parameter-Efficient Transfer Learning Approach through
Multitask
Prompt Distillation and Decomposition for Clinical
NLP
🧠
LLMs
arxiv.org
·
1d
MIPT-SSM
: Scaling Language Models with $O(1)$ Inference Cache via Phase Transitions
🧠
LLMs
arxiv.org
·
8h
One Model for All:
Multi-Objective
Controllable
Language Models
🧠
LLMs
arxiv.org
·
3d
Distributed
Interpretability
and Control for Large Language Models
⚙️
MLOps
arxiv.org
·
1d
ReRec
:
Reasoning-Augmented
LLM-based Recommendation Assistant via Reinforcement Fine-tuning
🧠
LLMs
arxiv.org
·
8h
State-of-the-Art
Arabic
Language Modeling with Sparse
MoE
Fine-Tuning and Chain-of-Thought Distillation
🧠
LLMs
arxiv.org
·
1d
On-Policy
Distillation
of Language Models for Autonomous Vehicle
Motion
Planning
🛡️
AI Safety
arxiv.org
·
8h
LPC-SM
: Local Predictive Coding and Sparse Memory for Long-Context Language Modeling
🧠
LLMs
arxiv.org
·
3d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help