Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃 language models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
187196
posts in
27.1
ms
Decoupling the Benefits of
Subword
Tokenization for Language Model Training via
Byte-level
Simulation
聽
馃挰
LLM
arxiv.org
路
3h
How Fast Should a Model Commit to Supervision? Training Reasoning Models on the
Tsallis
Loss
Continuum
聽
馃挰
LLM
arxiv.org
路
2d
PRTS
: A Primitive Reasoning and
Tasking
System via Contrastive Representations
聽
馃挰
LLM
arxiv.org
路
3h
AutoPyVerifier
: Learning Compact Executable
Verifiers
for Large Language Model Outputs
聽
馃挰
LLM
arxiv.org
路
3d
ADE
: Adaptive
Dictionary
Embeddings -- Scaling Multi-Anchor Representations to Large Language Models
聽
馃挰
LLM
arxiv.org
路
2d
When 2D Tasks Meet
1D
Serialization
: On
Serialization
Friction in Structured Tasks
聽
馃挰
LLM
arxiv.org
路
3h
Three Models of
RLHF
Annotation
: Extension, Evidence, and Authority
聽
馃挰
LLM
arxiv.org
路
2d
DPN-LE
: Dual Personality
Neuron
Localization and Editing for Large Language Models
聽
馃挰
LLM
arxiv.org
路
3h
Evaluating Large Language Models on Computer Science University
Exams
in Data
Structures
聽
馃挰
LLM
arxiv.org
路
3d
HealthBench
Professional: Evaluating Large Language Models on Real
Clinician
Chats
聽
馃挰
LLM
arxiv.org
路
3h
Structural
Generalization
on
SLOG
without Hand-Written Rules
聽
馃挰
LLM
arxiv.org
路
1d
Less Is More: Engineering Challenges of On-Device Small Language Model
Integration
in a Mobile
Application
聽
馃挰
LLM
arxiv.org
路
3d
Domain-Adapted
Small Language Models for Reliable Clinical
Triage
聽
馃挰
LLM
arxiv.org
路
1d
Text-Utilization
for
Encoder-dominated
Speech Recognition Models
聽
馃挰
LLM
arxiv.org
路
1d
Mixture
of Heterogeneous
Grouped
Experts for Language Modeling
聽
馃挰
LLM
arxiv.org
路
3d
Large Language Models for
Multilingual
Code Intelligence: A
Survey
聽
馃挰
LLM
arxiv.org
路
1d
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from
Raw
Corpora
聽
馃挰
LLM
arxiv.org
路
2d
Extracting
books from production language models
聽
馃挰
LLM
arxiv.org
路
4d
On the
Trainability
of Masked Diffusion Language Models via
Blockwise
Locality
聽
馃挰
LLM
arxiv.org
路
2d
From
Similarity
to Structure: Training-free LLM Context Compression with Hybrid Graph
Priors
聽
馃挰
LLM
arxiv.org
路
3d
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help