💬 LLMs
Specific: large language models, transformer, GPT, pretraining
Scoured 143 posts in 12.6 ms
Small Experiments, Cheaper Decisions: A Case Study in Staged Promotion for Micro-Pretraining
⚙️ Model Training · Content type: Academic · arxiv.org · 6 hours ago
Hallucination Cascade: Analyzing Error Propagation in Multi-Agent LLM Systems
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 2 days ago
Corpus Augmentation for Sign Language Translation via LLM-Guided Video Stitching
⚙️ Model Training · Content type: Academic · arxiv.org · 6 hours ago
Data-Constrained Language Model Pretraining: Improved Regularization and Scaling Laws
⚙️ Model Training · Content type: Academic · arxiv.org · 3 days ago
Multi-Hop Knowledge Composition is Bound by Pretraining Exposure
⚙️ Model Training · Content type: Academic · arxiv.org · 2 days ago
Making Locality-aware GEMM Compatible with Page-Granularity Placement on Chiplet GPUs
🖥️ ML Systems · Content type: Academic · arxiv.org · 6 hours ago
A retrieval conditioned rebinding circuit for dynamic entity tracking in large language models
🔄 Transformers · Content type: Academic · arxiv.org · 2 days ago
ActiveMimic: Egocentric Video Pretraining with Active Perception
⚙️ Model Training · Content type: Academic · arxiv.org · 6 days ago
PermDoRA -- Understanding Adapter Interference in Language Models: Limits of Parameter-Space Geometry
🔄 Transformers · Content type: Academic · arxiv.org · 6 hours ago
MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models
🧠 AI Research · Content type: Academic · arxiv.org · 2 days ago
ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning
⚙️ Model Training · Content type: Academic · arxiv.org · 6 hours ago
Cross Paraphrastic Invariance Learning for Hallucination Detection
⚙️ Model Training · Content type: Academic · arxiv.org · 2 days ago
Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data
⚙️ Model Training · Content type: Academic · arxiv.org · 6 days ago
SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation
🧠 AI Research · Content type: Academic · arxiv.org · 6 hours ago
Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs
🔍 Interpretability · Content type: Academic · arxiv.org · 2 days ago
Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning
⚙️ Model Training · Content type: Academic · arxiv.org · 3 days ago
LifeSentence: Language models can encode human life course trajectories from longitudinal panel data
🧠 AI Research · Content type: Academic · arxiv.org · 6 hours ago
The Amplifying Mirror: Locating and Steering the Partisan Direction inside a Large Language Model
🔍 Interpretability · Content type: Academic · arxiv.org · 2 days ago
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
🎮 Reinforcement Learning · Content type: Academic · arxiv.org · 2 days ago
SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks
🔄 Transformers · Content type: Academic · arxiv.org · 6 hours ago