LLMs

Feeds to Scour
SubscribedAll
Scoured 143 posts in 12.6 ms

Small Experiments, Cheaper Decisions: A Case Study in Staged Promotion for Micro-Pretraining

 ⚙️Model Training  Content type: Academic
arxiv.org·

Hallucination Cascade: Analyzing Error Propagation in Multi-Agent LLM Systems

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Corpus Augmentation for Sign Language Translation via LLM-Guided Video Stitching

 ⚙️Model Training  Content type: Academic
arxiv.org·

Data-Constrained Language Model Pretraining: Improved Regularization and Scaling Laws

 ⚙️Model Training  Content type: Academic
arxiv.org·

Multi-Hop Knowledge Composition is Bound by Pretraining Exposure

 ⚙️Model Training  Content type: Academic
arxiv.org·

Making Locality-aware GEMM Compatible with Page-Granularity Placement on Chiplet GPUs

 🖥️ML Systems  Content type: Academic
arxiv.org·

A retrieval conditioned rebinding circuit for dynamic entity tracking in large language models

 🔄Transformers  Content type: Academic
arxiv.org·

ActiveMimic: Egocentric Video Pretraining with Active Perception

 ⚙️Model Training  Content type: Academic
arxiv.org·

PermDoRA -- Understanding Adapter Interference in Language Models: Limits of Parameter-Space Geometry

 🔄Transformers  Content type: Academic
arxiv.org·

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

 🧠AI Research  Content type: Academic
arxiv.org·

ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning

 ⚙️Model Training  Content type: Academic
arxiv.org·

Cross Paraphrastic Invariance Learning for Hallucination Detection

 ⚙️Model Training  Content type: Academic
arxiv.org·

Domain-Adapted Small Language Models with Hybrid Post-Processing: Achieving Cost-Efficient, Low-Latency Multi-Label Structured Prediction via LoRA Fine-Tuning on Scarce Data

 ⚙️Model Training  Content type: Academic
arxiv.org·

SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation

 🧠AI Research  Content type: Academic
arxiv.org·

Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs

 🔍Interpretability  Content type: Academic
arxiv.org·

Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning

 ⚙️Model Training  Content type: Academic
arxiv.org·

LifeSentence: Language models can encode human life course trajectories from longitudinal panel data

 🧠AI Research  Content type: Academic
arxiv.org·

The Amplifying Mirror: Locating and Steering the Partisan Direction inside a Large Language Model

 🔍Interpretability  Content type: Academic
arxiv.org·

Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

 🔄Transformers  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help