LLMs

Feeds to Scour
SubscribedAll
Scoured 76 posts in 7.2 ms

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

馃儚Imperfect Information GamesContent type: Academic
arxiv.org

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

馃儚Imperfect Information GamesContent type: Academic
arxiv.org

Operationalizing Linguistic Methods through Prompt-Engineering Skills: An Automatic Chinese Web Neologism Detection Pipeline

馃搻Formal LanguagesContent type: Academic
arxiv.org

Attention at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal Transformer Kernels

馃ЗNeural-Symbolic AIContent type: Academic
arxiv.org

LoRi: Low-Rank Distillation for Implicit Reasoning

馃儚Imperfect Information GamesContent type: Academic
arxiv.org

NGram-MoSE: Efficient Remote Sensing Super-Resolution via N-Gram Context and Mixture-of-Experts

馃搻Formal LanguagesContent type: Academic
arxiv.org

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

馃ИAgent EvaluationContent type: Academic
arxiv.org

AISC deployment in dynamic UAV-assisted MEC network: a reinforcement learning method based on heterogeneous graph attention neural network

馃尦Decision-Time PlanningContent type: Academic
arxiv.org

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

馃ЗNeural-Symbolic AIContent type: Academic
arxiv.org
Less-relevant results

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

馃儚Imperfect Information GamesContent type: Academic
arxiv.org

Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models

馃ИAgent EvaluationContent type: Academic
arxiv.org

AttentionCap: Transformer Based Capacitance Matrix Learning Toward Full-Chip Extraction

馃ЗNeural-Symbolic AIContent type: Academic
arxiv.org

Reconstructing Multi-Decadal Forest Disturbances: A Spatio-Temporal Transformer Approach

馃儚Imperfect Information GamesContent type: Academic
arxiv.org

A Mechanistic Analysis of Adversarial Fine-tuning of Vision Transformers

馃ИAgent EvaluationContent type: Academic
arxiv.org

Pre-Intervention Prediction of Sparse Autoencoder Steering Side Effects

馃儚Imperfect Information GamesContent type: Academic
arxiv.org

Optimal Post-Training Quantization Scales and Where to Find Them

馃ЗNeural-Symbolic AIContent type: Academic
arxiv.org

Learned Subspace Compression for Communication-Efficient Pipeline Parallelism

馃ЗNeural-Symbolic AIContent type: Academic
arxiv.org

Phantom transitions in language model fine-tuning

馃搻Formal LanguagesContent type: Academic
arxiv.org

AQIFormer: A Transformer-Based Multi-View Architecture for Cross-City Air Quality Classification

馃ИAgent EvaluationContent type: Academic
arxiv.org

Hidden Consensus:Preference-Validity Compression in Human Feedback

馃ИAgent EvaluationContent type: Academic
arxiv.org

No more posts from sworddish's subscribed feeds.

Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help