NLP

Transformers, Language Models, Text Processing, Chatbots

Feeds to Scour
SubscribedAll
Scoured 51 posts in 6.0 ms

LLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models

馃Neural ArchitectureContent type: Academic
arxiv.org

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

馃Neural ArchitectureContent type: Academic
arxiv.org

SafeRun: Enabling Determinism in LLM Planning for Running

馃攧MLOpsContent type: Academic
arxiv.org

Auditing Training Data in Domain-adapted LLMs: LoRA-MINT

馃攧MLOpsContent type: Academic
arxiv.org

A retrieval conditioned rebinding circuit for dynamic entity tracking in large language models

馃搳EmbeddingsContent type: Academic
arxiv.org

Reducing Hallucinations in Complex Question Answering using Simple Graph-based Retrieval-Augmented Generation (long version)

馃搳EmbeddingsContent type: Academic
arxiv.org

Automatic Extraction of Structured Information from Brain MRI Reports Using an Open-Weight Large Language Model

馃搳EmbeddingsContent type: Academic
arxiv.org

Post-training is (Massive) Supervised Learning

馃AIContent type: Academic
arxiv.org

Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit Quantization for Large Language Models

馃Neural ArchitectureContent type: Academic
arxiv.org

LLM-Based Code Documentation Generation and Multi-Judge Evaluation

鈿欙笍LLVMContent type: Academic
arxiv.org

Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs

馃摫Edge AIContent type: Academic
arxiv.org

Ten Headache Specialists versus Artificial Intelligence for Clinical Literature Summarization: A Critical Evaluation and Comparison

馃摫Edge AIContent type: Academic
arxiv.org

Steganography Without Modification: Hidden Communication via LLM Seeds

馃搳EmbeddingsContent type: Academic
arxiv.orgHacker News

FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing

鈿欙笍LLVMContent type: Academic
arxiv.org

GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection

馃摫Edge AIContent type: Academic
arxiv.org

Dynamic Linear Attention

馃Neural ArchitectureContent type: Academic
arxiv.org

AI-Driven Test Case Generation from Natural Language Requirements: A Survey of Techniques and Research Gaps

馃攧MLOpsContent type: Academic
arxiv.org

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

馃攧MLOpsContent type: Academic
arxiv.org

BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation

馃Neural ArchitectureContent type: Academic
arxiv.org

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

鈿欙笍LLVMContent type: Academic
arxiv.org

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help