Transformers

Feeds to Scour
SubscribedAll
Scoured 105 posts in 9.9 ms

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News

know the mother tongue of your LLMs

 💬LLMs

Meta-Attention: Teaching Models When Not to Answer

 🤖AI
hackernoon.com·

Causal Semantic Alignment for LLM-based Time Series Forecasting

 🤖AI  Content type: Academic
arxiv.org·

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

 🎯Fine-Tuning
pub.towardsai.net
·

The Edge LLM Offload Story

 🤖AI
semiengineering.com·
Less-relevant results

What Does Abliteration Actually Cost?

 🤖AI
lesswrong.com·

Show HN: LLM memory without context bleed; 100% precision vs. <10% vector search

 🤖AI

SafeRun: Enabling Determinism in LLM Planning for Running

 🤖AI  Content type: Academic
arxiv.org·

nex-agi/Nex-N2-mini • Huggingface

 🤖AI

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

 🤖AI
pub.towardsai.net
·

Google Gemma 4 12B: Architecture, Benchmarks, Access, and Hands-on Guide for Developers

 💬LLMs  Content type: Blog
analyticsvidhya.com·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🤖AI  Content type: Code
github.com··Hacker News

Reachability and asymptotics of Gaussian Transformer dynamics

 🤖Machine Learning  Content type: Academic
arxiv.org·

Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval

 🤖AI  Content type: Academic
arxiv.org·

Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition

 🧮Complexity Theory  Content type: Academic
arxiv.org·

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

 💬LLMs  Content type: Academic
arxiv.org·

Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context Language Modeling

 Hardware Acceleration  Content type: Academic
arxiv.org·

Post-training is (Massive) Supervised Learning

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

 🎯Fine-Tuning  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help