⚡ Transformers - jhcha.oyo · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Code

github.com··Hacker News

know the mother tongue of your LLMs

mothertoken.inigoimaz.com··Hacker News

Meta-Attention: Teaching Models When Not to Answer

hackernoon.com·

Causal Semantic Alignment for LLM-based Time Series Forecasting

🤖AI Academic

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

🎯Fine-Tuning

pub.towardsai.net

·

The Edge LLM Offload Story

semiengineering.com·

Less-relevant results

What Does Abliteration Actually Cost?

lesswrong.com·

Show HN: LLM memory without context bleed; 100% precision vs. <10% vector search

tenureai.dev··Hacker News, Hacker News

SafeRun: Enabling Determinism in LLM Planning for Running

🤖AI Academic

nex-agi/Nex-N2-mini • Huggingface

huggingface.co··r/LocalLLaMA

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

pub.towardsai.net

·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

🤖AI Code

github.com··Hacker News

Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval

🤖AI Academic

Reachability and asymptotics of Gaussian Transformer dynamics

🤖Machine Learning Academic

Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition

🧮Complexity Theory Academic

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

💬LLMs Academic

A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training

📈Optimization Academic

Post-training is (Massive) Supervised Learning

🎛️Fine-tuning Academic

Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context Language Modeling

⚡Hardware Acceleration Academic

Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning

🎮Reinforcement Learning Academic

Log in to enable infinite scrolling