⚡ Transformers - jhcha.oyo · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Code

github.com · Hacker News

know the mother tongue of your LLMs

mothertoken.inigoimaz.com · Hacker News

Meta-Attention: Teaching Models When Not to Answer

hackernoon.com

Causal Semantic Alignment for LLM-based Time Series Forecasting

🤖AI Academic

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

🎯Fine-Tuning

pub.towardsai.net

The Edge LLM Offload Story

semiengineering.com

Less-relevant results

What Does Abliteration Actually Cost?

lesswrong.com

Show HN: LLM memory without context bleed; 100% precision vs. <10% vector search

tenureai.dev · Hacker News

SafeRun: Enabling Determinism in LLM Planning for Running

🤖AI Academic

nex-agi/Nex-N2-mini • Huggingface

huggingface.co · r/LocalLLaMA

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

pub.towardsai.net

Google Gemma 4 12B: Architecture, Benchmarks, Access, and Hands-on Guide for Developers

💬LLMs Blog

analyticsvidhya.com

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

🤖AI Code

github.com · Hacker News

Reachability and asymptotics of Gaussian Transformer dynamics

🤖Machine Learning Academic

Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval

🤖AI Academic

Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition

🧮Complexity Theory Academic

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

💬LLMs Academic

Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context Language Modeling

⚡Hardware Acceleration Academic

Post-training is (Massive) Supervised Learning

🎛️Fine-tuning Academic

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

🎯Fine-Tuning Academic
