LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints
arxiv.org · 41m
📉 Model Quantization
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com · 2h
Discuss: r/LLM
👁️ Attention Optimization
C.J. Stroud exits game after hard hit, being evaluated for concussion
nytimes.com · 10h
🎮 NVIDIA
Vision = Language: I Decoded VLM Tokens to See What AI 'Sees' 🔬
reddit.com · 10h
Discuss: r/LocalLLaMA
🧩 Attention Kernels
Synthesized Generative Modeling via Graph-Constrained Semantic Embedding
dev.to · 13h
Discuss: DEV
🎓 Model Distillation
Weak-To-Strong Generalization
lesswrong.com · 1d
📉 Model Quantization
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
paperium.net · 7h
Discuss: DEV
📉 Model Quantization
I made a tensor runtime & inference framework in C (good for learning how inference works)
github.com · 4h
📜 TorchScript
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
arxiv.org · 41m
💡 LSP
Semantic search with embeddings in PHP: a hands-on guide using Neuron AI and Ollama
ollama.com · 1d
Discuss: DEV
🧩 Attention Kernels
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com · 11h
Discuss: Substack
📉 Model Quantization
Can-t stop till you get enough
cant.bearblog.dev · 11h
Discuss: Hacker News
📜 TorchScript
A Beginner’s Guide to Getting Started with add_messages Reducer in LangGraph
langcasts.com · 2d
Discuss: DEV
🤖 AI Coding Tools
Polish emerges as top language in multilingual AI benchmark testing
ppc.land · 20h
🔄 ONNX
Testing Unnatural Prompt Engineering Across Five Large Language Models
blog.codeminer42.com · 2d
🎓 Model Distillation
ClipTagger-12B VLM: Frame Captioning Tutorial
dev.to · 13h
Discuss: DEV
🔄 ONNX
Hou Tu Pranownse Inglish
zompist.com · 11h
Discuss: Hacker News
Type Checkers
Unlock Autonomy: Next-Gen LLMs Learn to Decode Themselves by Arvind Sundararajan
dev.to · 12h
Discuss: DEV
🤖 AI Coding Tools
From hours to seconds: AI tools to detect animal calls
seangoedecke.com · 23h
Discuss: Hacker News
📉 Model Quantization