Retrieval Augmented Generation, Vector Search, Embeddings, Context

Engineer's Guide to Local LLMs with LLaMA.cpp on Linux
avatsaev.substack.com·20h·
Discuss: r/LocalLLaMA
🧮Jemalloc
Flag this post
Space DJ: Navigating a Musical Universe
magenta.withgoogle.com·9h·
Discuss: Hacker News
📹WebRTC
Flag this post
Post-training methods for language models
developers.redhat.com·2d
💬Prompt Engineering
Flag this post
Grok AI: A Deep Dive into xAI’s Maverick Chatbot
future.forem.com·1d·
Discuss: DEV
🗂️Obsidian
Flag this post
Disassembling Terabytes of Random Data with Zig and Capstone to Prove a Point
jstrieb.github.io·1d·
🔓Binary Exploitation
Flag this post
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-CompactVision-Language Model
paperium.net·3d·
Discuss: DEV
🌳Tree-sitter
Flag this post
Bioaccumulation Modeling via Spatio-Temporal Transformer Networks for Environmental Risk Assessment
dev.to·14h·
Discuss: DEV
👁️Computer Vision
Flag this post
Build123d (A Python CAD programming library) Roadmap
github.com·1d·
Discuss: Hacker News
🎨Design Systems
Flag this post
Enhanced Anti-Reflection Coating Design via Stochastic Gradient Descent on Parametric Nanostructure Optimization
dev.to·18h·
Discuss: DEV
🌟Ray Tracing
Flag this post
A beginner's guide to the Flux-Kontext-Fast model by Prunaai on Replicate
dev.to·1d·
Discuss: DEV
Incremental Computation
Flag this post
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
arxiv.org·2d
🧮Vector Databases
Flag this post
Efficient Curvature-aware Graph Network
arxiv.org·2d
🕸️Graph Theory
Flag this post
Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs
arxiv.org·2d
📝Parsing
Flag this post
SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning
arxiv.org·1d
💬Prompt Engineering
Flag this post
When One Modality Sabotages the Others: A Diagnostic Lens on Multimodal Reasoning
arxiv.org·1d
🚀MLOps
Flag this post
Why stop at 1 million tokens when you can have 10? My journey to extreme context on a gaming GPU. [P]
reddit.com·2d·
📱Edge AI
Flag this post
Show HN: Tool2agent – a protocol for LLM tool feedback workflows
github.com·5h·
Discuss: Hacker News
💎Refinement Types
Flag this post
BondBERT: What we learn when assigning sentiment in the bond market
arxiv.org·1d
💰TigerBeetle
Flag this post
Accelerating MySQL Query Optimization via Reinforcement Learning & Hypergraph Analysis
dev.to·8h·
Discuss: DEV
🔍Query Optimization
Flag this post