🤖 LLMs - jyunzhang · Scour

SLUUG Talk: Demystifying Large Language Models on Linux

🤖Machine Learning Code

github.com··DEV

rag-explained-how-it-works

🔍RAG Blog

IA-RAG: Interval-Algebra-Driven Temporal Reasoning for Dynamic Knowledge Retrieval

🔍RAG Academic

Doubling Qwen3.6-27B on One RTX 3090: ollama llama.cpp + MTP, Lever by Lever (35.7 80.2 tok/s)

🦙Ollama Blog

A handy llama-server launcher with easy model and configuration customisation

📝NLP Code

github.com··r/LocalLLaMA

Beyond Basic RAG (Part 3): Agentic RAG, CRAG, Self-RAG and GraphRAG Explained | M012 | Mehul Ligade

pub.towardsai.net

·

MolE-RAG: Molecular Structure-Enhanced Retrieval-Augmented Generation for Chemistry

🔍RAG Academic

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

🤖Machine Learning Code

github.com··r/LocalLLaMA

Fine-tuning vs RAG vs MeMo: Where should LLM Knowledge Live?

pub.towardsai.net

·

LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams

🤖AI Blog

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

🎭Anthropic Claude Academic

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

🏠Self-hosting Code

github.com··Hacker News

Unlocking the Power of RAG Systems with LangChain and Vector Databases

🔍RAG Blog

Revisiting Vul-RAG: Reproducibility and Replicability of RAG-based Vulnerability Detection with Open-Weight Models

🔍RAG Academic

Kodiqa-Solutions/Kodiqa-agent: 🧠 One agent. Every model. Zero limits. — Open-source AI coding agent that runs anywhere. 7 providers, 69 commands, local or cloud. Your terminal, your rules.

🔧Developer Tools Code

github.com··Hacker News

Built an AI-Powered Spring Boot Log Analyzer Using RAG + Ollama

💻JAVA JS RUST Blog

··DEV

zaydmulani09/mnemo: Local-first AI memory layer for any LLM. Persistent knowledge graph, entity extraction, semantic retrieval. Works with Ollama, OpenAI, Anthropic, or any OpenAI-compatible backend.

🦙Ollama Code

github.com··Hacker News

Add a PASS/WARN/FAIL Quality Gate to Your RAG Pipeline in 30 Seconds

🔍RAG Blog

I Benchmarked 3 Local LLMs on My Laptop — Here's What the Numbers Actually Show

🦙Ollama Blog

I Accidentally Spent $400 on GPT-4o in One Month. Here's How to Never Do That.

🎭Anthropic Claude Blog

Log in to enable infinite scrolling