LLMs

Large Language Models, GPT, Claude, Transformers, Prompt Engineering


SLUUG Talk: Demystifying Large Language Models on Linux

 🤖Machine Learning  Content type: Code
github.com··DEV

rag-explained-how-it-works

 🔍RAG  Content type: Blog
dev.to··DEV

IA-RAG: Interval-Algebra-Driven Temporal Reasoning for Dynamic Knowledge Retrieval

 🔍RAG  Content type: Academic
arxiv.org·

Doubling Qwen3.6-27B on One RTX 3090: ollama llama.cpp + MTP, Lever by Lever (35.7 → 80.2 tok/s)

 🦙Ollama  Content type: Blog
dev.to··DEV

A handy llama-server launcher with easy model and configuration customisation

 📝NLP  Content type: Code
github.com··r/LocalLLaMA

Beyond Basic RAG (Part 3): Agentic RAG, CRAG, Self-RAG and GraphRAG Explained | M012 | Mehul Ligade

 🔍RAG
pub.towardsai.net·

MolE-RAG: Molecular Structure-Enhanced Retrieval-Augmented Generation for Chemistry

 🔍RAG  Content type: Academic
arxiv.org·

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🤖Machine Learning  Content type: Code
github.com··r/LocalLLaMA

Fine-tuning vs RAG vs MeMo: Where should LLM Knowledge Live?

 🔍RAG
pub.towardsai.net·

LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams

 🤖AI  Content type: Blog
dev.to··DEV

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

 🎭Anthropic Claude  Content type: Academic
arxiv.org·

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

 🏠Self-hosting  Content type: Code
github.com··Hacker News

Unlocking the Power of RAG Systems with LangChain and Vector Databases

 🔍RAG  Content type: Blog
dev.to··DEV

Revisiting Vul-RAG: Reproducibility and Replicability of RAG-based Vulnerability Detection with Open-Weight Models

 🔍RAG  Content type: Academic
arxiv.org·

Kodiqa-Solutions/Kodiqa-agent: 🧠 One agent. Every model. Zero limits. — Open-source AI coding agent that runs anywhere. 7 providers, 69 commands, local or cloud. Your terminal, your rules.

 🔧Developer Tools  Content type: Code
github.com··Hacker News

Built an AI-Powered Spring Boot Log Analyzer Using RAG + Ollama

 💻JAVA JS RUST  Content type: Blog
dev.to··DEV

zaydmulani09/mnemo: Local-first AI memory layer for any LLM. Persistent knowledge graph, entity extraction, semantic retrieval. Works with Ollama, OpenAI, Anthropic, or any OpenAI-compatible backend.

 🦙Ollama  Content type: Code
github.com··Hacker News

Add a PASS/WARN/FAIL Quality Gate to Your RAG Pipeline in 30 Seconds

 🔍RAG  Content type: Blog
dev.to··DEV

I Benchmarked 3 Local LLMs on My Laptop — Here's What the Numbers Actually Show

 🦙Ollama  Content type: Blog
dev.to··DEV

I Accidentally Spent $400 on GPT-4o in One Month. Here's How to Never Do That.

 🎭Anthropic Claude  Content type: Blog
dev.to··DEV
