LLMs

Large Language Models, GPT, Claude, Transformers, Prompt Engineering

Feeds to Scour
SubscribedAll
Scoured 669 posts in 14.0 ms

Using Scikit-LLM with Open-Source LLMs

 🦙Ollama

Classical RAG vs Agentic RAG: a practical decision guide

 🔍RAG  Content type: Blog
dev.to··DEV

RAG-Based Testing Series — Part 1: What Is RAG & Why Your Old Testing Playbook Won't Work Here

 🔍RAG
linkedin.com··DEV

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

 🤖AI
pub.towardsai.net
·

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

 🤖AI  Content type: Academic
arxiv.org·

Claude vs GPT-4: Which AI API Is Better for Developers? (2026)

 🎭Anthropic Claude
kalyna.pro··DEV

Initial impressions of Claude Fable 5

 🎭Anthropic Claude
simonwillison.net··Hacker News

A handy llama-server launcher with easy model and configuration customisation

 📝NLP  Content type: Code
github.com··r/LocalLLaMA

What Are Tokens in LLMs?

 📝NLP  Content type: Blog

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

 🤖Transformers
pub.towardsai.net
·

rag-explained-how-it-works

 🔍RAG  Content type: Blog
dev.to··DEV

TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication

 🔍RAG  Content type: Academic
arxiv.org·

zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability

 🦙Ollama  Content type: Code
github.com··Hacker News

The complete guide to claude code configuration file

 🎭Anthropic Claude  Content type: Blog
dev.to··DEV

What is Agentic RAG? Building Multi-Agent Agentic RAG Systems

 🔍RAG
pub.towardsai.net
·

shoo99/paper-rag: A private, fully-local RAG over your own PDFs: BGE-M3 + embedded Qdrant + a local LLM via Ollama. ~150 lines, nothing leaves your machine.

 🔍RAG  Content type: Code
github.com··DEV

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

 📝NLP  Content type: Academic
arxiv.org·

LLM Inference Handbook 2026

 🏗️Systems Design
pub.towardsai.net
·

Open-LLM-VTuber Review: Offline AI Companion with Live2D

 🦙Ollama  Content type: Blog
dev.to··DEV

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖Machine Learning  Content type: Code
github.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help