LLMs

Large Language Models, GPT, Claude, Transformers, Prompt Engineering

Feeds to Scour
SubscribedAll
Scoured 957 posts in 8.2 ms

TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication

 🔍RAG  Content type: Academic
arxiv.org·

zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability

 🦙Ollama  Content type: Code
github.com··Hacker News

Evaluating RAG Reliability under Clean, Misleading, and Mixed Retrieval

 🔍RAG  Content type: Academic
arxiv.org·

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

 📝NLP  Content type: Academic
arxiv.org·

fix(gateway): fail closed for unknown model auth · openclaw/openclaw@85343ea

 📝NLP  Content type: Code
github.com·

shoo99/paper-rag: A private, fully-local RAG over your own PDFs: BGE-M3 + embedded Qdrant + a local LLM via Ollama. ~150 lines, nothing leaves your machine.

 🔍RAG  Content type: Code
github.com··DEV

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖Machine Learning  Content type: Code
github.com··Hacker News

SLUUG Talk: Demystifying Large Language Models on Linux

 🤖Machine Learning  Content type: Code
github.com··DEV

IA-RAG: Interval-Algebra-Driven Temporal Reasoning for Dynamic Knowledge Retrieval

 🔍RAG  Content type: Academic
arxiv.org·

Alvaro-Manzo/promptshift: Model-aware prompt adapter for Claude — translate any prompt to GPT, Gemini, Mistral, Llama and more

 📝NLP  Content type: Code

MolE-RAG: Molecular Structure-Enhanced Retrieval-Augmented Generation for Chemistry

 🔍RAG  Content type: Academic
arxiv.org·

A handy llama-server launcher with easy model and configuration customisation

 📝NLP  Content type: Code
github.com··r/LocalLLaMA

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

 🎭Anthropic Claude  Content type: Academic
arxiv.org·

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

 🎭Anthropic Claude  Content type: Code
github.com··r/SideProject

Revisiting Vul-RAG: Reproducibility and Replicability of RAG-based Vulnerability Detection with Open-Weight Models

 🔍RAG  Content type: Academic
arxiv.org·

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🤖Machine Learning  Content type: Code
github.com··r/LocalLLaMA

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

 🏠Self-hosting  Content type: Code
github.com··Hacker News

Reducing Hallucinations in Complex Question Answering using Simple Graph-based Retrieval-Augmented Generation (long version)

 🔍RAG  Content type: Academic
arxiv.org·

Kodiqa-Solutions/Kodiqa-agent: 🧠 One agent. Every model. Zero limits. — Open-source AI coding agent that runs anywhere. 7 providers, 69 commands, local or cloud. Your terminal, your rules.

 🔧Developer Tools  Content type: Code
github.com··Hacker News

QCFuse: Query-Aware Cache Fusion via Compressed View for Efficient RAG Serving

 🔍RAG  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help