LLMs

Benchmarking Large Language Models for Safety Data Extraction

馃AI EngineeringContent type: Academic
arxiv.org

How to Run an LLM Locally: Ultimate Guide to Local AI 2026

馃AI EngineeringContent type: Blog

Most people use Ollama or llama.cpp for local LLMs, but these are the tools I switch to when it gets serious

馃AI Engineering
xda-developers.com

vLLM Internalised: The Mechanics of Modern LLM Inference

馃AI EngineeringContent type: Blog
medium.com

Unsloth Minimax M3 GGUF

馃AI Engineering

A reporting checklist for large language models in behavioural science

馃AI EngineeringContent type: Academic
nature.com

Mlx-optiq: per-layer mixed-precision LLM quantization for Apple Silicon

馃AI EngineeringContent type: VideoContent type: DiscussionContent type: Tutorial

microsoft/LLMLingua: [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

馃AI EngineeringContent type: Code
github.comDEV

Kimi K2.7-Code: Open-Weight 1T Model That Beats Claude Opus on Tool Use

馃AI EngineeringContent type: Blog
wowhow.cloudDEV

Introduction to (Multimodal) LLM-as-a-Judge

馃AI EngineeringContent type: NewsContent type: Blog

Back to Basics: Build Your Own LLM from Scratch

💎 Ruby
thejeshgn.com

How LLMs are Actually Trained

馃AI EngineeringContent type: NewsContent type: Blog
blog.algomaster.io

Get ChatGPT, Gemini, Claude, and more for life for $60

馃AI Engineering
macworld.com

How ChatGPT Actually Works (Beginner Friendly)

馃AI AgentsContent type: Blog
medium.com

Why LLMs (still) lack taste

馃AI Engineering

Have we made a unicorn? Continuous SVG-pelican style benchmark

🔥 Hotwire
Content type: Reference

Chain-of-Thought Prompting Is Not What You Think

馃AI Agents
siliconopera.com

Intelligent inference scheduling with llm-d on Red Hat AI

馃AI Engineering
developers.redhat.com

Build Claude Alternative in Cloud in 20mins

馃AI EngineeringContent type: Reference
docs.dagploy.comHacker News

US blocks Claude Fable 5 and Mythos 5: is frontier AI now too dangerous?

馃AI EngineeringContent type: Blog
techzine.eu
