Large Language Model

What Are Tokens in LLMs?

 🤖AI  Content type: Blog

Why Your LLM Gets Dumber With More Context

 📊Dataset Curation
siliconopera.com·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🤖AI  Content type: Blog
adambien.blog·

From Chatbot Hallucinations to Deterministic Agents: Forcing Local LLMs to Run Production-Grade…

 🤖AI  Content type: Blog
medium.com·

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

 👁️Computer Vision

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

 👁️Computer Vision
aermia.com··Hacker News

The Tech Download: Mistral's Arthur Mensch on agentic AI, chips and enterprise adoption

 👁️Computer Vision  Content type: News
cnbc.com·

Does ChatGPT need a psychiatrist? Similarities between human psychopathology and errors in large language models

 📊Dataset Curation  Content type: Academic
nature.com··Hacker News

France’s Mistral in Funding Talks at About €20 Billion Valuation

 🤖AI  Content type: News
bloomberg.com·

You don't need Copilot for code completion, try this instead

 🤖AI

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

 🤖AI  Content type: Discussion

massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.

 🤖AI  Content type: Code
github.com··Hacker News

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 🤖AI  Content type: Blog
adambien.blog·

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

 🤖AI  Content type: Blog
blogs.nvidia.com·

DiffusionGemma: Discrete diffusion in a large language model

 👁️Computer Vision

I ran local LLMs on my phone for a month, and now my desktop setup feels like overkill

 🤖AI
xda-developers.com·

Report: GKE Inference Gateway delivers up to 92% faster AI responses

 🤖AI  Content type: Blog

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

 🤖AI

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

 🤖AI  Content type: Academic
arxiv.org·

Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit

 🤖AI
huggingface.co··r/LocalLLaMA
