LLMs

large language models, LLM, foundation models, AI agents

Feeds to Scour
SubscribedAll
Scoured 384 posts in 7.9 ms

When AI Agents “Pay Attention”

 ⚖️AI Ethics
psychologytoday.com·

A Plea to the Labs: Let the Models Diagnose.

 💬ChatGPT  Content type: Blog

MLPerf and the rise of latency-aware LLM benchmarking

 🔌MCP
edn.com·

MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent

 ✳️OpenAI  Content type: Blog
bric.pe.kr··DEV

The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has

 💬ChatGPT
xda-developers.com·

Treating LLMs as Programming Books

 ⚖️AI Ethics  Content type: Blog
jola.dev··Hacker News

What Are Tokens in LLMs?

 💬ChatGPT  Content type: Blog

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🤖AI
phoronix.com··r/artificial

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 ✳️OpenAI  Content type: Blog
adambien.blog·

OpenCV 5 Debuts with Improved ONNX Support and Native AI Upgrades

 ✳️OpenAI  Content type: News
hackster.io·

How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?

 ⚖️AI Ethics  Content type: Blog
medium.com·

Should LLM Agents Decide in Social Simulations? Comparing Finite-State and LLM-Based Decision Policies

 🔌MCP  Content type: Academic
arxiv.org·

‘Getting control where we can’—Europe wants sovereign AI but most of the chips are from the U.S.

 ⚖️AI Ethics  Content type: News
fortune.com
·

NVIDIA's Cosmos 3: The World's First Fully Open AI Omnimodel

 ✳️OpenAI  Content type: News
aimagazine.com·

Deep Learning Weekly: Issue 458

 🔌MCP

Google’s DiffusionGemma is 4x faster than its other Gemma models

 🤖AI
thenewstack.io·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

 💬ChatGPT  Content type: News  Content type: Blog
developer.nvidia.com·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News, r/LLM

Transitioning from Azure Language Features to Foundry Models

 💬ChatGPT

Token4Token — pay-per-token inference on Gnosis + Swarm

 🤖AI
t4t.eth.link··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help