LLMs

Large Language Models, GPT, Transformers, Natural Language Processing

Don't dethrone consciousness

 🤖Machine Learning  Content type: News

Agentic AI for Insurance Underwriting: Beyond Chatbots and Prompts

 🤖AI  Content type: Blog

A wild idea: Abstract reality using ontology

 🔗Obsidian  Content type: Discussion

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

 🤖Machine Learning

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 🤖AI
techradar.com·

AI Agents Running Businesses: Andon Labs on Project Vend

 🤖Machine Learning
startuphub.ai·

Can News Predict the Market? Limits of Zero-Shot Financial NLP and the Role of Explainable AI

 🤖Machine Learning  Content type: Academic
arxiv.org·

I used ChatGPT and Gemini side-by-side for a month on Android, and only one behaved like a senior AI tool

 🤖AI
androidpolice.com·

I finally built the central AI hub I've been wanting, and Open WebUI made it stupidly simple

 🐍Python
xda-developers.com·

bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss

 🔥PyTorch  Content type: Code
github.com··r/LocalLLaMA

Model Evaluations: Prove Your Routing Policy Actually Works

 🤖Machine Learning  Content type: Blog
digitalocean.com·

Show HN: Axiomax – Cryptographic proof of AI inference carbon footprint

 🔌API Design

GPU Servers for Best Performance

 🤖Machine Learning
leaseweb.com··DEV

GraphInfer-Bench: Benchmarking LLM's Inference Capability on Graphs

 🔥PyTorch  Content type: Academic
arxiv.org·

Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results

 🤖AI
xda-developers.com·

Appraising Artworks with Joins and LLMs (Ultorg Database UI)

 🤖AI
ultorg.com··Hacker News

Why agentic AI needs an open inference stack

 🤖AI
redhat.com·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🤖AI  Content type: Code
github.com··Hacker News

Claude vs GPT-4: Which AI API Is Better for Developers? (2026)

 👨developer
kalyna.pro··DEV

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🤖Machine Learning