What I learned building Python notebooks to run any AI model (LLM, Vision, Audio) — across CPU, GPU, and NPU
⚡High Performance Computing
Flag this post
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
arxiv.org·1d
💻Programming
Flag this post
A tiny and simple Open Source library to call LLM APIs with in-built rate-limiting, retries, circuit breaker...
🏛️Software Architecture Patterns
Flag this post
I'm the author of LocalAI (the local OpenAI-compatible API). We just released v3.7.0 with full Agentic Support (tool use!), Qwen 3 VL, and the latest llama.cpp
💻Programming
Flag this post
Show HN: Oodle – Unified Debugging with OpenSearch and Grafana
🏛️Software Architecture Patterns
Flag this post
Ranking LLMs based on 180k French votes (French government's AI arena)
🏛️Software Architecture Patterns
Flag this post
Introducing Agent-o-rama: build, trace, evaluate, and monitor stateful LLM agents in Java or Clojure
🏛️Software Architecture Patterns
Flag this post
How We Built a Custom Vision LLM to Improve Document Processing at Grab
⚡High Performance Computing
Flag this post
I Use AI
💻Programming
Flag this post
Kimi Linear: An Expressive, Efficient Attention Architecture
⚡High Performance Computing
Flag this post
When Five Dumb AIs Beat One Smart AI: The Case for Multi-Agent Systems
🏛️Software Architecture Patterns
Flag this post
Automating error analysis for AI agents – what works and doesn't
⚡High Performance Computing
Flag this post
Real-time stock volatility prediction with deep learning on a time-series DB
⚡High Performance Computing
Flag this post
Loading...Loading more...