Ranking LLMs based on 180k French votes (French government's AI arena)
Earley Parsing
Presentation: Scaling API Independence: Mocking, Contract Testing & Observability in Large Microservices Environments
infoq.com · 2h
Monorepos
Building Custom LLM Judges for AI Agent Accuracy
databricks.com · 19h
Metacircular Evaluators
Google's AI space moonshot
therundown.ai · 5h
Program Synthesis
This feels like the early Internet moment for AI.
threadreaderapp.com · 1d
Lua
Inside Zendesk's dual AI leap: From reliable agents to real-time intelligence with GPT-5 and HyperArc
venturebeat.com · 1d
Language Ergonomics
LGCC: Enhancing Flow Matching Based Text-Guided Image Editing with Local Gaussian Coupling and Context Consistency
arxiv.org · 10h
Minimal ML
I've created a leetcode for system design
Nanopass
Building Syllabi - Agentic AI with Vercel AI SDK, Dynamic Tool Loading, and RAG
Interactive REPLs
Algorithmic Alchemy: Transmuting Dynamic Programming with Gradients by Arvind Sundararajan
Constraint Solvers
Unsupervised Learning for Industrial Defect Detection: A Case Study on Shearographic Data
arxiv.org · 10h
Region Inference
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
Erlang OTP
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org · 1d
Functional Compilers
I Want to Break Free! Persuasion and Anti-Social Behavior of LLMs in Multi-Agent Settings with Social Hierarchy
arxiv.org · 10h
Finite Automata
Announcing the fastest inference for realtime voice AI agents
together.ai · 1d
Tokenizer Performance
Neurosymbolic Deep Learning Semantics
arxiv.org · 10h
Semantic Parsing