zFLoRA: Zero-Latency Fused Low-Rank Adapters
arxiv.org·2d
🏠Local LLM Deployment
Mitre ATT&CK v18 released
🪟Awesome windows command-line
ParallelMind Engine: First AI System with Parallel Logical Reasoning (202+ problems/sec)
🏠Local LLM Deployment
Speedrunning an RL Environment
🏠Local LLM Deployment
Linux/WASM
🪟Awesome windows command-line
Welcome to Aspire: Your stack, streamlined – Aspire is going polyglot
🖥️Self-hosted apps
Smaller Surfaces
🏠Local LLM Deployment
Beyond Brute Force: 4 Secrets to Smaller, Smarter, and Dramatically Cheaper AI
hackernoon.com·19h
🏠Local LLM Deployment
Our newest model: Chandra (OCR)
🏠Local LLM Deployment
OpenAI’s Apps SDK: A Developer’s Guide to Getting Started
thenewstack.io·22h
🖥️Self-hosted apps
Show HN: Postflare AI – An AI-Powered Social Media Strategist and Bulk Scheduler
🖥️Self-hosted apps
OwlAI Assistant for Small Business
🖥️Self-hosted apps
The End of Cloud Inference
🏠Local LLM Deployment