💬 LLMs - simiasherextra · Scour

Running LLM Inference on Kubernetes: What It Actually Takes

⚙️ROS Blog

fairwinds.com·

fix(ollama): use provider thinking default in SDK session factory (#9… · openclaw/openclaw@4f3c2cd

⚙️ROS Code

Report: GKE Inference Gateway delivers up to 92% faster AI responses

🌐AGI Blog

cloud.google.com··Hacker News

Acoda: Adversarial Code Obfuscation for Defending against LLM-based Analysis

✨Neural Radiance Fields Academic

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

🔌Embedded Systems

aermia.com··Hacker News

How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?

🌐AGI Blog

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

✨Neural Radiance Fields Blog

dnhkng.github.io·

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

🏳️‍🌈LGBT Tech

huggingface.co··r/LocalLLaMA

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

✨Neural Radiance Fields News Blog

developer.nvidia.com·

I built an open-source persistent memory layer for AI coding agents

⚙️ROS Code

github.com··r/GithubCopilot

I finally built the central AI hub I've been wanting, and Open WebUI made it stupidly simple

🔓Open Source

xda-developers.com·

Benchmarking Large Language Models for Safety Data Extraction

🛡️AI Safety Academic

Claude vs GPT-4: Which AI API Is Better for Developers? (2026)

kalyna.pro··DEV

Foundation Models: Apple Isn’t Building an AI Model. It’s Building an AI Platform.

🛡️AI Safety Blog

Large companies can add a local LLM filter layer to considerably reducing their AI costs

umrashrf.github.io··Hacker News

What are AI parameters — and why does everyone keep talking about billions of them?

🌐AGI Blog

Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

🛡️AI Safety Academic

LLM AI Chatbots are letting me down every single day

🧠AI Research

umrashrf.github.io··Hacker News

Quantum circuits help AI overcome memory limitations with minimal new parameters

🧠AI Research

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

⚙️ROS Code

github.com··DEV

Sign up or log in to see more results

Log in to enable infinite scrolling