🤖 AI - moznotes · Scour

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

⚡Systems Performance

huggingface.co··Hacker News, Hacker News, r/LocalLLaMA

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

🍎Apple Code

github.com··Hacker News

Nvidia’s best model is now live

🔬Tech & Science

thenewstack.io·

I bet everything on eight weeks: solo #1 on MTEB English v2

🔧MLOps Blog

sentimark.ai··Hacker News

An announcement from the Steering Council regarding the JIT project

👁️Observability

discuss.python.org··Lobsters, Hacker News, Hacker News, Hacker News, r/Python

apple/coreai-models: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

🍎Apple Code

github.com··Hacker News

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

🧠LLM Engineering Blog

huggingface.co··Hacker News, r/LocalLLaMA

Forgis-Labs/HEPA: HEPA: Self-supervised horizon-conditioned event predictive architecture for time series. Spotlight at FMSD @ ICML 2026.

⚙️AI Infrastructure Code

github.com··Hacker News

nex-agi/Nex-N2-mini • Huggingface

🧠LLM Engineering

huggingface.co··r/LocalLLaMA

Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec

🔬Tech & Science Code

github.com··Hacker News

princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works

🧠LLM Engineering Code

github.com··Hacker News

Nex N2 Pro: Frontier agentic performance at 400B

huggingface.co··Hacker News

linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore

🔧MLOps Code

github.com··Hacker News

ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.

⚡Systems Performance Code

github.com··Hacker News

patriceckhart/zot: Yet another coding agent harness, lightweight and written in go.

🖥️Self-hosted apps Code

github.com··Hacker News

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

🧠LLM Engineering Code

github.com··Hacker News

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

⚙️AI Infrastructure Code

github.com··Hacker News

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

⚙️AI Infrastructure Code

github.com··Hacker News

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

⚙️AI Infrastructure Code

github.com··r/LocalLLaMA

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

⚙️AI Infrastructure Code

github.com··Hacker News

No more posts from moznotes's subscribed feeds.

Scour all 25257 feeds Learn more about Feeds

Sign up or log in to see more results

Log in to enable infinite scrolling