AI

Feeds to Scour
SubscribedAll
Scoured 64 posts in 7.6 ms

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

 Systems Performance

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🍎Apple  Content type: Code
github.com··Hacker News

Nvidia’s best model is now live

 🔬Tech & Science
thenewstack.io·

I bet everything on eight weeks: solo #1 on MTEB English v2

 🔧MLOps  Content type: Blog
sentimark.ai··Hacker News

An announcement from the Steering Council regarding the JIT project

 👁️Observability

apple/coreai-models: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

 🍎Apple  Content type: Code
github.com··Hacker News

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

 🧠LLM Engineering  Content type: Blog

Forgis-Labs/HEPA: HEPA: Self-supervised horizon-conditioned event predictive architecture for time series. Spotlight at FMSD @ ICML 2026.

 ⚙️AI Infrastructure  Content type: Code
github.com··Hacker News

nex-agi/Nex-N2-mini • Huggingface

 🧠LLM Engineering

Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec

 🔬Tech & Science  Content type: Code
github.com··Hacker News

princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works

 🧠LLM Engineering  Content type: Code
github.com··Hacker News

Nex N2 Pro: Frontier agentic performance at 400B

 🔧MLOps

linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore

 🔧MLOps  Content type: Code
github.com··Hacker News

ApodexAI/AgentHarness: Evaluation harness for Apodex-1.0 on public deep-research benchmarks.

 Systems Performance  Content type: Code
github.com··Hacker News

patriceckhart/zot: Yet another coding agent harness, lightweight and written in go.

 🖥️Self-hosted apps  Content type: Code
github.com··Hacker News

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

 🧠LLM Engineering  Content type: Code
github.com··Hacker News

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 ⚙️AI Infrastructure  Content type: Code
github.com··Hacker News

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 ⚙️AI Infrastructure  Content type: Code
github.com··Hacker News

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 ⚙️AI Infrastructure  Content type: Code
github.com··r/LocalLLaMA

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

 ⚙️AI Infrastructure  Content type: Code
github.com··Hacker News

No more posts from moznotes's subscribed feeds.

Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help