🤖 Data science - saeedesmaili · Scour

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

🧠LLM Inference

local-llm.utop.workers.dev · Hacker News

LLM AI Chatbots are letting me down every single day

💬Natural Language Processing

umrashrf.github.io · Hacker News

Phase transition in large language models and the criticality of natural languages

💬Natural Language Processing Academic

Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.

🤖AI Agents Code

github.com · Hacker News

Words do not have determined meanings

🎯Fine-tuning Discussion

news.ycombinator.com · Hacker News

Zero and Few Shot Load Forecasting with Large Language Models

🤖Machine Learning Academic

Post-training is (Massive) Supervised Learning

🎯Fine-tuning Academic

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

🧠LLM Inference Code

github.com · Hacker News

Larch: Learned Query Optimization for Semantic Predicates

🕸️Knowledge Graphs Academic

apple/coreai-models: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

🔬Deep Learning Code

github.com · Hacker News

AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference

🔬Deep Learning Academic

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🧠LLM Inference Code

github.com · Hacker News, r/LLM

Spiking Neural Network inference on FPGAs with hls4ml

🤖Machine Learning Academic

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

🧠Neural Networks Academic

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

🔬Deep Learning Code

github.com · Hacker News

Toward Compiler World Models: Learning Latent Dynamics for Efficient Tensor Program Search

🔥PyTorch Academic

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

🧠LLM Inference Code

github.com · r/LocalLLaMA

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

🤖Machine Learning Academic

BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation

💬Natural Language Processing Academic

alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.

🎯Fine-tuning Code
