Data science

Feeds to Scour
SubscribedAll
Scoured 121 posts in 7.9 ms

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🧠LLM Inference

LLM AI Chatbots are letting me down every single day

 💬Natural Language Processing

Phase transition in large language models and the criticality of natural languages

 💬Natural Language Processing  Content type: Academic
arxiv.org·

Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.

 🤖AI Agents  Content type: Code
github.com··Hacker News

Words do not have determined meanings

 🎯Fine-tuning  Content type: Discussion

Zero and Few Shot Load Forecasting with Large Language Models

 🤖Machine Learning  Content type: Academic
arxiv.org·

Post-training is (Massive) Supervised Learning

 🎯Fine-tuning  Content type: Academic
arxiv.org·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🧠LLM Inference  Content type: Code
github.com··Hacker News

Larch: Learned Query Optimization for Semantic Predicates

 🕸️Knowledge Graphs  Content type: Academic
arxiv.org·

apple/coreai-models: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

 🔬Deep Learning  Content type: Code
github.com··Hacker News

AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference

 🔬Deep Learning  Content type: Academic
arxiv.org·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🧠LLM Inference  Content type: Code
github.com··Hacker News, r/LLM

Spiking Neural Network inference on FPGAs with hls4ml

 🤖Machine Learning  Content type: Academic
arxiv.org·

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

 🧠Neural Networks  Content type: Academic
arxiv.org·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 🔬Deep Learning  Content type: Code
github.com··Hacker News

Toward Compiler World Models: Learning Latent Dynamics for Efficient Tensor Program Search

 🔥PyTorch  Content type: Academic
arxiv.org·

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🧠LLM Inference  Content type: Code
github.com··r/LocalLLaMA

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

 🤖Machine Learning  Content type: Academic
arxiv.org·

BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation

 💬Natural Language Processing  Content type: Academic
arxiv.org·

alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.

 🎯Fine-tuning  Content type: Code
github.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help