Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Data science
🤖 Data science
LLM, Machine learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
121
posts in
7.9
ms
Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
🧠
LLM Inference
local-llm.utop.workers.dev
·
4d
4 days ago
·
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
LLM
AI Chatbots are letting me down every single day
💬
Natural Language Processing
umrashrf.github.io
·
6d
6 days ago
·
Hacker News
Actions for LLM AI Chatbots are letting me down every single day
Phase transition in
large
language
models
and the criticality of natural languages
💬
Natural Language Processing
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Phase transition in large language models and the criticality of natural languages
Tejas-TA/predikit: The missing bridge between your ML
models
and your AI agents.
🤖
AI Agents
Content type:
Code
github.com
·
22h
22 hours ago
·
Hacker News
Actions for Tejas-TA/predikit: The missing bridge between your ML models and your AI agents.
Words do not have determined meanings
🎯
Fine-tuning
Content type:
Discussion
news.ycombinator.com
·
6d
6 days ago
·
Hacker News
Actions for Words do not have determined meanings
Zero and Few Shot Load Forecasting with
Large
Language
Models
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Zero and Few Shot Load Forecasting with Large Language Models
Post-training is (Massive)
Supervised
Learning
🎯
Fine-tuning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Post-training is (Massive) Supervised Learning
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting Gemma 4 and Qwen 3.6 MTP
modes
🧠
LLM Inference
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
Larch:
Learned
Query Optimization for Semantic Predicates
🕸️
Knowledge Graphs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Larch: Learned Query Optimization for Semantic Predicates
apple/coreai-models
: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI
🔬
Deep Learning
Content type:
Code
github.com
·
2d
2 days ago
·
Hacker News
Actions for apple/coreai-models: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI
AgentCompile: An
LLM-Guided
Compiler for Direct CUDA Inference
🔬
Deep Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🧠
LLM Inference
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Spiking
Neural
Network
inference on FPGAs with hls4ml
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Spiking Neural Network inference on FPGAs with hls4ml
Towards Robust Arabic Speech Emotion Recognition with
Deep
Learning
🧠
Neural Networks
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Towards Robust Arabic Speech Emotion Recognition with Deep Learning
KJLdefeated/RL.cu: RLVR training for
LLM
in CUDA/C++
🔬
Deep Learning
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
Toward Compiler World
Models
:
Learning
Latent Dynamics for Efficient Tensor Program Search
🔥
PyTorch
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Toward Compiler World Models: Learning Latent Dynamics for Efficient Tensor Program Search
heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient
LLM
inference.
🧠
LLM Inference
Content type:
Code
github.com
·
4d
4 days ago
·
r/LocalLLaMA
Actions for heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
SafeECGMatch: Calibration-Aware Joint Frequency and Time Space
Semi-Supervised
Learning
for Open-Set ECG Classification
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification
BLM-SGAN: Bidirectional
Language
Modeling
for Semantic-Spatial Text-to-Image Generation
💬
Natural Language Processing
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation
alexziskind1/model-shelf
:
Model
Shelf is a local-first
model
resolver that helps AI agents and scripts find
model
weights on your own storage before downloading from
Hugging
Face
. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.
🎯
Fine-tuning
Content type:
Code
github.com
·
6d
6 days ago
Actions for alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help