Binary Neural Networks, Low-precision Training, Efficient Inference, Weight Compression

Visual Exploration of Gradient Descent (many images)
lesswrong.com·11h
🎯Qdrant
PC-SNN: Predictive Coding-based Local Hebbian Plasticity Learning in Spiking Neural Networks
arxiv.org·20h
🔢BitNet Inference
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
nature.com·8h
🧠LLM Inference
embeddinggemma with Qdrant compatible uint8 tensors output
huggingface.co·19h
🎯Qdrant
Learning languages with the help of algorithms
johndcook.com·4h
📇Indexing Strategies
Material that listens: Chip-based approach enables speech recognition and more
techxplore.com·6h
Hardware Acceleration
Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20
github.com·9h
🏗️LLM Infrastructure
Efficient Cold-Start Recommendation via BPE Token-Level Embedding Initialization with LLM
arxiv.org·20h
🎯Recommendation Algorithms
Secrets of Chinese AI Model DeepSeek Revealed in Landmark Paper
scientificamerican.com·3h
🏗️LLM Infrastructure
RU-Net for Automatic Characterization of TRISO Fuel Cross Sections
arxiv.org·20h
📊Vector Databases
Discrete Time System Properties- Plainly
pub.towardsai.net·19h
🔍AI Interpretability
Making LLMs more accurate by using all of their layers
research.google·7h
🏗️LLM Infrastructure
IsoSched: Preemptive Tile Cascaded Scheduling of Multi-DNN via Subgraph Isomorphism
arxiv.org·20h
🧠Inference Serving
CoVariance Filters and Neural Networks over Hilbert Spaces
arxiv.org·20h
📊Vector Databases
ZTree: A Subgroup Identification Based Decision Tree Learning Framework
arxiv.org·20h
📊Vector Databases
Large Language Models Imitate Logical Reasoning, but at what Cost?
arxiv.org·20h
🧠LLM Inference
High-Energy Concentration for Federated Learning in Frequency Domain
arxiv.org·20h
🗜️Zstd
Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation
arxiv.org·20h
📊Embeddings
JANUS: A Dual-Constraint Generative Framework for Stealthy Node Injection Attacks
arxiv.org·20h
🛡️AI Security
DeepSeek-R1 on Nature: How Pure Reinforcement Learning Unlocks LLM Reasoning
reddit.com·1h
🏗️LLM Infrastructure