Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

Multi-Head Latent Attention
sebastianraschka.com·1d·
Discuss: Hacker News
🧠Intelligence Compression
Flag this post
Speak Freely: Private Language Models on a Shoestring Budget by Arvind Sundararajan
dev.to·1d·
Discuss: DEV
🎙️Whisper
Flag this post
Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems
arxiv.org·41m
🧠Intelligence Compression
Flag this post
LLMs: Decoding the Geometry of Alignment
dev.to·22h·
Discuss: DEV
🌀Riemannian Computing
Flag this post
Mamba-3: Improved Sequence Modeling Using State Space Principles
openreview.net·20h·
Discuss: Hacker News
🌳Context free grammars
Flag this post
The Free Software Foundation considers large language models
lwn.net·11h·
Discuss: Hacker News
🔓Open Source Software
Flag this post
video-SALMONN S: Streaming Audio-Visual LLMs Beyond Length Limits via Memory
arxiv.org·1d
🧠Neural Compression
Flag this post
MegaFold: An Open-Sourced AlphaFold-3 Training System
supercomputing-system-ai-lab.github.io·6h·
Discuss: Hacker News
Incremental Computation
Flag this post
Self-improving language models are becoming reality with MIT's updated SEAL technique
venturebeat.com·1d·
Discuss: Hacker News
Incremental Computation
Flag this post
Unveiling the Vulnerability of Graph-LLMs: An Interpretable Multi-Dimensional Adversarial Attack on TAGs
arxiv.org·41m
🧮Kolmogorov Complexity
Flag this post
Show HN: Local Full-Rank Fine-Tuning Library for LLMs with Evolutionary Methods
github.com·1d·
🧠Intelligence Compression
Flag this post
Disaggregation in Large Language Models: The Next Evolution in AI Infra
infoq.com·1d·
Discuss: Hacker News
🖥️Hardware Architecture
Flag this post
Self-Verifying Reflection Helps Transformers with CoT Reasoning
arxiv.org·41m
⚙️Proof Engineering
Flag this post
Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com·10h·
Discuss: Hacker News
🧠Machine Learning
Flag this post
Medical Interpretability and Knowledge Maps of Large Language Models
arxiv.org·1d
🧠Intelligence Compression
Flag this post
Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models
arxiv.org·1d
🧠Machine Learning
Flag this post
Hierarchical Alignment: Surgical Fine-Tuning via Functional Layer Specialization in Large Language Models
arxiv.org·41m
🔗Monadic Parsing
Flag this post
FedLoDrop: Federated LoRA with Dropout for Generalized LLM Fine-tuning
arxiv.org·41m
🧮Kolmogorov Complexity
Flag this post
Teaching Language Models to Faithfully Express their Uncertainty
arxiv.org·41m
💻Programming languages
Flag this post
Audit-of-Understanding: Posterior-Constrained Inference for Mathematical Reasoning in Language Models
arxiv.org·1d
🧮Constraint SMT
Flag this post