🛠 Ml-eng - miterion · Scour

facebookresearch/MUSE: A library for Multilingual Unsupervised or Supervised word Embeddings

github.com·5h

Sweden eliminates Czech Republic from Olympic women’s hockey in stunning upset

nytimes.com·3h

Are Two LLMs Better Than One? A Student-Teacher Dual-Head LLMs Architecture for Pharmaceutical Content Optimization

arxiv.org·17h

🎓Model Distillation

Lexer, Parser, Codegen

github.com·15h·

Discuss: DEV

Atomistic, but non-complete lattices

dominiczypen.wordpress.com·11h

Olmix: A framework for data mixing throughout LM development

allenai.org·5h

⚡ONNX Runtime

Scaling Model and Data for Multilingual Machine Translation with Open Large Language Models

arxiv.org·17h

🏎️TensorRT

GLM 5 has a regression in international language writing according to NCBench

nc-bench.com·9h·

Discuss: r/LocalLLaMA

Languages aren’t real

facebook.com·21h

🔍Type Checkers

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·1d·

Discuss: Hacker News

📉Model Quantization

kathrynschuler.com·21h

The Evolving Role of the ML Engineer

towardsdatascience.com·7h

bsky.app·13h·

Discuss: Bluesky

📉Model Quantization

wouldloveitall.bearblog.dev·5h

🧩Attention Kernels

Ming-flash-omni-2.0: 100B MoE (6B active) omni-modal model - unified speech/SFX/music generation

huggingface.co·1d·

Discuss: r/LocalLLaMA

⚡Flash Attention

How Transformer Architecture Powers LLMs

dev.to·1d·

Discuss: DEV

🧩Attention Kernels

Presentation: Building Embedding Models for Large-Scale Real-World Applications

infoq.com·6h

🎓Model Distillation

Ma reggel : M1 : February 13, 2026 8:20am-8:31am CET : Free Borrow & Streaming

archive.org·14h

Context Graphs: Building Production World Models for the Age of AI Agents

hackernoon.com·9h

🤖AI Coding Tools

Friday the 13th: understand its meaning in numerology

oantagonista.com.br·7h
