🧠 LLM Training · Scour

🧠 LLM TrainingSpecific

LLM training, pretraining, RLHF, model training, arxiv ML

If a 270M Model Already Worked, Why Did I Fine-Tune a 7B One?

Discussed on DEV

What's the advice for LLM poisoning of artwork these days?

Discussed on Lobsters

·

From Intern to AI Agent: How Hugging Face’s ML Intern Is Redefining Work

Comparing Transformers and Hybrid Models at the Token Level

baidu/Unlimited-OCR

Covered by 5 sources including The Rundown AI, VentureBeat

Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning

not much happened today | AINews

Covers 6 stories including GLM-5.2 is the new leading open weights model on Artificial Analysis

Show HN: Pragmatiq – open-source framework for foundational models in banking

Discussed on Hacker News

Out of Stealth (Kinda)

Covers uv

Discussed on Hacker News

Transformer-Based Language Models Across Domain Verticals: Architectures, Applications and Critical Assessment

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

Covers Opencode – open-source alternative to Claude Code

Breaking Browser-Use Models Using Domain Randomization

Discussed on Hacker News and Hacker News

Show HN: I built an 11-LLM consensus engine to detect AI hallucination

Covers Show HN: An AI that reliably builds full-stack apps by preventing LLM mistakes

Discussed on Hacker News

Natural Ungrokking: Asymmetric Control of Which Rules Survive Pretraining

Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments

Covers 2 stories including vllm-project/vllm

Covered by 3 sources including GitHub, indiehacker.news

Discussed on r/LocalLLaMA

TuringViT: Making SOTA Vision Transformers Accessible to All

A Physics-Informed Fourier-Wavelet Transformer for Multiscale Computational Fluid Dynamics Surrogate Modeling

Aligning MusicLLM with Emotion using Instruction Tuning and Feedback-Driven Alignment

MosaicLeaks: Can your research agent keep a secret?

Covered by tldr.tech

Discussed on Hacker News

Tri-Efficient Transfer Learning for Point Cloud Videos

Sign up or log in to see more results

Log in to enable infinite scrolling