🧠 LLM Training - buckman · Scour

🎮Reinforcement Learning fareedkhan-dev.github.io·

Train LLM from Scratch

Discussed on Hacker News

🧠LLM GitHub·

Rust port of transformers (1M lines of code)

Discussed on Hacker News

🤗Hugging Face kaggle.com·

LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune

Discussed on DEV

🎮Reinforcement Learning mlx-lora-studio.netlify.app·

MLX LoRA Studio — Fine-tune LLMs on your Mac

Covers ml-explore/mlx

🧠LLM Nature·

Memorization in large language models in medicine prevalence characteristics and implications

🤖AI digitalocean.com·

Efficient LLM Compression with SparseGPT and Wanda on GPU Cloud

Covers NVIDIA Triton Inference Server — NVIDIA Triton Inference Server

🤗Hugging Face developer.nvidia.com·

Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes

🟩Nvidia GitHub·

Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch

Discussed on Hacker News

🤗Hugging Face huggingface.co·

Beyond LoRA: Can you beat the most popular fine-tuning technique?

📊Compute Markets projecthuginn.com·

cheaper AI training on idle GPUs

Discussed on Hacker News

🧠LLM Reasoning medium.com

·

RAFT: Teach LLMs to be better at RAG

🖥️GPU kaggle.com·

QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)

Discussed on DEV

🤖Large Language Models i-programmer.info·

Stanford's CME296 Diffusion & Large Vision Models

🤖AI day1training.com·

Distributed AI on AWS

Discussed on Hacker News

🖥️GPU igor´sLAB·

AMD at MLPerf Training 6.0: Instinct MI355X approaches Blackwell and scales across multiple servers for the first time

🤖ML Machine Learning Blog·

Pre-Training Isn’t Bitter Enough

Covered by Deep Learning Weekly

🧠LLM medium.com

·

AI Model Fine-Tuning Data Guide: Quality, Formats & Flywheel.

🧠LLMs lesswrong.com·

Alignement pretraining could backfire

Covers Teaching Claude why

🟩Nvidia Databricks·

Cloned

Covers NVIDIA Triton Inference Server — NVIDIA Triton Inference Server

Covered by lebigdata.fr

⚡Quantization GitHub·

Lightricks/LTX-2

Covered by DEV Community, huggingface.co

Log in to enable infinite scrolling