Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLM Training
🧠 LLM Training
Specific
large language models, fine-tuning, pretraining, RLHF
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
256
posts in
20.4
ms
🎮
Reinforcement Learning
fareedkhan-dev.github.io
·
1d
1 day ago
Train
LLM
from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
🧠
LLM
GitHub
·
4d
4 days ago
Rust port of
transformers
(1M lines of code)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Rust port of transformers (1M lines of code)
🤗
Hugging Face
kaggle.com
·
17h
17 hours ago
LoRA
: I
Trained
<1% of a 1.5B
Model
and Matched a Full Fine-Tune
Discussed on
DEV
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune
🎮
Reinforcement Learning
mlx-lora-studio.netlify.app
·
2d
2 days ago
MLX
LoRA
Studio —
Fine-tune
LLMs on your Mac
Covers
ml-explore/mlx
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for MLX LoRA Studio — Fine-tune LLMs on your Mac
🧠
LLM
Nature
·
3d
3 days ago
Memorization in
large
language
models
in medicine prevalence characteristics and implications
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Memorization in large language models in medicine prevalence characteristics and implications
🤖
AI
digitalocean.com
·
2d
2 days ago
Efficient
LLM
Compression with SparseGPT and Wanda on
GPU
Cloud
Covers
NVIDIA Triton Inference Server — NVIDIA Triton Inference Server
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Efficient LLM Compression with SparseGPT and Wanda on GPU Cloud
🤗
Hugging Face
developer.nvidia.com
·
6d
6 days ago
Fine-Tuning
Biological Foundation
Models
with LoRA Using NVIDIA BioNeMo Recipes
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes
🟩
Nvidia
GitHub
·
2d
2 days ago
Show HN: NanoEuler – GPT-2 scale
model
in pure C/CUDA from scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch
🤗
Hugging Face
huggingface.co
·
4d
4 days ago
Beyond
LoRA
: Can you beat the most popular
fine-tuning
technique?
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Beyond LoRA: Can you beat the most popular fine-tuning technique?
📊
Compute Markets
projecthuginn.com
·
4d
4 days ago
cheaper AI
training
on idle GPUs
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for cheaper AI training on idle GPUs
🧠
LLM Reasoning
medium.com
·
3d
3 days ago
RAFT: Teach LLMs to be better at RAG
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for RAFT: Teach LLMs to be better at RAG
🖥️
GPU
kaggle.com
·
17h
17 hours ago
QLoRA
:
Fine-Tuning
a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)
Discussed on
DEV
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)
🤖
Large Language Models
i-programmer.info
·
5d
5 days ago
Stanford's CME296 Diffusion &
Large
Vision
Models
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Stanford's CME296 Diffusion & Large Vision Models
🤖
AI
day1training.com
·
4d
4 days ago
Distributed
AI on AWS
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Distributed AI on AWS
🖥️
GPU
igor´sLAB
·
3d
3 days ago
AMD at MLPerf
Training
6.0: Instinct MI355X approaches Blackwell and scales across multiple servers for the first time
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AMD at MLPerf Training 6.0: Instinct MI355X approaches Blackwell and scales across multiple servers for the first time
🤖
ML
Machine Learning Blog
·
4d
4 days ago
Pre-Training
Isn’t Bitter Enough
Covered by
Deep Learning Weekly
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Pre-Training Isn’t Bitter Enough
🧠
LLM
medium.com
·
6d
6 days ago
AI
Model
Fine-Tuning
Data Guide: Quality, Formats & Flywheel.
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for AI Model Fine-Tuning Data Guide: Quality, Formats & Flywheel.
🧠
LLMs
lesswrong.com
·
4d
4 days ago
Alignement
pretraining
could backfire
Covers
Teaching Claude why
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Alignement pretraining could backfire
🟩
Nvidia
Databricks
·
4d
4 days ago
Cloned
Covers
NVIDIA Triton Inference Server — NVIDIA Triton Inference Server
Covered by
lebigdata.fr
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Cloned
⚡
Quantization
GitHub
·
3d
3 days ago
Lightricks/LTX-2
Covered by
DEV Community
,
huggingface.co
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Lightricks/LTX-2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report