Scour
🧠 LLM Training
LLM training, pretraining, RLHF, model training, arxiv ML
Scoured 245 posts in 13.6 ms
🎯 RLHF · fareedkhan-dev.github.io · 4 days ago
Train LLM from Scratch
Discussed on Hacker News
🤖 AI Development · arXiv · 21 hours ago
The Hitchhiker's Guide to Agentic AI: From Foundations to Systems
🧠 LLM Tooling · GitHub · 2 days ago
Generate per-session LoRA adapters in <1s for agentic inference efficiency
Discussed on Hacker News
🧠 LLM Research · Hugging Face · 8 hours ago
HRM-Text: Efficient Pretraining Beyond Scaling
Covers sapientinc/HRM-Text: HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
Discussed on Hacker News
⚙️ LLM Fine-tuning · mlx-lora-studio.netlify.app · 6 days ago
MLX LoRA Studio — Fine-tune LLMs on your Mac
Covers ml-explore/mlx
🧠 LLM Tooling · vucense.com · 5 hours ago
TurboQuant on Windows and LM Studio 2026: Complete Setup Guide
Covers 2 stories, including Discover and run local LLMs
📄 AI Papers · arXiv · 5 hours ago
LoRA: Low-Rank Adaptation of Large Language Models
Covered by 14 sources, including Martin Fowler, Towards Data Science
🧠 LLM Research · medium.com · 10 hours ago
Large Language Models: Architectures, Pretraining, and Roadmaps
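🤖 Agentic Engineering · IT之家 · 1 day ago
Alibaba's Qwen releases Qwen-AgentWorld, its first native language world model, which can simulate agent interaction environments across seven domains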
🤖 AI · fineset.io · 2 days ago
Show HN: Describe a research topic, get a daily-updated ArXiv/S2 dataset
Covered by Hugging Face
Discussed on Hacker News
🧠 LLM Research · Ai2 · 9 hours ago
Which tokens does a hybrid model predict better?
🔀 LoRA · kaggle.com · 4 days ago
LoRA: I Trained <1% of a 1.5B Model and Matched a Full Fine-Tune
Discussed on DEV
🧠 LLM Engineering · linuxgizmos.com · 1 day ago
LILYGO T-Impulse Plus wearable dev board comes with LoRa, GNSS, OLED, and IMU
🧠 LLM Research · Bloomberg · 3 days ago
Tech Disruptors: Invisible Technologies on RLHF and LLM Training
🧠 LLM Engineering · GitHub · 6 days ago
Lightricks/LTX-2
Covered by DEV Community, Hugging Face
🧠 LLM Research · biorxiv.org · 3 days ago
CellTosg2Sequence: A Unified Text-Omics-Signaling-Graph Large Language Model for Single-Cell Analysis
🗣️ Large Language Models · ai-brief.liziran.com · 4 days ago
Leaderboard scores can't predict deployment; robotic arm self-iteration reaches 99%
🧠 LLM Research · igor´sLAB · 6 days ago
AMD at MLPerf Training 6.0: Instinct MI355X approaches Blackwell and scales across multiple servers for the first time
🧠 LLM Engineering · Hacker News · 3 days ago
Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions
Discussed on Hacker News
🧠 LLM Research · GitHub · 6 days ago
Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch
Discussed on Hacker News