🧠 LLM Research
large language models, transformer, pretraining, fine-tuning
Scoured 718 posts in 36.1 ms

📄 AI Papers · arXiv · 6 hours ago
LoRA: Low-Rank Adaptation of Large Language Models
Covered by 14 sources, including Martin Fowler and Towards Data Science

🎯 RLHF · fareedkhan-dev.github.io · 4 days ago
Train LLM from Scratch
Discussed on Hacker News

🗣️ Large Language Models · Machine Learning Mastery · 2 days ago
Clustering Unstructured Text with LLM Embeddings and HDBSCAN

🧠 LLM Training · Hugging Face · 9 hours ago
HRM-Text: Efficient Pretraining Beyond Scaling
Covers sapientinc/HRM-Text: HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
Discussed on Hacker News

🧠 LLM · ByteByteGo Newsletter · 1 day ago
Large Language Models vs Small Language Models
Covers 6 stories, including Attention is all you need (2017)

⚡ Transformers · astledsa.substack.com · 6 days ago
Tree Transformers
Discussed on Substack

🗣️ Large Language Models · medium.com · 11 hours ago
Large Language Models: Architectures, Pretraining, and Roadmaps

🗣️ Large Language Models · medium.com · 2 days ago
How LLMs Actually Work

🗣️ Large Language Models · medium.com · 13 hours ago
Temperature and Sampling in Transformers: How LLMs Decide the Next Word

🧠 AI Models · Bloomberg · 3 days ago
Tech Disruptors: Invisible Technologies on RLHF and LLM Training

🤖 AI/ML · medium.com · 11 hours ago
The Coming War Between Memory and Compute in AI Systems

📄 AI Papers · arXiv · 1 day ago
RoFormer: Enhanced Transformer with Rotary Position Embedding
Covered by 13 sources, including pathtostaff.com and DEV Community

🗣️ Large Language Models · IT之家 · 17 hours ago
Fujitsu introduces the PHOTON framework: 1.2B model delivers 475x the multi-query performance of Transformer

🤖 AI Development · hamanlp.org · 1 day ago
Lean Zig by building an LLM from scratch
Covers Zig Software Foundation ⚡ Zig Programming Language
Discussed on Hacker News

🤖 LLM, Agent · Deep (Learning) Focus · 3 days ago
Agentic RL: Frameworks and Best Practices
Covers 3 stories, including MCP is an open protocol that standardizes how apps provide context to LLMs
Discussed on Substack

🤖 AI Development · medium.com · 21 hours ago
Why I Stopped Focusing on ML Algorithms and Started Focusing on Data and Systems

🗣️ Large Language Models · foojay · 1 day ago
BoxLang 1.14.0: Query Transformers – Take Full Control of Your Query Results

🤖 Artificial Intelligence · SiliconANGLE · 6 hours ago
Mirendil raises $200M to speed up scientific research with AI

🤖 AI Development · GitHub · 6 days ago
Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch
Discussed on Hacker News

🔬 Deep Learning · Neuroscience News · 2 days ago
Human Memory Limits Make AI Better at Grammar