Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLM Research
🧠 LLM Research
large language models, transformer, pretraining, fine-tuning
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
724
posts in
24.6
ms
📄
AI Papers
arXiv
·
1h
1 hour ago
LoRA: Low-Rank Adaptation of
Large
Language
Models
Covered by
14 sources
See all sources covering this story
including
Martin Fowler
,
Towards Data Science
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LoRA: Low-Rank Adaptation of Large Language Models
🎯
RLHF
fareedkhan-dev.github.io
·
4d
4 days ago
Train
LLM
from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
🗣️
Large Language Models
medium.com
·
6h
6 hours ago
Large
Language
Models
: Architectures, Pretraining, and Roadmaps
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Large Language Models: Architectures, Pretraining, and Roadmaps
🧠
LLM Training
Hugging Face
·
4h
4 hours ago
HRM-Text: Efficient
Pretraining
Beyond Scaling
Covers
sapientinc/HRM-Text: HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for HRM-Text: Efficient Pretraining Beyond Scaling
🗣️
Large Language Models
medium.com
·
8h
8 hours ago
Temperature and Sampling in
Transformers
: How LLMs Decide the Next Word
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Temperature and Sampling in Transformers: How LLMs Decide the Next Word
🗣️
Large Language Models
Machine Learning Mastery
·
2d
2 days ago
Clustering Unstructured Text with
LLM
Embeddings and HDBSCAN
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Clustering Unstructured Text with LLM Embeddings and HDBSCAN
🧠
LLM Training
Nature
·
6d
6 days ago
Memorization in
large
language
models
in medicine prevalence characteristics and implications
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Memorization in large language models in medicine prevalence characteristics and implications
🤖
AI/ML
medium.com
·
5h
5 hours ago
The Coming War Between Memory and Compute in AI Systems
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Coming War Between Memory and Compute in AI Systems
🗣️
Large Language Models
IT之家
·
12h
12 hours ago
富士通介绍 PHOTON 框架:1.2B 模型多查询性能 475 倍于
Transformer
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for 富士通介绍 PHOTON 框架:1.2B 模型多查询性能 475 倍于 Transformer
🧠
LLM
ByteByteGo Newsletter
·
1d
1 day ago
Large
Language
Models
vs Small
Language
Models
Covers
6 stories
See all stories this covers
including
Attention is all you need (2017)
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Large Language Models vs Small Language Models
🤖
AI Development
medium.com
·
16h
16 hours ago
Why I Stopped Focusing on ML Algorithms and Started Focusing on Data and Systems
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Why I Stopped Focusing on ML Algorithms and Started Focusing on Data and Systems
⚡
Transformers
astledsa.substack.com
·
5d
5 days ago
Tree
Transformers
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tree Transformers
🧠
LLM Training
biorxiv.org
·
2d
2 days ago
CellTosg2Sequence: A Unified Text-Omics-Signaling-Graph
Large
Language
Model
for Single-Cell Analysis
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for CellTosg2Sequence: A Unified Text-Omics-Signaling-Graph Large Language Model for Single-Cell Analysis
🤖
人工智能
SiliconANGLE
·
1h
1 hour ago
Mirendil raises $200M to speed up scientific
research
with AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Mirendil raises $200M to speed up scientific research with AI
🗣️
Large Language Models
link.aps.org
·
2d
2 days ago
Transformer-based
operator learning framework for self-energy in strongly correlated systems
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Transformer-based operator learning framework for self-energy in strongly correlated systems
🧠
AI Models
Bloomberg
·
3d
3 days ago
Tech Disruptors: Invisible Technologies on
RLHF
and
LLM
Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tech Disruptors: Invisible Technologies on RLHF and LLM Training
🗣️
Large Language Models
medium.com
·
2d
2 days ago
How LLMs Actually Work
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How LLMs Actually Work
🤖
AI Development
zentara.co
·
14h
14 hours ago
LLM
Refusal Behavior on Open-Weight
Model
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLM Refusal Behavior on Open-Weight Model
🤖
AI Development
TechSpot
·
8h
8 hours ago
OpenAI debuts Jalapeño, a custom chip built to cut ChatGPT costs and reduce Nvidia reliance
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for OpenAI debuts Jalapeño, a custom chip built to cut ChatGPT costs and reduce Nvidia reliance
🗣️
Large Language Models
Ai2
·
5h
5 hours ago
Which tokens does a hybrid
model
predict better?
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Which tokens does a hybrid model predict better?
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report