LLMs
large language models, GPT, transformers, foundation models
epscylonb/1386.ai.rocm: A lightweight transformer language model built from scratch in PyTorch, trained on a single consumer GPU with a full pipeline for data processing, pretraining, and instruction tuning.
AI · github.com · 2d · Hacker News
Shorthand for Thought: Compressing LLM Reasoning via Entropy-Guided Supertokens
AI · arxiv.org · 21h
Information Extraction from Electricity Invoices with General-Purpose Large Language Models
AI · arxiv.org · 21h
allocz/slm: zero-dependency TUI LLM chat
Reinforcement Learning · github.com · 1d · Hacker News, r/golang
Structural Generalization on SLOG without Hand-Written Rules
AI · arxiv.org · 21h
Factorized Latent Reasoning for LLM-based Recommendation
Reinforcement Learning · arxiv.org · 21h
Language Anchoring: A Systematic Method for LLM Multilingual Adaptation
Reinforcement Learning · github.com · 4d · Hacker News
What Kind of Language is Easy to Language-Model Under Curriculum Learning?
Reinforcement Learning · arxiv.org · 21h
Select to Think: Unlocking SLM Potential with Local Sufficiency
Reinforcement Learning · arxiv.org · 21h
AsishKumarDalal/memoryllm: using differentiable neural computer architecture with GPT2 to provide memory
AI · github.com · 5d · DEV
CoQuant: Joint Weight-Activation Subspace Projection for Mixed-Precision LLMs
Reinforcement Learning · arxiv.org · 21h
itayinbarr/little-coder: A coding agent optimized to smaller LLMs
Reinforcement Learning · github.com · 3d · Hacker News
Delineating Knowledge Boundaries for Honest Large Vision-Language Models
AI · arxiv.org · 21h
Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation
AI · arxiv.org · 21h
TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models
Reinforcement Learning · arxiv.org · 21h
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
AI · arxiv.org · 1d
MoRFI: Monotonic Sparse Autoencoder Feature Identification
AI · arxiv.org · 21h
LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation
AI · arxiv.org · 1d
Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control
Reinforcement Learning · arxiv.org · 21h
When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models
AI · arxiv.org · 21h