Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃挰 LLMs
Specific
GPT, Large Language Models, Transformers, NLP
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
187351
posts in
17.5
ms
RNN
to Transformer
NMT
: PyTorch Migration with 2.8x BLEU Gain
聽
馃
Neural Networks
tildalice.io
路
6d
Learning to
Orchestrate
Agents in Natural Language with the
Conductor
聽
馃
AI Agents
openreview.net
路
3d
路
Hacker News
Select to Think: Unlocking
SLM
Potential with Local
Sufficiency
聽
馃搻
ML Theory
arxiv.org
路
23h
epscylonb/1386.ai.rocm
: A lightweight transformer language model built from scratch in PyTorch, trained on a single consumer GPU with a full pipeline for data processing, pretraining, and instruction tuning.
聽
馃
Machine Learning
github.com
路
2d
路
Hacker News
Associative-State
Universal Transformers: Sparse Retrieval Meets Structured
Recurrence
聽
馃搻
ML Theory
arxiv.org
路
23h
Shorthand
for Thought: Compressing LLM Reasoning via Entropy-Guided
Supertokens
聽
馃攳
RAG
arxiv.org
路
23h
MoRFI
:
Monotonic
Sparse Autoencoder Feature Identification
聽
馃
Machine Learning
arxiv.org
路
23h
Information
Extraction
from Electricity
Invoices
with General-Purpose Large Language Models
聽
馃攳
RAG
arxiv.org
路
23h
Language
Anchoring
: A
Systematic
Method for LLM Multilingual Adaptation
聽
馃
AI Agents
github.com
路
4d
路
Hacker News
TLPO
: Token-Level Policy Optimization for
Mitigating
Language Confusion in Large Language Models
聽
馃
AI Agents
arxiv.org
路
23h
LLM-Flax
: Generalizable Robotic Task Planning via
Neuro-Symbolic
Approaches with Large Language Models
聽
馃
AI Agents
arxiv.org
路
23h
AsishKumarDalal/memoryllm
: using
differntiable
neural computer architecture with GPT2 to provide memory
聽
馃
Machine Learning
github.com
路
5d
路
DEV
Structural
Generalization
on
SLOG
without Hand-Written Rules
聽
馃搻
ML Theory
arxiv.org
路
23h
Adaptive and Fine-grained Module-wise Expert Pruning for Efficient
LoRA-MoE
Fine-Tuning
聽
馃
Machine Learning
arxiv.org
路
23h
CoQuant
: Joint Weight-Activation
Subspace
Projection for Mixed-Precision LLMs
聽
馃搻
ML Theory
arxiv.org
路
23h
Showoff
Saturday: Using LLMs +
Zod
to create a deterministic parsing engine for educational content.
聽
馃攳
RAG
github.com
路
6d
路
r/webdev
Human-in-the-Loop Benchmarking of Heterogeneous LLMs for Automated
Competency
Assessment in Secondary Level
Mathematics
聽
馃搻
ML Theory
arxiv.org
路
23h
Ceci
n'est pas une
explication
: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems
聽
馃搻
ML Theory
arxiv.org
路
23h
Evaluating Large Language Models on Computer Science University
Exams
in Data
Structures
聽
馃搻
ML Theory
arxiv.org
路
2d
Addressing Performance
Saturation
for LLM RL via
Precise
Entropy Curve Control
聽
馃幃
Reinforcement Learning
arxiv.org
路
23h
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help