Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 Language Models
transformer architecture, fine-tuning, pretraining, RLHF
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
186683
posts in
19.5
ms
Language Models: Does the brain really know what
word
is coming next?
🧠
LLMs
elifesciences.org
·
3d
Fine-Tuning
: the series
⚙️
MLOps
byhand.ai
·
6d
GenNA
: Conditional generation of
nucleotide
sequences guided by natural-language annotations
🧠
LLMs
biorxiv.org
·
5d
G-Loss:
Graph-Guided
Fine-Tuning
of Language Models
🧠
LLMs
arxiv.org
·
1d
How does
Reinforcement
Learning
Affect
Models
🧠
LLMs
lesswrong.com
·
3d
Adaptive and Fine-grained Module-wise Expert Pruning for Efficient
LoRA-MoE
Fine-Tuning
⚙️
MLOps
arxiv.org
·
19h
Full
Fine-Tuning
✅
Dev Best Practices
byhand.ai
·
6d
Unsupervised
protein language models learn patterns of
enzyme
function
🗄️
Vector Databases
biorxiv.org
·
6d
Applications of the Transformer Architecture in
AI-Assisted
English Reading
Comprehension
🧠
LLMs
arxiv.org
·
2d
Language models know what matters and the
foundations
of
ethics
better than you
🧠
LLMs
lesswrong.com
·
3d
Information
Extraction
from Electricity
Invoices
with General-Purpose Large Language Models
🧠
LLMs
arxiv.org
·
19h
Three Models of
RLHF
Annotation
: Extension, Evidence, and Authority
🧠
LLMs
arxiv.org
·
1d
An Empirical Study of Methods for
SFTing
Opaque
Reasoning Models
⚙️
MLOps
lesswrong.com
·
6d
TLPO
: Token-Level Policy Optimization for
Mitigating
Language Confusion in Large Language Models
🧠
LLMs
arxiv.org
·
19h
Carbon-Taxed
Transformers: A Green Compression Pipeline for
Overgrown
Language Models
🧠
LLMs
arxiv.org
·
1d
Identifying the Achilles' Heel: An Iterative Method for
Dynamically
Uncovering
Factual
Errors in Large Language Models
🧠
LLMs
arxiv.org
·
19h
A Dual-Task
Paradigm
to Investigate Sentence
Comprehension
Strategies in Language Models
🧠
LLMs
arxiv.org
·
19h
Analysing
Lightweight Large Language Models for Biomedical Named Entity Recognition on Diverse
Ouput
Formats
🧠
LLMs
arxiv.org
·
19h
Evaluating
Temporal
Consistency
in Multi-Turn Language Models
🧠
LLMs
arxiv.org
·
2d
PAINT: Partial-Solution Adaptive
Interpolated
Training for Self-Distilled
Reasoners
🧠
LLMs
arxiv.org
·
19h
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help