Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃 LLMs
Specific
large language models, GPT, inference, fine-tuning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
149488
posts in
10.0
ms
AsyncTLS
: Efficient Generative LLM Inference with
Asynchronous
Two-level Sparse Attention
聽
馃殌
Inference
arxiv.org
路
6h
LLM (Large Language Model)
聽
馃
AI Engineering
ministryoftesting.com
路
2d
Thoughts
on Large Language Models (2023)
聽
馃殌
Inference
nikola.plejic.com
路
3d
路
Hacker News
milanm/AutoGrad-Engine
: A complete GPT language model (training and inference) in ~600 lines of pure C#, zero dependencies
聽
馃殌
Inference
github.com
路
19h
路
Hacker News
Using LLMs as
Classifiers
聽
馃敡
MLOps
medium.com
路
4d
What
Rebuilding
GPT-2 From Scratch
Taught
Me About How LLMs Really Work
聽
馃殌
Inference
medium.com
路
3d
How I Reduced ML
Inferencing
Resource Usage at Scale by 9x @
LMWN
聽
馃殌
Inference
korntewin-b.medium.com
路
4d
Reasoning-Based
Refinement
of Unsupervised Text
Clusters
with LLMs
聽
馃敡
MLOps
arxiv.org
路
6h
Controlling
Distributional
Bias in Multi-Round LLM Generation via
KL-Optimized
Fine-Tuning
聽
馃殌
Inference
arxiv.org
路
2d
Exploring
Continual
Fine-Tuning for Enhancing Language
Ability
in Large Language Model
聽
馃殌
Inference
arxiv.org
路
2d
Large Language Model Post-Training: A
Unified
View of Off-Policy and On-Policy Learning
聽
馃殌
Inference
arxiv.org
路
6h
The Model
Agreed
, But Didn't Learn:
Diagnosing
Surface Compliance in Large Language Models
聽
馃殌
Inference
arxiv.org
路
2d
Learning is
Forgetting
: LLM Training As
Lossy
Compression
聽
馃殌
Inference
arxiv.org
路
6h
Application-Driven
Pedagogical
Knowledge Optimization of Open-Source LLMs via Reinforcement Learning and
Supervised
Fine-Tuning
聽
馃殌
Inference
arxiv.org
路
1d
Rethinking
Data
Mixing
from the Perspective of Large Language Models
聽
馃殌
Inference
arxiv.org
路
6h
Can Large Language Models
Reinvent
Foundational
Algorithms?
聽
馃殌
Inference
arxiv.org
路
2d
Flux
Attention: Context-Aware Hybrid Attention for Efficient LLMs
Inference
聽
馃殌
Inference
arxiv.org
路
6h
Towards Identification and
Intervention
of Safety-Critical
Parameters
in Large Language Models
聽
馃殌
Inference
arxiv.org
路
6h
ART: Attention Replacement
Technique
to Improve
Factuality
in LLMs
聽
馃殌
Inference
arxiv.org
路
1d
Robust LLM Performance
Certification
via Constrained Maximum
Likelihood
Estimation
聽
馃殌
Inference
arxiv.org
路
3d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help