Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🧠 Deep Learning
Neural Networks, Convolutional Networks, Model Training, GPU
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
7306
posts in
9.0
ms
MegaTrain
: Full Precision Training of
100B
+ Parameter Large Language Models on a Single GPU
🗣️
Large Language Models
arxiv.org
·
2d
·
Hacker News
,
r/artificial
DeepFocus-BP
: Error-Aware Adaptive
Backpropagation
via Dynamic Alpha-Beta Routing (Achieving 66% FLOPs Reduction with Improved Accuracy)
🧠
Neural Networks
zenodo.org
·
6d
·
Hacker News
milanm/AutoGrad-Engine
: A complete GPT language model (training and inference) in ~600 lines of pure C#, zero dependencies
🗣️
Large Language Models
github.com
·
20h
·
Hacker News
Low-Rank Key Value Attention: Reducing
KV
Cache Memory and
Maintaining
Head Diversity
🤖
Transformers
fin.ai
·
18h
·
Hacker News
Zero-Shot Alignment: Harm Detection via
Incongruent
Attention
Mechanisms
∂
Automatic Differentiation
lesswrong.com
·
1d
Inference
Arena
– new
benchmark
of local inference and training
🗣️
Large Language Models
kvark.github.io
·
4d
·
Hacker News
Beyond
ReconVLA
:
Annotation-Free
Visual Grounding via Language-Attention Masked Reconstruction
🤖
AI
hackernoon.com
·
2d
PiTorch
: ML on
Baremetal
Raspberry Pis
🔥
PyTorch
masonjwang.com
·
1d
·
Hacker News
AI breakthrough cuts energy use by 100x while
boosting
accuracy
🧠
Neural Networks
sciencedaily.com
·
4d
·
Hacker News
,
r/singularity
Continual
learning for AI agents
∂
Automatic Differentiation
blog.langchain.com
·
4d
·
Hacker News
Agent Labs:
Workload-Harness
Fit
🗣️
Large Language Models
akashbajwa.co
·
6d
·
Hacker News
Towards
Knowledgeable
Deep Research: Framework and
Benchmark
∂
Automatic Differentiation
arxiv.org
·
7h
Spilling
the Neural
Tea
: A Journey Down the Side-Channel
🧠
Neural Networks
sigarch.org
·
3d
·
Hacker News
LLM
inference
engine from
scratch
in C++
🗣️
Large Language Models
anirudhsathiya.com
·
4d
·
Hacker News
SPUTNIKAI/LeechTransformer
: Leech-Lila: A Geometric Attention Transformer(Language Model) with the Leech Lattice Attention
🗣️
Large Language Models
github.com
·
1d
·
Hacker News
Internal noise in deep neural networks:
interplay
of depth,
neuron
number, and noise injection step
🧠
Neural Networks
arxiv.org
·
7h
Data
Warmup
: Complexity-Aware
Curricula
for Efficient Diffusion Training
🧠
Neural Networks
arxiv.org
·
7h
Scaling-Aware
Data
Selection
for End-to-End Autonomous Driving Systems
📊
Optimization
arxiv.org
·
7h
Stochastic Gradient
Descent
in the
Saddle-to-Saddle
Regime of Deep Linear Networks
📊
Optimization
arxiv.org
·
1d
DeepFocus-BP
: Error-Aware Adaptive Backpropagation via Dynamic Alpha-Beta Routing (Achieving 66% FLOPs Reduction with Improved Accuracy) - SOTA NLP Confirmed v3. (
Resnet
FAIL)
🧠
Neural Networks
zenodo.org
·
5d
·
Hacker News
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help