ML Research

Feeds to Scour
SubscribedAll
Scoured 192 posts in 8.9 ms

What Makes a Desired Graph for Relational Deep Learning?

 📚CS Research  Content type: Academic

Boosting Direct Preference Optimization with Penalization

 📚CS Research  Content type: Academic
arxiv.org·

Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

PAWS: Preference Learning with Advantage-Weighted Segments

 📚CS Research  Content type: Academic
arxiv.org·

Flatland: The Adventures of Gradient Descent with Large Step Sizes

 📚CS Research  Content type: Academic
arxiv.org·

Conformal Bayes under Label Shift: Post-Hoc Calibration vs. In-Training Adaptation

 📚CS Research  Content type: Academic
arxiv.org·

Different Layers, Different Manifolds: Module-Wise Weight-Space Geometry in Transformer Optimization

 📚CS Research  Content type: Academic
arxiv.org·

UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

I built a graph-memory layer on top of turbovec for local/constrained RAG — looking for feedback

 🎨Creative Coding  Content type: Code
github.com··r/LocalLLaMA

Learning-Augmented Approximation for Unrelated-Machines Makespan Scheduling

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Preserving Plasticity in Continual Learning via Dynamical Isometry

 📚CS Research  Content type: Academic
arxiv.org·

ProcessThinker: Enhancing Multi-modal Large Language Models Reasoning via Rollout-based Process Reward

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Spatiotemporal Imputation with Graph-Informed Flow Matching

 📚CS Research  Content type: Academic
arxiv.org·

Context-Driven Incremental Compression for Multi-Turn Dialogue Generation

 📚CS Research  Content type: Academic

nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding

 📚CS Research  Content type: Academic
arxiv.org·

In Defense of Information Leakage in Concept-based Models

 📚CS Research  Content type: Academic
arxiv.org·

Decoding Insect Song: A Multitask Semisupervised Orthoptera Bioacoustic Classifier

 📚CS Research  Content type: Academic
arxiv.org·

Minibatch Selection via Partition Matroid Constrained Gradient Matching

 📚CS Research  Content type: Academic
arxiv.org·

Tree-Structured Orthonormal Decomposition of the Aitchison Simplex

 📚CS Research  Content type: Academic
arxiv.org·

PianoKontext: Expressive Performance Rendering from Deadpan Context

 📚CS Research  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help