Decision-Time Planning


ROSUM-MCTS: Monte Carlo Tree Search-Inspired HDL Code Summarization with Structural Rewards

💬 LLMs | Content type: Academic
arxiv.org
Less-relevant results

Nex-N2-mini: A 35B Model Built for Autonomous Agents

🧪 Agent Evaluation
hackernoon.com

Think Fast and Far: Long-Horizon Online POMDP Planning via Rapid State Sampling

🃏 Imperfect Information Games | Content type: Academic
arxiv.org

Monte Carlo Pass Search: Using Trajectory Generation for 3D Counterfactual Pass Evaluation in Football

🃏 Imperfect Information Games | Content type: Academic
arxiv.org

A Temporal Spatial Minimax Rate for Smoothly-Varying Distributions in Wasserstein Space

🃏 Imperfect Information Games | Content type: Academic
arxiv.org

Global-Local Monte Carlo Tree Search in Vision-Language Models for Text-to-3D Indoor Scene Generation

💬 LLMs | Content type: Academic
arxiv.org

Segment-level Tree Search for Long Meeting Document Summarization

💬 LLMs | Content type: Academic
arxiv.org

Amortized Nonlinear Model Predictive Control

🧩 Neural-Symbolic AI | Content type: Academic
arxiv.org

Forecast and Model Predictive Control of Distributed Energy Resource Aggregators for Net-Demand Balancing

🧪 Agent Evaluation | Content type: Academic
arxiv.org

LATTEArena: An Evaluation Framework for LLM-powered Tabular Feature Engineering (Extended Version)

💬 LLMs | Content type: Academic
arxiv.org

RedEdit: Agentic Red-Teaming of Image Safety Classifiers via MCTS-Guided Photo-Editing

🧪 Agent Evaluation | Content type: Academic
arxiv.org

Agentic Search for Counterfactual Recourse under Fixed LLM Budgets

🧪 Agent Evaluation | Content type: Academic
arxiv.org

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

💬 LLMs | Content type: Academic
arxiv.org

Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies

🧪 Agent Evaluation | Content type: Academic
arxiv.org

Unlocking feedforward capabilities in Model Predictive Control algorithms to deal with measurable disturbances

🃏 Imperfect Information Games | Content type: Academic
arxiv.org

Literature-Guided Minimax Optimization of Virtual Epilepsy Neurostimulation

🃏 Imperfect Information Games | Content type: Academic
arxiv.org

Adaptive Model Predictive Control of Nonlinear Generic Urban Air Mobility Using Linear Parameter-Varying Systems

🧩 Neural-Symbolic AI | Content type: Academic
arxiv.org

Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning

💬 LLMs | Content type: Academic
arxiv.org

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

🧩 Neural-Symbolic AI | Content type: Academic
arxiv.org

Information-Theoretic Bounds for Sparse Covariance Estimation in the Vertical-Split Distributed Model

💬 LLMs | Content type: Academic
arxiv.org
