🌳 Decision-Time Planning - sworddish · Scour

ROSUM-MCTS: Monte Carlo Tree Search-Inspired HDL Code Summarization with Structural Rewards

💬LLMs Academic

Less-relevant results

Nex-N2-mini: A 35B Model Built for Autonomous Agents

🧪Agent Evaluation

hackernoon.com

Think Fast and Far: Long-Horizon Online POMDP Planning via Rapid State Sampling

🃏Imperfect Information Games Academic

Monte Carlo Pass Search: Using Trajectory Generation for 3D Counterfactual Pass Evaluation in Football

🃏Imperfect Information Games Academic

A Temporal Spatial Minimax Rate for Smoothly-Varying Distributions in Wasserstein Space

🃏Imperfect Information Games Academic

Global-Local Monte Carlo Tree Search in Vision-Language Models for Text-to-3D Indoor Scene Generation

💬LLMs Academic

Segment-level Tree Search for Long Meeting Document Summarization

💬LLMs Academic

Amortized Nonlinear Model Predictive Control

🧩Neural-Symbolic AI Academic

Forecast and Model Predictive Control of Distributed Energy Resource Aggregators for Net-Demand Balancing

🧪Agent Evaluation Academic

LATTEArena: An Evaluation Framework for LLM-powered Tabular Feature Engineering (Extended Version)

💬LLMs Academic

RedEdit: Agentic Red-Teaming of Image Safety Classifiers via MCTS-Guided Photo-Editing

🧪Agent Evaluation Academic

Agentic Search for Counterfactual Recourse under Fixed LLM Budgets

🧪Agent Evaluation Academic

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

💬LLMs Academic

Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies

🧪Agent Evaluation Academic

Unlocking feedforward capabilities in Model Predictive Control algorithms to deal with measurable disturbances

🃏Imperfect Information Games Academic

Literature-Guided Minimax Optimization of Virtual Epilepsy Neurostimulation

🃏Imperfect Information Games Academic

Adaptive Model Predictive Control of Nonlinear Generic Urban Air Mobility Using Linear Parameter-Varying Systems

🧩Neural-Symbolic AI Academic

Two to Tango: Coupled Task-Reference Selection for Safe LLM Fine-tuning

💬LLMs Academic

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

🧩Neural-Symbolic AI Academic

Information-Theoretic Bounds for Sparse Covariance Estimation in the Vertical-Split Distributed Model

💬LLMs Academic
