Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.org·5d
🧠Large Language Models (LLMs)
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.org·1d
🧠Large Language Models (LLMs)
Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play
arxiv.org·1d
🧠Large Language Models (LLMs)
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
arxiv.org·1d
⚡Real-time AI Systems
IVGAE-TAMA-BO: A novel temporal dynamic variational graph model for link prediction in global food trade networks with momentum structural memory and Bayesian o...
arxiv.org·1d
🧠Large Language Models (LLMs)
PORTool: Tool-Use LLM Training with Rewarded Tree
arxiv.org·5d
🧠Large Language Models (LLMs)
Modulation of temporal decision-making in a deep reinforcement learning agent under the dual-task paradigm
arxiv.org·1d
⚡Real-time AI Systems
Balanced Multimodal Learning via Mutual Information
arxiv.org·1d
🧠Large Language Models (LLMs)
Thought-For-Food: Reasoning Chain Induced Food Visual Question Answering
arxiv.org·1d
🧠Large Language Models (LLMs)
MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
arxiv.org·1d
🧠Large Language Models (LLMs)
Context-Aware Stochastic Modeling of Consumer Energy Resource Aggregators in Electricity Markets
arxiv.org·2d
🌐Distributed LLM Systems
Towards Reliable Pediatric Brain Tumor Segmentation: Task-Specific nnU-Net Enhancements
arxiv.org·1d
🔢Quantization of LLMs
CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks
arxiv.org·1d
🧠Large Language Models (LLMs)
FGO MythBusters: Explaining how Kalman Filter variants achieve the same performance as FGO in navigation applications
arxiv.org·1d
🔢Quantization of LLMs
Value Drifts: Tracing Value Alignment During LLM Post-Training
arxiv.org·5d
🧠Large Language Models (LLMs)
X-TRACK: Physics-Aware xLSTM for Realistic Vehicle Trajectory Prediction
arxiv.org·1d
⚡Real-time AI Systems
Temporal Fusion Transformer for Multi-Horizon Probabilistic Forecasting of Weekly Retail Sales
arxiv.org·1d
🧠Large Language Models (LLMs)
Feature-Guided Analysis of Neural Networks: A Replication Study
arxiv.org·1d
📊AI Performance Profiling
Can SAEs reveal and mitigate racial biases of LLMs in healthcare?
arxiv.org·1d
🧠Large Language Models (LLMs)