✨ Model optimizations in LLMs - pleto · Scour

Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error

arxiv.org·5d

🧠Large Language Models (LLMs)

Flag this post

Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games

arxiv.org·1d

⚡Real-time AI Systems

Flag this post

IVGAE-TAMA-BO: A novel temporal dynamic variational graph model for link prediction in global food trade networks with momentum structural memory and Bayesian o...

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

PORTool: Tool-Use LLM Training with Rewarded Tree

arxiv.org·5d

🧠Large Language Models (LLMs)

Flag this post

Modulation of temporal decision-making in a deep reinforcement learning agent under the dual-task paradigm

arxiv.org·1d

⚡Real-time AI Systems

Flag this post

Balanced Multimodal Learning via Mutual Information

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Thought-For-Food: Reasoning Chain Induced Food Visual Question Answering

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Context-Aware Stochastic Modeling of Consumer Energy Resource Aggregators in Electricity Markets

arxiv.org·2d

🌐Distributed LLM Systems

Flag this post

Towards Reliable Pediatric Brain Tumor Segmentation: Task-Specific nnU-Net Enhancements

arxiv.org·1d

🔢Quantization of LLMs

Flag this post

CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

FGO MythBusters: Explaining how Kalman Filter variants achieve the same performance as FGO in navigation applications

arxiv.org·1d

🔢Quantization of LLMs

Flag this post

Value Drifts: Tracing Value Alignment During LLM Post-Training

arxiv.org·5d

🧠Large Language Models (LLMs)

Flag this post

X-TRACK: Physics-Aware xLSTM for Realistic Vehicle Trajectory Prediction

arxiv.org·1d

⚡Real-time AI Systems

Flag this post

Temporal Fusion Transformer for Multi-Horizon Probabilistic Forecasting of Weekly Retail Sales

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Feature-Guided Analysis of Neural Networks: A Replication Study

arxiv.org·1d

📊AI Performance Profiling

Flag this post

Can SAEs reveal and mitigate racial biases of LLMs in healthcare?

arxiv.org·1d

🧠Large Language Models (LLMs)

Flag this post

Fast Answering Pattern-Constrained Reachability Queries with Two-Dimensional Reachability Index

arxiv.org·1d

🌐Distributed LLM Systems

Flag this post

Loading more...