Culture Cartography: Mapping the Landscape of Cultural Knowledge
arxiv.org·1d
✨Model optimizations in LLMs
Aligning LLM agents with human learning and adjustment behavior: a dual agent approach
arxiv.org·20h
🤖Agents using LLMs
Identifying the Periodicity of Information in Natural Language
arxiv.org·1d
🔍Retrieval-augmented generation
Reevaluating Self-Consistency Scaling in Multi-Agent Systems
arxiv.org·20h
✨Model optimizations in LLMs
LLMs Position Themselves as More Rational Than Humans: Emergence of AI Self-Awareness Measured Through Game Theory
arxiv.org·20h
🤖Agents using LLMs
Balanced Multimodal Learning via Mutual Information
arxiv.org·20h
🔢Quantization of LLMs
VISTA Score: Verification In Sequential Turn-based Assessment
arxiv.org·1d
💬Prompt optimizations for LLM serving
GrowthHacker: Automated Off-Policy Evaluation Optimization Using Code-Modifying LLM Agents
arxiv.org·20h
✨Model optimizations in LLMs
Analyzing Sustainability Messaging in Large-Scale Corporate Social Media
arxiv.org·20h
📊AI Performance Profiling
MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
arxiv.org·20h
💬Prompt optimizations for LLM serving
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.org·20h
⚡Real-time AI Systems
When to Trust the Answer: Question-Aligned Semantic Nearest Neighbor Entropy for Safer Surgical VQA
arxiv.org·20h
🔍Retrieval-augmented generation
Do Math Reasoning LLMs Help Predict the Impact of Public Transit Events?
arxiv.org·20h
✨Model optimizations in LLMs
Can SAEs reveal and mitigate racial biases of LLMs in healthcare?
arxiv.org·20h
✨Model optimizations in LLMs
Fleming-VL: Towards Universal Medical Visual Reasoning with Multimodal LLMs
arxiv.org·20h
🔍Retrieval-augmented generation
Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning
arxiv.org·20h
✨Model optimizations in LLMs
SpatialTraceGen: High-Fidelity Traces for Efficient VLM Spatial Reasoning Distillation
arxiv.org·20h
✨Model optimizations in LLMs
Reward Collapse in Aligning Large Language Models
arxiv.org·4d
💬Prompt optimizations for LLM serving
PlotCraft: Pushing the Limits of LLMs for Complex and Interactive Data Visualization
arxiv.org·20h
📊AI Performance Profiling
Thought Branches: Interpreting LLM Reasoning Requires Resampling
arxiv.org·1d
✨Model optimizations in LLMs