LSP Protocol, IDE Integration, Code Completion, Syntax Analysis
LLM-Crowdsourced: A Benchmark-Free Paradigm for Mutual Evaluation of Large Language Models
arxiv.orgยท2d
CIMR: Contextualized Iterative Multimodal Reasoning for Robust Instruction Following in LVLMs
arxiv.orgยท2d
Preface to "Simulacra and Simulation: Sections from the Work of Janus"
lesswrong.comยท13h
From Prompt to Pipeline: Large Language Models for Scientific Workflow Development in Bioinformatics
arxiv.orgยท4d
Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?
arxiv.orgยท4d
From Sufficiency to Reflection: Reinforcement-Guided Thinking Quality in Retrieval-Augmented Reasoning for LLMs
arxiv.orgยท2d
MemTool: Optimizing Short-Term Memory Management for Dynamic Tool Calling in LLM Agent Multi-Turn Conversations
arxiv.orgยท3d
Loading...Loading more...