GPU Computing, Browser APIs, Rust Graphics, Cross-platform Rendering
Platonic Representations for Poverty Mapping: Unified Vision-Language Codes or Agent-Induced Novelty?
arxiv.orgยท2d
Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following
arxiv.orgยท1d
ProtoN: Prototype Node Graph Neural Network for Unconstrained Multi-Impression Ear Recognition
arxiv.orgยท9h
Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models
arxiv.orgยท9h
Modality Bias in LVLMs: Analyzing and Mitigating Object Hallucination via Attention Lens
arxiv.orgยท2d
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
arxiv.orgยท1d
Concept Poisoning: Probing LLMs without probes
lesswrong.comยท1d
Reasoning Beyond Labels: Measuring LLM Sentiment in Low-Resource, Culturally Nuanced Contexts
arxiv.orgยท9h
Loading...Loading more...