Intermediate Representation, Code Generation, Optimization Passes, Compiler Backend
Accelerate LLM Inference with ONNX Runtime on Arm Neoverse-powered Microsoft Cobalt 100
community.arm.com·1d
A practical blueprint for evaluating conversational AI at scale
dropbox.tech·1d
ICL Optimized Fragility
arxiv.org·1d
Fine-tuning Done Right in Model Editing
arxiv.org·4d
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
arxiv.org·5h
HalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate Hallucinations in Retrieval-Augmented Generation
arxiv.org·1d
Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
arxiv.org·5h