The Evolution from RAG to Agentic RAG to Agent Memory
leoniemonigatti.comยท4hยท
Discuss: Hacker News
๐ŸŽ“Model Distillation
Flag this post
A Thesis and Playbook for Edge AI
ondeviceguy.substack.comยท1dยท
Discuss: Substack
โšกONNX Runtime
Flag this post
Adaptive Stemming via Graph-Augmented Recurrent Variational Autoencoders
dev.toยท2dยท
Discuss: DEV
๐ŸŽ๏ธTensorRT
Flag this post
Functional embeddings enable Aggregation of multi-area SEEG recordings over subjects and sessions
arxiv.orgยท1d
๐Ÿ“‰Model Quantization
Flag this post
H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.orgยท12h
โšกFlash Attention
Flag this post
A Comparative Analysis of LLM Adaptation: SFT, LoRA, and ICL in Data-Scarce Scenarios
arxiv.orgยท12h
๐ŸŽ“Model Distillation
Flag this post
Towards Automated Petrography
arxiv.orgยท12h
๐Ÿ“‰Model Quantization
Flag this post
How neuroscientists are using AI
thetransmitter.orgยท12h
โšกONNX Runtime
Flag this post
Synthesized Generative Modeling via Graph-Constrained Semantic Embedding
dev.toยท2dยท
Discuss: DEV
๐ŸŽ“Model Distillation
Flag this post
Benchmarking Generative AI Against Bayesian Optimization for Constrained Multi-Objective Inverse Design
arxiv.orgยท12h
๐ŸŽ“Model Distillation
Flag this post
CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks
arxiv.orgยท12h
๐ŸŽ๏ธTensorRT
Flag this post
Investigating Label Bias and Representational Sources of Age-Related Disparities in Medical Segmentation
arxiv.orgยท12h
๐ŸŽ๏ธTensorRT
Flag this post
It Doesnโ€™t Need to Be a Chatbot
towardsdatascience.comยท16h
๐Ÿค–AI Coding Tools
Flag this post
Disciplined Biconvex Programming
arxiv.orgยท12h
๐Ÿ“‰Model Quantization
Flag this post
Towards Reliable Pediatric Brain Tumor Segmentation: Task-Specific nnU-Net Enhancements
arxiv.orgยท12h
๐ŸŽ๏ธTensorRT
Flag this post
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
arxiv.orgยท12h
๐ŸŽ๏ธTensorRT
Flag this post
ReLaX-Net: Reusing Layers for Parameter-Efficient Physical Neural Networks
arxiv.orgยท12h
๐ŸŽฏTensor Cores
Flag this post
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.orgยท12h
๐ŸŽ๏ธTensorRT
Flag this post
Deep Learning Approach to Anomaly Detection in Enterprise ETL Processes with Autoencoders
arxiv.orgยท12h
๐ŸŽ“Model Distillation
Flag this post