📉 Model Quantization - miterion · Scour

Dense vs Sparse vs Multi-Vector Embeddings Explained: What Every AI Engineer Should Know

pub.towardsai.net·12h

Reversal Invariance in Autoregressive Language Models

arxiv.org·11h

🏎️TensorRT

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

dev.to·19h· Discuss: DEV

⚡Flash Attention

EVTAR: End-to-End Try on with Additional Unpaired Visual Reference

arxiv.org·11h

🏎️TensorRT

TA-LSDiff: Topology-Aware Diffusion Guided by a Level Set Energy for Pancreas Segmentation

arxiv.org·11h

🏎️TensorRT

Efficiency vs. Alignment: Investigating Safety and Fairness Risks in Parameter-Efficient Fine-Tuning of LLMs

arxiv.org·11h

Qwen3 VL 30b a3b is pure love

reddit.com·1d· Discuss: r/LocalLLaMA

Thought-For-Food: Reasoning Chain Induced Food Visual Question Answering

arxiv.org·11h

🧩Attention Kernels

Physics-Informed Neural Network Frameworks for the Analysis of Engineering and Biological Dynamical Systems Governed by Ordinary Differential Equations

arxiv.org·11h

⚡ONNX Runtime

NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion

arxiv.org·11h

Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning

arxiv.org·11h

🏎️TensorRT

High Resolution Seismic Waveform Generation using Denoising Diffusion

arxiv.org·11h

🏎️TensorRT

Identification of Capture Phases in Nanopore Protein Sequencing Data Using a Deep Learning Model

arxiv.org·11h

MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts

arxiv.org·11h

ZoFia: Zero-Shot Fake News Detection with Entity-Guided Retrieval and Multi-LLM Interaction

arxiv.org·11h

Defining Energy Indicators for Impact Identification on Aerospace Composites: A Physics-Informed Machine Learning Perspective

arxiv.org·11h

🏎️TensorRT

ReLaX-Net: Reusing Layers for Parameter-Efficient Physical Neural Networks

arxiv.org·11h

📊Gradient Accumulation

Towards 1000-fold Electron Microscopy Image Compression for Connectomics via VQ-VAE with Transformer Prior

arxiv.org·11h

🏎️TensorRT

Automatically Finding Rule-Based Neurons in OthelloGPT

arxiv.org·11h

⚡ONNX Runtime

Panther: A Cost-Effective Privacy-Preserving Framework for GNN Training and Inference Services in Cloud Environments

arxiv.org·11h
