✨ Model optimizations in LLMs - pleto · Scour

Ternary public-key cryptosystem

🔧Systems-level optimizations for LLM serving Academic

SEAM: Shortcut-Aware Real-Time Detection of Scripted vs. Spontaneous Speech for Interview Guardrails

🧠Large Language Models (LLMs) Academic

Gated Bidirectional Linear Attention for Generative Retrieval

🔍Retrieval-augmented generation Academic

AI Level of Detail: Distance-Aware ML Model Precision Selection for Real-Time Human Motion Prediction in Games

⚡Real-time AI Systems Academic

Information-Theoretic Bounds for Sparse Covariance Estimation in the Vertical-Split Distributed Model

🧠Large Language Models (LLMs) Academic

CSI Phase Averaging for High-Sensitivity Wi-Fi Sensing in Low-Multipath Environments

🔢Quantization of LLMs Academic

EEGDancer: Dynamic Emotion Latent Space Masked Modeling with Reinforcement Learning for EEG Continuous Emotion Prediction

⚡Real-time AI Systems Academic

P-Cast Precision in FP8 Attention: Sink-Induced Collapse and the Optimality of S=2^8

🧠Large Language Models (LLMs) Academic
