Attention ISN'T all you need?! New Qwen3 variant Brumby-14B-Base leverages Power Retention technique
venturebeat.comยท10h
๐Ÿ‘๏ธAttention Optimization
Flag this post
New AI models Cursor and Cognition (Windsurf) built on Chinese base models
linkedin.comยท2dยท
Discuss: r/China
๐Ÿค–AI Coding Tools
Flag this post
Trace Anything: Representing Any Video in 4D via Trajectory Fields
paperium.netยท2dยท
Discuss: DEV
๐ŸŽ๏ธTensorRT
Flag this post
How Transformer Models Detect Anomalies in System Logs
hackernoon.comยท1d
๐Ÿ“ŠGradient Accumulation
Flag this post
Using ensemble learning with hybrid graph neural networks and transformers to predict traffic in cities
arxiv.orgยท1h
๐ŸŽ๏ธTensorRT
Flag this post
Natural Building Blocks for Structured World Models: Theory, Evidence, and Scaling
arxiv.orgยท1h
๐Ÿ”„ONNX
Flag this post
Extensive FPGA and ASIC resource comparison for blind I/Q imbalance estimators and compensators
sciencedirect.comยท14h
๐ŸŽฏTensor Cores
Flag this post
Vision = Language: I Decoded VLM Tokens to See What AI 'Sees' ๐Ÿ”ฌ
reddit.comยท2dยท
Discuss: r/LocalLLaMA
๐Ÿ› Ml-eng
Flag this post
Panther: A Cost-Effective Privacy-Preserving Framework for GNN Training and Inference Services in Cloud Environments
arxiv.orgยท1d
๐Ÿ”„ONNX
Flag this post
Eyes on Target: Gaze-Aware Object Detection in Egocentric Video
arxiv.orgยท1d
๐Ÿ‘๏ธAttention Optimization
Flag this post
Redundancy Maximization as a Principle of Associative Memory Learning
arxiv.orgยท1h
๐Ÿ‘๏ธAttention Optimization
Flag this post
Measuring the Intrinsic Dimension of Earth Representations
arxiv.orgยท1h
๐ŸŽ๏ธTensorRT
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.orgยท1d
๐ŸŽ๏ธTensorRT
Flag this post
TA-LSDiff:Topology-Aware Diffusion Guided by a Level Set Energy for Pancreas Segmentation
arxiv.orgยท1d
๐ŸŽ๏ธTensorRT
Flag this post
Beyond Standard LLMs
magazine.sebastianraschka.comยท17hยท
Discuss: Hacker News, r/LLM
๐Ÿ‘๏ธAttention Optimization
Flag this post
Deep Learning Approach to Anomaly Detection in Enterprise ETL Processes with Autoencoders
arxiv.orgยท1d
๐Ÿ“ŠGradient Accumulation
Flag this post