Neural audio codecs: how to get audio into LLMs
kyutai.org·3d·
🎧Learned Audio
Flag this post
MEIcoder: Decoding Visual Stimuli from Neural Activity by Leveraging Most Exciting Inputs
arxiv.org·20h
🧠Neural Codecs
Flag this post
When Transformers Sing: Adapting SpectralKD for Text-Based Knowledge Distillation
towardsdatascience.com·1d
🧠Intelligence Compression
Flag this post
DeepSeek-OCR: Images Simplify Text for Large Language Models
heise.de·14h
🤖Advanced OCR
Flag this post
Wolfram Neural Networks Boot Camp; January 5–16
wolfram.com·9h
🧠Machine Learning
Flag this post
The Machine Learning Practitioner’s Guide to Fine-Tuning Language Models
machinelearningmastery.com·1d
📊Feed Optimization
Flag this post
Half-Quadratic Quantization of large machine learning models
dropbox.tech·2d
📊Quantization
Flag this post
Denosing Images of Cats and Dogs with Autoencoders
mayberay.bearblog.dev·3d·
🧠Neural Codecs
Flag this post
Retro Language Models: Rebuilding Karpathy's RNN in PyTorch
gilesthomas.com·5h·
Discuss: Hacker News
🧮Kolmogorov Bounds
Flag this post
Show HN: I spent $450 on GCP's Video API, so I built a local alternative
news.ycombinator.com·9h·
Discuss: Hacker News
🎬AV1 Encoding
Flag this post
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
arxiv.org·20h
🧠Neural Compression
Flag this post
Comparison: H.264 vs. H.265/HEVC vs. VP9
red5.net·2d·
Discuss: Hacker News
🎬Video Codecs
Flag this post
KNN: The Importance of Being Scaled
dev.to·8h·
Discuss: DEV
🧠Intelligence Compression
Flag this post
Decoding the Ear: A Framework for Objectifying Expressiveness from Human Preference Through Efficient Alignment
arxiv.org·20h
🎧Learned Audio
Flag this post
(2018) The Google Brain Team – Looking Back on 2017
blog.research.google·12h·
Discuss: Hacker News
🧠Machine Learning
Flag this post
Locking it down: A new technique to prevent LLM jailbreaks
news.sophos.com·14h
🧪Binary Fuzzing
Flag this post
When Models Manipulate Manifolds: The Geometry of a Counting Task
transformer-circuits.pub·3d·
Discuss: Hacker News
🌀Differential Geometry
Flag this post
DeLoad: Demand-Driven Short-Video Preloading with Scalable Watch-Time Estimation
arxiv.org·2d
📊Rate-Distortion Theory
Flag this post
Unraveling Emotions with Pre-Trained Models
arxiv.org·1d
🎧Learned Audio
Flag this post