🐿️ Scour
🌊 CBOR Streaming

Indefinite Length, Memory Efficiency, Large Data Sets, Parser States

Context Kills VRAM: How to Run LLMs on consumer GPUs | by Lyx | May, 2025 | Medium
medium.com·3d
💨Cache Optimization
ufrisk/MemProcFS
github.com·1d
🛠️Greaseweazle
Revitalizing Legacy Code
javapro.io·1d·
Discuss: Hacker News
🦋Format Evolution
Swiss boffins just trained a 'fully open' LLM on the Alps supercomputer
theregister.com·14h·
Discuss: Hacker News
🏴󠁧󠁢󠁳󠁣󠁴󠁿Scottish Computing
Orca Build System
orca-app.dev·6h·
Discuss: Lobsters, Hacker News, r/programming
🔄Language Evolution
Towards Serverless Processing of Spatiotemporal Big Data Queries
arxiv.org·1d
🌊Stream Processing
Conformal Information Pursuit for Interactively Guiding Large Language Models
arxiv.org·2d
🧠Intelligence Compression
A Survey of Large Language Models on Generative Graph Analytics: Query, Learning, and Applications
arxiv.org·2d
🕸️Graph Embeddings
CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark
arxiv.org·1d
Code Metrics
SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression
arxiv.org·1d
⚙️Compression Benchmarking
Kioshun - in-memory cache that's actually pretty fast
reddit.com·23h·
Discuss: r/golang
💨Cache Analysis
Bitchat: notes on the path forward ("yo!")
github.com·38m·
Discuss: Hacker News
🚀Indie Hacking
The Crucial Role of NUMA Awareness in High-Performance Deep Learning
towardsdatascience.com·15h
📊Performance Profiling
Accelerating generative AI development with fully managed MLflow 3.0 on Amazon SageMaker AI
aws.amazon.com·1h
🔄Reproducible Builds
On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study
arxiv.org·1d
💻Programming languages
MGAA: Multi-Granular Adaptive Allocation for Low-Rank Compression of LLMs
arxiv.org·2d
🧠Machine Learning
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models
arxiv.org·1d
🔗Parser Combinators
Monitoring Qwen 3 Agents with MLflow 3.x: End-to-End Tracing Tutorial
dev.to·1d·
Discuss: DEV
👁️Observatory Systems
Grok 4: Feature, Price, Access and More
dev.to·6h·
Discuss: DEV
🔗Hypermedia APIs
FOLC-Net: A Federated-Optimized Lightweight Architecture for Enhanced MRI Disease Diagnosis across Axial, Coronal, and Sagittal Views
arxiv.org·16h
💎Information Crystallography