Weekly Bookmarks
inkdroid.org·1d
Open-source
AstuteRAG-FQA: Task-Aware Retrieval-Augmented Generation Framework for Proprietary Data Challenges in Financial Question Answering
arxiv.org·52m
Ruff
TOON (Token-Oriented Object Notation): The Smarter, Lighter JSON for LLMs
ONNX Runtime
build system tradeoffs
Build Optimization
From Lossy to Lossless Reasoning
AI Coding Tools
A Three-Stage Bayesian Transfer Learning Framework to Improve Predictions in Data-Scarce Domains
arxiv.org·3d
Model Distillation
A Quantitative Framework to Predict Wait-Time Impacts Due to AI-Triage Devices in a Multi-AI, Multi-Disease Workflow
arxiv.org·52m
CUDA Events
Semantic search with embeddings in JavaScript: a hands-on example using LangChain and Ollama
Ml-eng
[Open Source] We deployed numerous agents in production and ended up building our own GenAI framework
MLOps
A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
ONNX Runtime
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
arxiv.org·52m
TensorRT
Abstract: This paper presents a novel framework for enhancing logistics efficiency in cross-border e-commerce by leveraging automated predictive analytic...
freederia.com·27m
ONNX Runtime
Unhinged Uncensored Model Evolution: Feedback on Satyr V0.1 to Shape Future Releases!
Ruff
Enhanced SPH Turbulence Modeling via Adaptive Kernel Correction & Multi-Scale Data Assimilation
Kernel Fusion
Generating Accurate and Detailed Captions for High-Resolution Images
arxiv.org·52m
TensorRT
Qwen3 VL 30b a3b is pure love
Model Quantization