rdksupe's Feed

Feeds to Scour
SubscribedAll
Scoured 5,139 posts in 27.1 ms
Learn how Ray Serve LLM + vLLM stack achieves up to 24x higher throughput with direct streaming, HAProxy integration, and a new vLLM Ray executor backend. Read more ›
Covered by Google Cloud Blog
Discussed on Hacker News
Feeds
How positional embeddings, multi-head attention, residual connections, and feed-forward networks come together inside GPT models Read more ›
Feeds
⚙️MLOpsTildAlice·
Track ML experiments with MLflow in under 10 minutes — log params, metrics, and models in 3 lines of Python. Real benchmarks on sklearn and PyTorch. Read more ›
Feeds
Groups like ShinyHunters are demonstrating that attackers do not necessarily need malware or zero-day exploits to cause massive damage. The post appeared first on <a href=" Read more ›
Feeds
A vulnerability chain dubbed AutoJack in Microsoft's AutoGen Studio interface for prototyping AI agents could let attackers manipulate an agent into executing arbitrary commands on its host system simply by visiting a malicious webpage. [...] Read more ›
Feeds
ISC High Performance 2026 -- NVIDIA today announced the NVIDIA Vera Rubin platform delivers world-class supercomputers for science, combining native double-precision (FP64) performance, NVIDIA CUDA-X™ libraries and the full-stack capabilities of the NVIDIA AI platform. Read more ›
Feeds
📚RAGmedium.com
·
What is Retrieval Augmented generation? Read more ›
Feeds
When deploying a new VMware Cloud Foundation (VCF) 9.1 Fleet, users specify either a Simple or High Availability (HA) deployment model along with the desired deployment size: Small, Medium or Large. Unlike components such as NSX Manager, VCF Operations and VCF Automation, where deployment size and availability are configured independently, VCF Management Services (VCFMS) determines […] Read more ›
Feeds
In this article, you will learn how to build AI agents that can browse and interact with real websites using Playwright, browser-use, and LangGraph. Read more ›
Feeds
This is the third piece in a series on a prompt injection vector that leaked a system-prompt secret from GPT-4o using nothing but a… Read more ›
Feeds
Learn data pipeline best practices for architecture, ingestion, transformation, and deployment. Discover how modern data teams build efficient, reliable pipelines at scale. Read more ›
Feeds
Weaviate Cloud is now free to start across the entire product suite. Read more ›
Feeds
Introduction to Deep Learning and Neural Networks: The Only Guide You Need to Start Read more ›
Feeds
Rather than treating retrieval as a fixed recipe, in this blog we derive it from first principles. We explore why BM25 looks the way it… Read more ›
Feeds
From pretraining to RLHF/GRPO — every algorithm hand-written in pure PyTorch. Read more ›
Discussed on Hacker News
Feeds
🔥PyTorchmedium.com
·
If you come from a science or engineering background, you’ve probably run into the word tensor more times than you can count. If you… Read more ›
Feeds
Traditional evaluation of machine learning (ML) models typically focuses on achieving the maximum possible accuracy irrespective of the computational cost. In this article, we propose a paradigm shift towards evaluating performance based on computational effort-explicitly defined here as the total number of gradient descent steps required to reach an acceptable level of accuracy with high probability. Building upon the concept of computational... Read more ›
Feeds
How cloud-native tooling is enabling distributed AI inference on heterogeneous edge hardware, slashing latency and infrastructure costs for production workloads. Forward-thinking platform teams are moving AI inference out of centralized GPU data centers and into distributed Kubernetes clusters running closer to data sources, cutting response latency from hundreds of milliseconds to single digits. Mature cloud-native tooling including KServe, vLLM, and eBPF-based observability has made this sh... Read more ›
Discussed on DEV
Feeds
AI-assisted development is changing more than how software is written. It might also force us to reconsider the processes we use to identify, track, and manage vulnerabilities. Read more ›
Feeds
ISC High Performance 2026 -- NVIDIA today announced that a record 35 NVIDIA AI HPC supercomputers are in development across Europe — equipping more than 3 million researchers with next-generation infrastructure for continental AI, accelerated science and industrial innovation. Read more ›
Covered by Neowin, NVIDIA Blog
Feeds
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous post
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Discover
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help