joshwonghc's Feed

Feeds to Scour
SubscribedAll
Scoured 2426 posts in 12.7 ms

AI Agent Security Guide: How to Prevent Prompt Injection Attack

 ✍️Prompt Engineering  Content type: Blog
medium.com
·

How to Run an LLM Locally: Ultimate Guide to Local AI 2026

 🧠LLMs  Content type: Blog

Training the Model Was Only 20% of the Job: Lessons from Building an MLOps Platform

 🔧MLOps  Content type: Blog
medium.com
·

LLM Observability: What To Instrument and How To Act on It

 🔍LLM Tracing  Content type: Blog
blog.n8n.io·

A Small RAG Evaluation Harness for Production-Oriented LLM Systems

 💻AI Engineering  Content type: Blog
itstedpark.medium.com·

Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM

 🌐Open Source AI
everylocalai.com··DEV

BeamWeaver - LangChain/LangGraph-style agents and workflows for Elixir

 🤖AI Agents
elixirstatus.com·

Building AI-Powered Applications with Spring Boot AI: A Practical Guide for Java Developers

 💻AI Engineering  Content type: Blog
medium.com
·

Detecting AI-specific threats in Claude Enterprise from the Compliance API: a prefilter + LLM-as-judge pipeline with Sigma rules

 ✍️Prompt Engineering
papermtn.co.uk··r/netsec

My prompt is better than your prompt – how to optimize your prompts in the age of agentic AI

 🧠LLMs  Content type: Blog
metrics.blogg.gu.se·

DiffusionGemma 26B A4B results on my 5090

 🌐Open Source AI

Guardian Runtime – Local firewall for AI coding agents and runaway costs

 🤖AI Agents

Quiz: Embeddings and Vector Databases With ChromaDB

 📚RAG
realpython.com·

12B Gemma 4 QAT Deployment with NVIDIA L4, Cloud Run, MCP, and Antigravity CLI

 🌐Open Source AI  Content type: Blog
medium.com
·

The Era of Multi-Agent Imagined Experience

 🤖AI Agents
odyssey.ml··Hacker News

Why LLMs (still) lack taste

 🧠LLMs
Sign up or login to customize your feed and get personalized topic recommendations

microsoft/LLMLingua: [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

 💻AI Engineering  Content type: Code
github.com··DEV

milvuslite-kit configuration over code for vector search and rag workflows

 📚RAG  Content type: Blog
elanthirayan.medium.com··DEV

high-performance classification API (beats GPT-5.4-mini)

 🧠LLMs  Content type: Discussion
classer.ai··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help