Qwen

Feeds to Scour
SubscribedAll
Scoured 73 posts in 6.2 ms

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🤖Machine Learning  Content type: Code
github.com··r/LocalLLaMA

Previewing nAnalyst, the layer that finally explains your network

 🧠Local LLMs
ntop.org·

ReasoningFlow: Discourse Structures for Understanding LLM Reasoning Traces

 🤖LLMs  Content type: Academic
arxiv.org·

Qwen3.7-Plus is Alibaba's bid to turn multimodal AI into a full-blown autonomous agent

 🤖AI Agents
the-decoder.com
·

China is moving beyond super-apps to embrace AI agents that do it all for you

 🤖Agents
digitaltrends.com·

SAE It Across Models: Explaining Features With Foreign NLA Verbalizers

 📐Embeddings
lesswrong.com·

These LLMs are the best at resisting Russian propaganda

 🟣Claude
arstechnica.com·

Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results

 🧠Local LLMs
xda-developers.com·

AIchain Skill: A Prompt as a Reusable Object

 🤖AI Agents  Content type: Code
github.com··DEV
Less-relevant results

know the mother tongue of your LLMs

 🧠OpenAI

FlexNPU: Transparent NPU Virtualization for Dynamic LLM Prefill-Decode Co-location

 🧠Local LLMs  Content type: Academic
arxiv.org·

Claude Mythos Glasswing: Why AI Vuln Discovery Terrifies Me

 🟣Claude  Content type: Blog  Content type: Discussion
tildalice.io·

RecursiveIntell/proveKV: Two-tier, receipted, content-addressed KV-cache pool. fib_k4_n32 cold tier + turbo_8bit hot tier. 18-20% lossless dPPL on real 1.7B LLM. Successor to kv-lossless-11x (archived).

 💎Obsidian  Content type: Code
github.com··r/LocalLLaMA

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

 🧠Local LLMs  Content type: Academic
arxiv.org·

martidu4/honey-ai: 🍯 All-in-one AI honeypot powered by local LLMs. SSH, HTTP, FTP, Telnet, SMTP, MySQL, Redis, Git, VNC, RDP — with canary tokens, tarpits, GZIP bombs, and threat intel reporting.

 🧠Local LLMs  Content type: Code
github.com··Hacker News

From Human Guidance to Autonomy: Agent Skill System for End-to-End LLM Deployment on Spatial NPUs

 🤖Transformers  Content type: Academic
arxiv.org·

Google DeepMind's Susan Zhang argues abundant AI content shifts the premium from raw intelligence to human relationships and social dynamics

 AI  Content type: News
digg.com·

Snapcompact: SoTA Compaction — Instant, Local, Free. Pick 3

 🧠Local LLMs  Content type: Blog
blog.can.ac·

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

 🤖LLMs  Content type: Academic
arxiv.org·

I ran local AI models on a six-year-old laptop with no GPU, and they actually worked

 🧠Local LLMs
xda-developers.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help