LLM

Feeds to Scour
SubscribedAll
Scoured 219 posts in 35.9 ms

BYTE PAIR ENCODING

 🔤Tokenization  Content type: Blog
dev.to··DEV

SLUUG Talk: Demystifying Large Language Models on Linux

 🤖GenAI  Content type: Code
github.com··DEV

AIs like ChatGPT fall apart in classic 'Stroop' psychological test — and that could stand in the way of achieving artificial general intelligence

 🧠LLMs
techradar.com
·

The LLM Gateway Pattern: Why Every Kubernetes-Based AI App Needs One

 🤖AI Tools
freecodecamp.org·

Open-LLM-VTuber Review: Offline AI Companion with Live2D

 🧠LLMs  Content type: Blog
dev.to··DEV

How attackers are gaining access to LLM inference

 🤖AI Tools
malware.news·

Deep Learning Weekly: Issue 458

 🤖Large Language Models

Speculators v0.5.0: DFlash support and online training

 Inference
developers.redhat.com·

Can LLMs save themselves from verbosity?

 🤖Large Language Models  Content type: Blog
dev.to··DEV

huawei-csl/KVarN: KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

 Inference  Content type: Code
github.com··Hacker News

AI Paper Review: Training Language Models to Follow Instructions with Human Feedback (InstructGPT)

 🤖GenAI
freecodecamp.org·

I Built an Adversarial Eval Framework and Attacked 5 LLMs — Every Single One Failed

 🤖Large Language Models  Content type: Blog
dev.to··DEV

How LLMs Actually Work: A Developer's Mental Model

 🤖Large Language Models  Content type: Blog
dev.to··DEV

LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams

 🧠LLM Training  Content type: Blog
dev.to··DEV

I Accidentally Spent $400 on GPT-4o in One Month. Here's How to Never Do That.

 🧠Claude  Content type: Blog
dev.to··DEV

llama.cpp b9455 Finally Caught vLLM: 70t/s on 2x3090 Qwen 27B UQ8

 🤖LLM Inference  Content type: Blog
dev.to··DEV

I Benchmarked 3 Local LLMs on My Laptop — Here's What the Numbers Actually Show

 🧠LLMs  Content type: Blog
dev.to··DEV

KV cache quantization: what FP8/INT8 K and V actually buy you, and where they break

 Quantization  Content type: Blog
dev.to··DEV

BAGEN: LLM Agents Waste 44% of Tokens on Tasks They'll Fail

 🤖AI  Content type: Blog
dev.to··DEV

I Fuzzed 12 LLMs With 19 Payloads — Here What Broke

 🧠LLMs  Content type: Blog
dev.to··DEV

No more posts from buckman's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help