LLM


How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?

 🤖LLM Inference  Content type: Blog
medium.com·

SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption

 🤖LLM Inference
eprint.iacr.org·

Agentic AI for Insurance Underwriting: Beyond Chatbots and Prompts

 🤖AI Agents  Content type: Blog

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

 🤖LLM Inference  Content type: Academic
arxiv.org·

Quiz: Embeddings and Vector Databases With ChromaDB

 🤖LLM Inference
realpython.com·

New comment by Ayaz_Saifi in "Ask HN: Who wants to be hired? (June 2026)"

 🤖AI Agents

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🤖LLM Inference
smolhub.com··r/LocalLLaMA

‘Getting control where we can’—Europe wants sovereign AI but most of the chips are from the U.S.

 🛡️Anthropic  Content type: News
fortune.com·

What Are Tokens in LLMs?

 🤖LLM Inference  Content type: Blog

Here's a llama.cpp CLI Command builder.

 🤖LLM Inference

How LLMs Work?

 🤖LLM Inference  Content type: Blog
medium.com·

How to Defend Against Prompt Injection in Production

 🤖Agents  Content type: Reference
leanpub.com··DEV

How we fight GPU scarcity without compromise

 🤖LLM Inference  Content type: Blog
equixly.com··Hacker News

Using local LLMs for agentic coding

 🤖LLM Inference  Content type: Blog
blog.alexewerlof.com·

New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"

 🤖AI Agents  Content type: Discussion

AuRA: Internalizing Audio Understanding into LLMs as LoRA

 🤖LLM Inference  Content type: Academic
arxiv.org·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 Vllm  Content type: Code
github.com··Hacker News

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

 🤖LLM Inference  Content type: Blog
dnhkng.github.io·

Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG

 🤖AI Agents  Content type: Blog
research.google·

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

 🤖LLM Inference