Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLM
🤖 LLM
Specific
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
1911
posts in
9.2
ms
How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an
LLM
?
🤖
LLM Inference
Content type:
Blog
medium.com
·
3d
3 days ago
Actions for How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?
SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption
🤖
LLM Inference
eprint.iacr.org
·
2d
2 days ago
Actions for SecLoRA: Secure Aggregation of Low-Rank Matrix Products via Functional Encryption
Agentic AI for Insurance Underwriting: Beyond Chatbots and
Prompts
🤖
AI Agents
Content type:
Blog
blog.nashtechglobal.com
·
4d
4 days ago
Actions for Agentic AI for Insurance Underwriting: Beyond Chatbots and Prompts
Instruction Finetuning DeepSeek-R1-8B
Model
Using
LoRA
and NEFTune
🤖
LLM Inference
Content type:
Academic
arxiv.org
·
22h
22 hours ago
Actions for Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune
Quiz: Embeddings and Vector Databases With ChromaDB
🤖
LLM Inference
realpython.com
·
1d
1 day ago
Actions for Quiz: Embeddings and Vector Databases With ChromaDB
New comment by Ayaz_Saifi in "Ask HN: Who wants to be hired? (June 2026)"
🤖
AI Agents
drive.google.com
·
5d
5 days ago
·
Hacker News
Actions for New comment by Ayaz_Saifi in "Ask HN: Who wants to be hired? (June 2026)"
1-bit and 1.58 bit
LLM
Benchmarking on Jetson Orin Nano Super | Bonsai LM
🤖
LLM Inference
smolhub.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
‘Getting control where we can’—Europe wants sovereign AI but most of the chips are from the U.S.
🛡️
Anthropic
Content type:
News
fortune.com
·
1d
1 day ago
Actions for ‘Getting control where we can’—Europe wants sovereign AI but most of the chips are from the U.S.
What Are Tokens in LLMs?
🤖
LLM Inference
Content type:
Blog
bearisland.dev
·
4d
4 days ago
·
Hacker News
Actions for What Are Tokens in LLMs?
Here's a
llama.cpp
CLI Command builder.
🤖
LLM Inference
llamabuilding.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for Here's a llama.cpp CLI Command builder.
How LLMs Work?
🤖
LLM Inference
Content type:
Blog
medium.com
·
5d
5 days ago
Actions for How LLMs Work?
How to Defend Against
Prompt
Injection in Production
🤖
Agents
Content type:
Reference
leanpub.com
·
2d
2 days ago
·
DEV
Actions for How to Defend Against Prompt Injection in Production
How we fight GPU scarcity without compromise
🤖
LLM Inference
Content type:
Blog
equixly.com
·
5d
5 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
Using local LLMs for agentic coding
🤖
LLM Inference
Content type:
Blog
blog.alexewerlof.com
·
6d
6 days ago
Actions for Using local LLMs for agentic coding
New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
🤖
AI Agents
Content type:
Discussion
news.ycombinator.com
·
1d
1 day ago
·
Hacker News
Actions for New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
AuRA: Internalizing Audio Understanding into LLMs as
LoRA
🤖
LLM Inference
Content type:
Academic
arxiv.org
·
22h
22 hours ago
Actions for AuRA: Internalizing Audio Understanding into LLMs as LoRA
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
⚡
Vllm
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
2x GH200 for
LLM
inference, Part 2:
vLLM
, DeepSeek V4 Flash, and MTP
🤖
LLM Inference
Content type:
Blog
dnhkng.github.io
·
3d
3 days ago
Actions for 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic
RAG
🤖
AI Agents
Content type:
Blog
research.google
·
5d
5 days ago
Actions for Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG
Show HN: Run
Llama.cpp
In-Process from Java with Project Panama FFM
🤖
LLM Inference
deemwar-products.github.io
·
5d
5 days ago
·
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help