Transformers

Feeds to Scour
SubscribedAll
Scoured 561 posts in 12.7 ms

MLPerf and the rise of latency-aware LLM benchmarking

馃摑NLP
edn.com

I finally built the central AI hub I've been wanting, and Open WebUI made it stupidly simple

馃摑NLP
xda-developers.com

Benchmarking Large Language Models for Safety Data Extraction

馃挰Prompt EngineeringContent type: Academic
arxiv.org

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

馃摑NLP
aermia.comHacker News

Agentic AI for Insurance Underwriting: Beyond Chatbots and Prompts

馃挰Prompt EngineeringContent type: Blog

Quantum circuits help AI overcome memory limitations with minimal new parameters

馃挰Prompt Engineering
phys.org

Treble Technologies and Hugging Face Address Benchmark of Automatic Speech Recognition Models

馃摑NLP
audioxpress.com

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

馃摑NLPContent type: Academic
arxiv.org

AI Agents Running Businesses: Andon Labs on Project Vend

馃挰Prompt Engineering
startuphub.ai

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

馃摑NLPContent type: Code
github.comHacker News

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

馃摑NLP
smolhub.comr/LocalLLaMA

How Confident Are AI Classifiers About Their Own Confidence?

馃摑NLPContent type: Blog

Should LLM Agents Decide in Social Simulations? Comparing Finite-State and LLM-Based Decision Policies

馃Agentic AIContent type: Academic
arxiv.org

SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation

馃МEmbeddingsContent type: Academic
arxiv.org

A wild idea: Abstract reality using ontology

馃挰Prompt EngineeringContent type: Discussion

Reachability and asymptotics of Gaussian Transformer dynamics

馃OllamaContent type: Academic
arxiv.org

google/gemma-4-12B-it-qat-q4_0-gguf

馃Ollama
huggingface.co

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

馃МEmbeddingsContent type: Academic
arxiv.org

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

馃摓Function CallingContent type: Code
github.comHacker News, r/LLM

RePAIR: Predictive Self-Supervised Representation Learning in Chess

馃МEmbeddingsContent type: Academic
arxiv.org
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help