🧠 LLM Inference - emschwartz · Scour

VSORA Board Chair Sandra Rivera on Solutions for AI Inference and LLM Processing

semiwiki.com·1d

🏗️LLM Infrastructure

Alibaba launches open source AI model RynnBrain for robotics

techzine.eu·22h

LookML: An Alternative Semantic Layer Approach to build a Reliable AI Analytics Agent with BigQuery

pub.towardsai.net·2d

Securing GenAI: Vol 4 — Fundamentals of AI model security

pub.towardsai.net·1d

🛡️AI Security

SAE Feature Matchmaking (Layer-to-Layer)

lesswrong.com·1d

marketplace.visualstudio.com·19h

How Yelp Built “Yelp Assistant”

blog.bytebytego.com·1d

💳Content Monetization

Gemini thinking | Gemini API | Google AI for Developers

ai.google.dev·1d

Reinforcement Inference: Leveraging Uncertainty for Self-Correcting Language Model Reasoning

arxiv.org·1d

🏗️LLM Infrastructure

Tokens of AI Bias

chinamediaproject.org·2d

🛡️AI Security

What do LLMs think when you don't tell them what to think about?

together.ai·5d

🏗️LLM Infrastructure

HQP: Sensitivity-Aware Hybrid Quantization and Pruning for Ultra-Low-Latency Edge AI Inference

arxiv.org·2d

📱Edge AI Optimization

Designing and Using Combinators: The Essence of Functional Programming

cse.chalmers.se·1d·

Discuss: Hacker News

💻Programming languages

NotebookLM: The AI that only learns from you

byandrev.dev·3d·

Discuss: Hacker News

👨‍💻AI Coding

Last30Days: A Recency-Aware Research API for X, Reddit, and the Web

lumify.ai·21h·

Discuss: Hacker News

📊Feed Optimization

Show HN: 289x speedup over MLP using Spectral Graphs

zenodo.org·3d·

Discuss: Hacker News

The control layer for AI

blog.dottxt.ai·4d·

Discuss: Hacker News

🛡️AI Security

Data Modeling for the Agentic Era: Semantics, Speed, and Stewardship

rilldata.com·1d·

Discuss: Hacker News

🔄Incremental Computation

Ask HN: Are past LLM models getting dumber?

news.ycombinator.com·21h·

Discuss: Hacker News

🏆LLM Benchmarking

Reliability of LLMs as medical assistants for the general public: a randomized preregistered study

nature.com·1d·

Discuss: Hacker News

🏆LLM Benchmarking

Loading more...