BF16

Brain Float, Mixed Precision, Numeric Format, TPU, Training Stability

Google is reportedly turning to Intel to make its AI chips.

 🎯Tensor Cores  Content type: News
theverge.com

Google orders Intel Foundry to produce over three million TPUs for 2028 amid TSMC capacity crunch

 Flash Attention  Content type: News
tweaktown.com
Less-relevant results

EP217: Latency vs Throughput vs Bandwidth

 ⏱️CUDA Events  Content type: News  Content type: Blog
blog.bytebytego.com

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 🎯Tensor Cores
openjdk.org · r/java

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 Cuda  Content type: Academic
arxiv.org

How Will the AI IC Market Evolve Amid Rising Artificial Intelligence Adoption Through 2034?

 🎯Tensor Cores  Content type: Blog

Nvidia Nemotron 3 Ultra

 🏎️TensorRT

Intel Foundry Challenges TSMC Dominance With Massive Google AI Chip Order

 Flash Attention  Content type: News
hothardware.com

Alphabet taps Intel to make three million in-house chips

 Flash Attention
oodaloop.com

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

 🛠Ml-eng  Content type: Blog
dnhkng.github.io

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

 🔢cuBLAS  Content type: Code
github.com · Hacker News

Qwen 3.6 27B AutoRound GGUF, need your feedback

 🔄ONNX

Google reportedly orders at least three million chips from Intel to arrive in 2028, as TSMC struggles to keep up with the AI boom

 Flash Attention  Content type: News
pcgamer.com · Hacker News

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

 📉Model Quantization  Content type: News
digg.com

Google orders chips from Intel and Nvidia is testing its tech, as TSMC’s grip on AI starts to strain

 🎮NVIDIA  Content type: News
thenextweb.com

Benchmarking dots.tts on Strix Halo

 🔥PyTorch
sleepingrobots.com

Unsloth Gemma 4 QAT

 📉Model Quantization
unsloth.ai

OpenAI files confidentially for IPO

 🔓Open-source  Content type: News
sherwood.news

hanxiao/omni-macos: Native macOS semantic search over your local files - text, images, audio, video in one vector space, on-device on Apple silicon.

 🤖Automation  Content type: Code
github.com · Hacker News

Apollo and Blackstone Close $35 Billion Chip Debt Deal for Anthropic, Backstopped by Broadcom

 Flash Attention
easternherald.com
