Chinese AI

Feeds to Scour
SubscribedAll
Scoured 222 posts in 39.8 ms

DeepSeek Made AI Cheap. Now It Needs Billions to Keep It Cheap.

 🆕New AI  Content type: News  Content type: Blog

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

 🐘pgvector  Content type: Academic
arxiv.org·

Show HN: One API Key for 45 AI Models – Pay per Token, OpenAI Compatible

 🤖AI

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🤖AI  Content type: Code
github.com··Hacker News

This AI agent startup ditched Anthropic for DeepSeek — and says it’s saving millions

 🔬AI Labs
thenewstack.io·

More US firms turn to China’s DeepSeek over pricey Silicon Valley AI

 🎭Claude

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

 🎛️Feed Filtering  Content type: News

DeepSeek enters the fight for token volume, Anthropic continues to dominate spend

 🤖AI  Content type: Blog
vercel.com··Hacker News

Alibaba says it is opening Qwen to third-party AI agents, starting with KFC, Mixue, and others, and Qwen's app serves 100M+ daily lifestyle services engagements...

 🤖AI
techmeme.com·

New comment by Stitch4223 in "DeepSeek V4 Pro beats GPT-5.5 Pro on precision"

 📊Model Serving Economics  Content type: Discussion

Integrate on-device AI models into your app using Core AI - WWDC26 - Videos

 🤖AI

Florian Brand, Prime Intellect research engineer, adopts Gemma 4 E4B 6-bit quantized as his primary local Mac LLM

 🤖AI  Content type: News
digg.com··Hacker News

ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models

 🤖AI  Content type: Academic
arxiv.org·

AIchain Skill: A Prompt as a Reusable Object

 🤖AI  Content type: Code
github.com··DEV

zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability

 🤖AI  Content type: Code
github.com··Hacker News

A Comprehensive Anatomy of Human and DeepSeek-R1 LLM Mathematical Reasoning

 Fast AI Inference  Content type: Academic
arxiv.org·

Decision-Aware Memory Cards: Counterfactual-Inspired Context Selection and Compression for Tool-Using LLM Agents

 🤖AI  Content type: Academic
arxiv.org·

john-rocky/apple-silicon-llm-bench: Neutral, reproducible benchmark for local LLMs on Apple Silicon (Mac · iPhone · iPad) — MLX, llama.cpp, CoreML, Apple Foundation Models

 🤖AI  Content type: Code
github.com··Hacker News

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

 🤖AI  Content type: Academic
arxiv.org·

alibaba/open-code-review: Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thread-safety, XSS, SQL injection), OpenAI & Anthropic compatible.

 💻Coding Agents  Content type: Code
github.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help