Qualcomm unveiled two data center AI accelerators, the AI200 (shipping in 2026) and AI250 (planned for 2027), marking a major pivot from its mobile and wireless roots. Both will be offered in full, liquid-cooled server racks, matching Nvidia and AMD's rack-scale systems that cluster up to 72 accelerators as a single computer for training and serving advanced AI models.
Built on Qualcomm's Hexagon NPUs originally designed for smartphones, the chips reflect a strategy to scale on-device AI architecture to data center workloads. Shares rose 11% on the news. The new entrants intensify competition with Nvidia and AMD in the high-growth AI data center market, where multi-accelerator, liquid-cooled racks are standard.
Related:
Alibaba-backed Moonshot releases its second AI update in four months as China's AI race heats up
MiniMax introduced MiniMax-M2, an open MIT-licensed Mixture of Experts model tuned for coding and agentic workflows, with weights on Hugging Face. Although the model has 229B total parameters, it activates only ~10B parameters per token to keep memory and tail latency low during plan-act-verify loops across tools like shell, browser, retrieval, and editors. M2 features "interleaved thinking," emitting internal reasoning in ... blocks that must be preserved across turns; the team warns that removing these harms multi-step and tool-use performance.
Another notable open-source release this week was Kimi K2 Thinking, a Mixture-of-Experts model with 1 trillion total parameters, of which 32 billion are active per token. Built on the Kimi K2 base and optimized for reasoning and agentic abilities, it supports a 256k context window and uses INT4 quantization for efficiency, with a reported training cost of only $4.6 million. Notably, it can execute 200-300 tool calls autonomously, demonstrating advanced agent capabilities previously seen only in closed models. Despite lacking a full technical report, analyses show Kimi K2 inherits and refines DeepSeek's architecture (expanding MoE experts and vocabulary while optimizing inference cost), representing both a direct evolution of DeepSeek's design and a culmination of open-source innovations like FlashAttention and MuonClip.
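For readers who want a feel for what "active parameters" means in the two releases above, here is a minimal sketch of top-k expert routing plus the active-parameter arithmetic. The eight toy experts, the router scores, and `k=2` are illustrative assumptions, not the actual configuration of MiniMax-M2 or Kimi K2 Thinking; only the parameter counts come from the text.

```python
import math

def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    # Only these k experts run for this token; the rest stay idle.
    return [(i, e / total) for i, e in zip(top, exps)]

def active_fraction(active_params, total_params):
    """Share of the model's parameters used for a single token."""
    return active_params / total_params

# Hypothetical router scores for one token over 8 toy experts.
routes = top_k_route([0.1, 2.0, -1.0, 1.5, 0.3, 0.0, -0.5, 0.9], k=2)

# Parameter counts as reported in the summaries above.
minimax_m2 = active_fraction(10e9, 229e9)
kimi_k2 = active_fraction(32e9, 1e12)

print(routes)
print(f"MiniMax-M2: {minimax_m2:.1%} active; Kimi K2 Thinking: {kimi_k2:.1%} active")
```

On the reported figures, both models touch only a few percent of their weights per token (roughly 4.4% and 3.2%), which is where the memory-bandwidth and latency savings come from.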
Related:
Udio Says Users Can Download AI Songs for 48 Hours After Backlash to UMG Legal Settlement
Universal Music Group (UMG) settled its copyright lawsuit with AI-music startup Udio and signed "industry-first" licensing agreements to power a new AI music platform. The deal includes compensatory payments to UMG and new revenue opportunities for artists and songwriters, with opt-in controls for different parts of the service. Udio plans to relaunch next year as a subscription platform that lets users customize, stream, and share music within a "walled garden," strengthened by audio fingerprinting; pricing remains undisclosed.
Udio's current text-to-music generator (known for "BBL Drizzy") will remain available during the transition, but distribution will be restricted under the new model. Backlash to download restrictions after the UMG deal prompted Udio to reopen downloads for a 48-hour window starting Nov. 3 so users can export existing songs under prior terms, including commercial rights and creator ownership (with attribution required for free-tier users).
The settlement ends UMG's claims that Udio trained on copyrighted catalogs "en masse" and sets up a licensed, closed ecosystem for future AI music creation and monetization.
Related:
Microsoft to ship 60,000 Nvidia AI chips to UAE under US-approved deal
Microsoft will spend $15.2 billion in the UAE over four years, backed by U.S.-approved exports of Nvidia's most advanced AI chips and a large cloud buildout. The Commerce Department issued licenses beginning in September with strict cybersecurity and national security safeguards, enabling shipments of more than 60,000 Nvidia GPUs, including A100, H100, H200, and next-gen GB300 Grace Blackwell chips.
Microsoft says it has already amassed the equivalent of 21,500 A100-class GPUs in the UAE to serve models from OpenAI, Anthropic, open-source providers, and its own stack. The outlay includes a $1.5 billion equity stake in G42, over $4.6 billion in data center capex through 2025, and a further $7.9 billion from 2026-2029, with $5.5 billion for ongoing AI and cloud expansion.
The deal turns the UAE into a test case for U.S. AI export-control diplomacy and a regional anchor for American AI influence, despite criticism of potential back-channel risks relative to China restrictions. It also appears to contradict President Trump's televised comments that the most advanced Nvidia chips would not be exported, though the UAE licenses carry "gold standard" security conditions and tie into the UAE's pledge to invest $1.4 trillion in U.S. energy and AI-related projects.
TPU v7, Google's answer to Nvidia's Blackwell, is nearly here. Google's next-gen TPU reportedly matches Nvidia's latest chips in raw FP8 throughput and memory bandwidth while enabling much larger pod-scale deployments via a 3D torus plus optical switching fabric that trades low-hop switch topologies for extreme scalability.
Alibaba-backed Moonshot releases its second AI update in four months as China's AI race heats up. The updated Kimi K2 Thinking reportedly cost about $4.6 million to train and can autonomously select hundreds of tools to complete tasks, aiming to reduce human intervention.
GitHub is launching a hub for multiple AI coding agents. Copilot subscribers will get a dashboard to run, manage, and compare multiple third-party coding agents (including Codex, Claude, Jules, xAI, and Devin), alongside features like Plan Mode in VS Code and automated code-review tooling.
Cursor 2.0 shifts to in-house AI with Composer model and parallel agents. The update swaps in Composer, an in-house coding model optimized for codebase-wide search and low latency, and adds an interface that runs up to eight isolated parallel agents with browser integration, sandboxed terminals, and enterprise controls.
Microsoft AI's first in-house image generator MAI-Image-1 is now available. Microsoft says the model produces faster photorealistic and artistically lit images, especially of food, nature, and landscapes, and has been rolled into Bing Image Creator and Copilot Audio Expressions, with EU availability coming soon.
Google's AI Mode gets new agentic capabilities to help book event tickets and beauty appointments. The feature can autonomously search multiple sites in real time to find and link you to tickets and beauty or wellness appointments that match specific preferences and constraints.
Canva launches its own design model, adds new AI features to the platform. A new generative design model creates editable, multi-layered files across formats (from social posts to websites), powers an always-available assistant that can be @mentioned in projects, and integrates spreadsheets, mini-app widgets, Affinity tools, and ad analytics into Canva's workflow.
Sora for Android saw nearly half a million installs on its first day. Appfigures estimates about 470,000 first-day downloads across several markets (roughly 296,000 in the U.S.), far exceeding early iOS day-one numbers after OpenAI expanded availability and dropped invites.
Instacart Debuts White-Label AI Shopping Chatbot in Enterprise Push. The assistant, tested on Sprouts' site and available in Kroger's iPhone app, provides product recommendations as part of Instacart's push to sell white-label e-commerce AI tools to grocery chains.
Nvidia becomes first public company worth $5 trillion. Investors rallied on expectations of massive AI chip sales, new U.S. supercomputer deals, and strategic investments (including a $1B stake in Nokia and a pledge to invest up to $100B in OpenAI), pushing Nvidia's stock up more than 50% this year and keeping its GPUs scarce and highly sought for data-center AI workloads.
Coke's New AI-Generated Ad Required 100 Staff and 70,000 AI-Generated Clips, and It Still Looks Like Garbage. Despite using over 70,000 AI-generated clips and around 100 staff, the holiday spot largely avoids human faces, leans on uncanny animal-filled hyperreal landscapes, and has been widely criticized for disjointed, low-quality visuals.
Amazon launches AI infrastructure project, to power Anthropic's Claude model. The tech giant started Project Rainier last year to build an AI compute cluster spread across multiple data centers in the U.S. The cluster incorporates nearly half a million of Amazon's in-house Trainium2 chips.
Lambda inks multibillion-dollar AI infrastructure deal with Microsoft. Microsoft will add tens of thousands of Nvidia GPUs, including GB300 NVL72 systems, to expand its AI compute capacity.
Apple Nears $1 Billion-a-Year Deal to Use Google AI for Siri. Apple would pay roughly $1 billion annually for access to Google's 1.2 trillion-parameter Gemini model to power a planned Siri overhaul.
Google partners with Ambani's Reliance to offer free AI Pro access to millions of Jio users in India. Eligible Jio subscribers get 18 months of free access to Google's Gemini 2.5 Pro, expanded AI image/video and NotebookLM usage, 2 TB of cloud storage, and deeper Google Cloud TPU and Gemini Enterprise integration across Reliance's businesses.
China's Baidu says weekly robotaxi rides hit 250,000, same as Alphabet's Waymo this spring. Baidu reports its Apollo Go service delivered 250,000 fully driverless paid rides per week (totaling 17 million orders and 240 million kilometers) across multiple Chinese cities and several international markets.
Waymo's robotaxis are coming to three new cities. The company will seek approvals in Nevada and Michigan before launching, plans to add China-made Zeekr RTs with sixth-generation driverless tech, and expects to start serving riders in those cities likely next year.
Driverless Tech Firm Pony AI Raises $863 Million in HK Listing. The offering included a 15% overallotment and drew interest from investors including talks of a roughly $100 million participation by Uber; proceeds target scaling Level 4 robotaxi/robotruck services and R&D as Pony AI aims for profitability by 2028-29.
Shopify says AI traffic is up 7x since January, AI-driven orders are up 11x. Partnerships with OpenAI, Perplexity, and Microsoft Copilot, plus internal tools like Scout, are driving rapid growth in AI-driven traffic and orders by tapping merchant data to embed shopping into AI conversations and guide product decisions.
People Inc. forges AI licensing deal with Microsoft as Google traffic drops. As Google search traffic declines, People Inc. becomes a launch partner in Microsoft's publisher content marketplace, a pay-per-use model where buyers like Copilot can directly compensate publishers for content, amid efforts to block AI crawlers to force licensing talks.
Inception raises $50 million to build diffusion models for code and text. The startup plans diffusion-based models that refine outputs in parallel rather than sequentially, aiming for faster, more efficient code and large-text tasks, with its new Mercury model funded by a $50M seed led by Menlo Ventures.
Amazon Sues to Stop Perplexity From Using AI Tool to Buy Stuff. Amazon alleges Perplexity's Comet agent places orders on behalf of users without properly identifying itself, violating Amazon's terms and prompting a federal lawsuit accusing the startup of computer fraud.
Scaling Latent Reasoning via Looped Language Models. The authors show that recursively reusing weight-tied layers with an entropy-regularized adaptive early-exit mechanism yields 2-3× parameter-efficiency gains at scale.
Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries. Training an auxiliary head to predict a compact learned summary of a long future window, rather than multiple independent future tokens, improves long-range reasoning and yields up to ~5% gains on math and coding benchmarks at the 8B scale.
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning. This method trains models to produce an internal monologue and take discrete intermediate "actions," providing a dense, similarity-based reward at each step by comparing predicted actions to decomposed expert actions.
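As a rough illustration of a dense, similarity-based step reward like the one described above, the sketch below scores each predicted action against the matching expert action with Python's `difflib.SequenceMatcher`. The choice of similarity metric and the example math steps are assumptions for illustration; the summary does not specify how the paper computes similarity.

```python
from difflib import SequenceMatcher

def step_rewards(predicted_actions, expert_actions):
    """One reward in [0, 1] per step: string similarity to the expert's action."""
    return [SequenceMatcher(None, predicted, expert).ratio()
            for predicted, expert in zip(predicted_actions, expert_actions)]

# Hypothetical expert trajectory, decomposed into step-wise actions.
expert = ["factor x^2 - 1", "set each factor to zero", "x = 1 or x = -1"]
# Hypothetical model rollout: first step exact, later steps drift.
predicted = ["factor x^2 - 1", "set factors to zero", "x = 2"]

rewards = step_rewards(predicted, expert)
print(rewards)
```

Unlike a single pass/fail signal on the final answer, every step earns partial credit, so a rollout that starts well but ends wrong still produces a useful training signal.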
The End of Manual Decoding: Towards Truly End-to-End Language Models. Lightweight prediction heads augment transformers to dynamically set sampling parameters (like temperature and top-p) at each step so the model controls decoding end-to-end, matching or exceeding expert-tuned baselines and enabling natural-language steering of sampling.
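To make the idea of per-step sampling parameters concrete, here is a minimal sketch of temperature plus top-p (nucleus) sampling in which the two knobs change at every step. The `per_step_params` list is a hypothetical stand-in for what the paper's prediction heads would output; the heads themselves are not modeled here.

```python
import math
import random

def sample_token(logits, temperature, top_p, rng):
    """Sample one token index using temperature scaling and top-p filtering."""
    scaled = [logit / temperature for logit in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the smallest set of most-likely tokens whose mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # rng.choices renormalizes the kept probabilities implicitly.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]

rng = random.Random(0)
logits = [1.0, 3.0, 2.0]  # toy vocabulary of three tokens

# Hypothetical (temperature, top_p) pairs, one per decoding step.
per_step_params = [(0.7, 0.9), (1.2, 1.0), (0.5, 0.01)]
tokens = [sample_token(logits, t, p, rng) for t, p in per_step_params]
print(tokens)
```

With a very small `top_p`, as in the last step, only the single most likely token survives the filter, so sampling collapses to greedy decoding.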
Continuous Autoregressive Language Models. The approach replaces discrete next-token prediction with next-vector prediction by compressing K tokens into continuous vectors via an autoencoder and using a likelihood-free generative head with new evaluation/sampling methods to reduce autoregressive steps and compute.
Defeating the Training-Inference Mismatch via FP16. Switching mixed-precision from BF16 to FP16 during RL fine-tuning reduces numerical rounding errors between training and inference engines, removing the need for importance-sampling fixes, improving stability, and narrowing the deployment gap.
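The rounding argument can be seen with a small experiment: FP16 keeps 10 mantissa bits while BF16 keeps 7, so FP16 rounds a typical value more finely. The sketch below uses only the standard library; `to_bf16` is a hand-rolled emulation (round to nearest even on the upper 16 bits of a float32) written for illustration, and the test value 0.1 is arbitrary. It shows format rounding only and does not reproduce the paper's RL setup.

```python
import struct

def to_fp16(x):
    """Round x to IEEE half precision (5 exponent bits, 10 mantissa bits)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

def to_bf16(x):
    """Emulate bfloat16 (8 exponent bits, 7 mantissa bits) via float32 bits."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Round to nearest even before dropping the low 16 bits.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (bits + rounding_bias) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]

value = 0.1
fp16_error = abs(to_fp16(value) - value)
bf16_error = abs(to_bf16(value) - value)

print(f"fp16: {to_fp16(value)!r} (error {fp16_error:.2e})")
print(f"bf16: {to_bf16(value)!r} (error {bf16_error:.2e})")
```

The trade-off runs the other way for range: BF16 shares float32's 8-bit exponent, while FP16's 5-bit exponent overflows above 65504, which is why FP16 training usually relies on loss scaling.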
Kimi Linear: An Expressive, Efficient Attention Architecture. By combining Kimi Delta Attention and Multi-Head Latent Attention, this hybrid linear attention mechanism boosts efficiency and often matches or exceeds full attention on several tasks.
Remote Labor Index: Measuring AI Automation of Remote Work. This index evaluates AI agents on real-world freelance projects (comparing AI outputs to human deliverables via manual Elo-style pairwise judgments) to quantify how much of remote, computer-based work current models can automate (currently about 2.5%).
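For readers unfamiliar with Elo-style scoring from pairwise judgments, here is a minimal sketch of the standard Elo update. The starting ratings, the `k=32` step size, and the sequence of judgments are illustrative assumptions; the index's exact scoring procedure is not described in the summary above.

```python
def expected_score(rating_a, rating_b):
    """Probability that A is preferred over B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

def elo_update(rating_a, rating_b, score_a, k=32):
    """Update both ratings after one judgment; score_a is 1, 0.5, or 0."""
    expected_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - expected_a)
    new_b = rating_b + k * ((1 - score_a) - (1 - expected_a))
    return new_a, new_b

# Hypothetical judgments: 1 means the human deliverable was preferred,
# 0 means the AI output was preferred.
human, ai = 1000.0, 1000.0
for human_preferred in [1, 1, 0, 1]:
    human, ai = elo_update(human, ai, human_preferred)

print(round(human, 1), round(ai, 1))
```

Because each update is zero-sum, the two ratings always add up to their starting total, and the gap between them reflects how consistently one side wins the comparisons.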
arXiv Changes Rules After Getting Spammed With AI-Generated "Research" Papers. arXiv says the change aims to curb low-effort, AI-generated submissions (mostly superficial reviews and position pieces lacking substantive discussion of open research problems) by banning such computer science review and position papers.
Studio Ghibli, Bandai Namco, Square Enix demand OpenAI stop using their content to train AI. The groups allege OpenAI used members' copyrighted works as training data and in Sora 2 outputs without permission, and request the company stop using that content and formally address the copyright concerns.
Character.ai to ban teens from talking to its AI chatbots. Starting Nov. 25, under-18s will be blocked from conversational chats and limited to generating non-interactive content like videos amid lawsuits and scrutiny over harmful interactions and impersonation of real victims.
OpenAI Risks Billions as Court Weighs Privilege in Copyright Row. If plaintiffs gain access to internal Slack messages and attorney communications about OpenAI's deletion of pirated book data, the company could face evidence-spoliation sanctions and enhanced statutory damages that together may amount to billions.
Stability AI largely wins UK court battle against Getty Images over copyright and trademark. The ruling found Stability did not infringe Getty's copyrights by training Stable Diffusion on scraped images, though the judge did find limited instances of trademark infringement where Getty watermarks appeared in generated images.
Xania Monet is the first AI-powered artist to debut on a Billboard airplay chart, but she likely won't be the last. Her chart debut and multimillion-dollar record deal spotlight growing commercial acceptance of AI-created performers, even as musicians and industry figures voice ethical and labor concerns.
Jerome Powell says the AI hiring apocalypse is real: "Job creation is pretty close to zero". He warned that firms are citing AI-driven automation for layoffs and hiring freezes, leaving underlying job creation near zero despite ongoing GDP growth and heavy corporate AI investment.
The A.I.-Profits Drought and the Lessons of History. A Media Lab study and recent evidence suggest generative AI has boosted productivity mainly in narrow, customized use cases and personal "shadow" tools, while many firms face integration and sectoral limits that have constrained broad profit gains.