MarkGao's Feed

Claude Tag

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems. Read more ›

Covers 2 stories including Agent identity in Claude Tag: a new access model for autonomous, team-wide AI

Covered by 35 sources including The Rundown AI, 9to5Mac

Discussed on Hacker News

🤖artificial intelligence arXiv·

Data-driven Machine Learning Cannot Reach Symbolic-level Logical Reasoning -- The Limit of the Scaling Law

Sphere neural networks have achieved symbolic level syllogistic reasoning without training data, raising the question of where the limit of the scaling law for logical reasoning lies, i.e., whether data-driven machine learning systems can achieve the same level by increasing training data and training time. We show two methodological limitations that prevent supervised deep learning from reaching the symbolic-level syllogistic reasoning: (1) tra... Read more ›

🦀openclaw hackmyclaw.com·

Challenge Over

HackMyClaw is over. No one was able to crack Fiu, and the challenge became too expensive to keep running. Thanks to everyone who participated. Read more ›

Covered by 3 sources including Simon Willison’s Weblog, fernandoi.cl

🚀Startups TechCrunch·

2 days left to save up to $190: Join 1,000+ founders and investors at TechCrunch Founder Summit

2 days left to lock in your spot at TechCrunch Founder Summit 2026 and save up to $190 before Early Bird rates expire on June 26 at 11:59 p.m. PT. Register here. Read more ›

🧠LLMs GitHub·

Show HN: KV-psi, using Linux PSI to to trim an LLM KV cache

Contribute to infiniteregrets/kv-psi development by creating an account on GitHub. Read more ›

Discussed on Hacker News

🤖Agentic Engineering MIT News·

Improving the speed and energy-efficiency of AI agents

“Murakkab” is a new automated system that streamlines the design of agentic workloads for AI applications and optimizes their deployment for customers, reducing computation and cost while boosting energy efficiency. Read more ›

⚛️Quantum Computing Nature

On the robustness of topological gap detection via transport

Nature - On the robustness of topological gap detection via transport Read more ›

Covered by 4 sources including The Verge, Neowin

🤖AI Agents Product Hunt

Lyto

\"One AI agent across your browser, tools, and messages \" Discussion \| Link Read more ›

📱Content Strategy creators.facebook.com·

Re-introducing Facebook Creator Studio: Now AI-Powered

Meet the new and improved Facebook Creator Studio—now with AI—to manage content, track insights and grow your audience. Learn what's new. Read more ›

Covered by 4 sources including The Verge, TechCrunch

📰AI News TechCrunch·

Asian AI startups launch Mythos-like models as Anthropic’s export ban drags on

New models are launching in Asia that promise Mythos-like capabilities without fear of an export ban. U.S. AI labs may never recover this enormous market. Read more ›

Covers 2 stories including Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Covered by 3 sources including TNW | Artificial-Intelligence, indiehacker.news

Discussed on Hacker News

🔬Neurotech arXiv·

Average Rankings Mask Per-Subject Optimality: A Friedman-Nemenyi Benchmark of EEG Motor-Imagery BCI Decoders

Electroencephalography (EEG) is the dominant non-invasive modality for brain-computer interfaces (BCIs), yet reliable decoding of motor imagery is hampered by inter- and intra-individual variability. A recurring claim is that one decoding pipeline, most often a spatial or Riemannian method, is broadly preferable. We test the weakest version of that claim under the most favourable conditions. Using the Mother of All BCI Benchmarks (MOABB) frame... Read more ›

📈Tech Trends TechCrunch·

It’s not about Anthropic vs. OpenAI anymore

AI models have progressed to the point where their capabilities have real political consequences. Dealing with those consequences will require collective action. Read more ›

Covers What Should Be Done

₿Crypto arXiv·

CyberChainBench: Can AI Agents Secure Smart Contracts Against Real-World On-Chain Vulnerabilities?

We present CyberChainBench, a benchmark for evaluating LLM-based agents on smart contract security across three complementary tasks: vulnerability detection, exploit generation, and patch synthesis. Built from 541 real-world exploit incidents from DeFiHackLabs spanning 9 EVM chains, the benchmark provides end-to-end on-chain evaluation where agents interact with historical blockchain state through isolated evaluation environments orchestrated ... Read more ›

🎼Agent Orchestration GitHub·

Show HN: Dspyer – self-correcting, optimizable LLM steps for DSPy and LangGraph

A transpiler from stateful imperative workflows to declarative DSPy programs - theramkm/dspyer Read more ›

Discussed on Hacker News

🚀Startups investormatch.pro·

Show HN: I built a tool that matches Founders to VCs based on their pitch deck

<a href=" Read more ›

Discussed on Hacker News

🛠️Developer Tools codelabs.developers.google.com·

Show HN: Catch abnormal usage of your API keys on Google Cloud

Learn how to secure, monitor, and remediate Gemini and Google API keys. Read more ›

Covers AlterLang InterCode: A Native Intercomprehension Paradigm in Programming, Powered by GuruDev

Discussed on Hacker News

🤖artificial intelligence Nature

A hidden predictor of sudden cardiac death uncovered by deep learning

A machine-learning model trained on thousands of electrocardiogram recordings identifies a previously unrecognized group of at-risk people. A machine-learning model trained on thousands of electrocardiogram recordings identifies a previously unrecognized group of at-risk people. Read more ›

🤖Multi-Agent Systems GitHub·

Show HN: Multi Agent Protocol for AI Scientist by Hexo Labs

A multi-agent protocol pairing a tool-using Scientist with a question-only advisor — no tools, no answers, no directives — improves Kaggle test performance on 4 of 5 MLE-bench tasks - hexo-ai/socrates Read more ›

Discussed on Hacker News

✍️Prompt Engineering arXiv·

Narration-of-Thought: Inference-Time Scaffolding for Defeasible Ethical Reasoning in Large Language Models

Standard chain-of-thought on moral dilemmas exhibits two failure modes: stakeholder collapse (the trace names at most one party with a stake in the outcome) and uncertainty suppression (no explicit unknowns or hedges before committing to an action). We introduce narration-of-thought (NoT), a system prompt that structures chain-of-thought into five sections: protagonist, stakeholders, two-step consequences, uncertainty, then commitment. NoT adds ... Read more ›

🤖claude code Simon Willison’s Weblog·

Porting the Moebius 0.2B image inpainting model to run in the browser with Claude Code

This morning I saw , describing a small but effective inpainting model - a model where you can mark regions of an image to remove and the model imagines what should fill the space. The released model , but since it described itself as 0.2B I decided to try and get it running using WebGPU in a browser. TL;DR: I got it working, and you can try the demo at The finished tool Here's a video demo of the finished tool: You can open any image in it (non-square images get letterboxed), highlight areas... Read more ›

Covers 3 stories including Hugging Face – Fun chat with your own Artificial Intelligence

Covered by indiehacker.news

Discussed on Hacker News