Sorting Prompts - LLMs are not wrong you just caught them mid thought
kau.sh·19h
Proof Automation
Your LLM Won’t Stop Lying Any Time Soon
hackaday.com·10h
💻Local LLMs
Is ChatGPT-5 Able to Provide Proofs for Advanced Mathematics?
machinelearningmastery.com·4d
🎯Proof Tactics
An enough week
blog.mitrichev.ch·1d
🧮Z3 Solver
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·1d
📊Feed Optimization
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
arxiv.org·1d
🕵️Vector Smuggling
Revisiting Long-context Modeling from Context Denoising Perspective
arxiv.org·3d
🔢Denotational Semantics
How Google Translate & ChatGPT Work: The Transformer, Unboxed
dev.to·1d·Discuss: DEV
🧠Learned Codecs
A Manifesto for the Programming Desperado
github.com·20h·Discuss: Hacker News
💻Programming languages
Mitigating Judgment Preference Bias in Large Language Models through Group-Based Polling
arxiv.org·1d
💻Local LLMs
Causality Guided Representation Learning for Cross-Style Hate Speech Detection
arxiv.org·1d
🎙️Whisper
Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers
arxiv.org·1d
Automated Theorem Proving
Graph-based LLM over Semi-Structured Population Data for Dynamic Policy Response
arxiv.org·3d
💻Local LLMs
Context Length Alone Hurts LLM Performance Despite Perfect Retrieval
arxiv.org·3d
🧮Kolmogorov Complexity
HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation
arxiv.org·1d
Proof Automation
StaR-KVQA: Structured Reasoning Traces for Implicit-Knowledge Visual Question Answering
arxiv.org·2d
🧠Intelligence Compression
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Finetuning
arxiv.org·1d
🔲Cellular Automata
What is a Large Language Model (LLM)
dev.to·19h·Discuss: DEV
💻Local LLMs
Contrastive Weak-to-strong Generalization
arxiv.org·1d
Information Bottleneck
From RNNs to ChatGPT: The Paper That Changed How AI Thinks 🤖
dev.to·19h·Discuss: DEV
🎧Learned Audio