Building Custom LLM Judges for AI Agent Accuracy
databricks.com·16h
🧭SMT Solvers
Flag this post
Wordle Solver
🔗Parser Combinators
Flag this post
Interpretable Heart Disease Prediction via a Weighted Ensemble Model: A Large-Scale Study with SHAP and Surrogate Decision Trees
arxiv.org·7h
🎲Probabilistic Programming
Flag this post
This is one way I use AI for coding
🧩Theorem Proving
Flag this post
Energy Loss Functions for Physical Systems
arxiv.org·7h
🎲Probabilistic Programming
Flag this post
CoCoVa: Chain of Continuous Vision-Language Thought for Latent Space Reasoning
arxiv.org·7h
🔗Parser Combinators
Flag this post
LA-MARRVEL: A Knowledge-Grounded and Language-Aware LLM Reranker for AI-MARRVEL in Rare Disease Diagnosis
arxiv.org·7h
⚖️Logic Programming
Flag this post
Aligning LLM agents with human learning and adjustment behavior: a dual agent approach
arxiv.org·1d
🎲Probabilistic Programming
Flag this post
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
arxiv.org·1d
⚖Algorithmic Game Theory
Flag this post
Deep Value Benchmark: Measuring Whether Models Generalize Deep values or Shallow Preferences
arxiv.org·7h
🔗Parser Combinators
Flag this post
Balancing Cost, Power, and AI Performance
oreilly.com·17h
🔗Parser Combinators
Flag this post
Building Your Own LLM-Powered Sports Analyst: A RAG Approach with Fine-tuning
🎲Probabilistic Programming
Flag this post
Oolong: Evaluating Long Context Reasoning and Aggregation Capabilities
arxiv.org·7h
🔗Parser Combinators
Flag this post
The Art of the Do-Over: Designing Idempotent Jobs as a Journey to Peace of Mind
🧩Theorem Proving
Flag this post
PROPEX-RAG: Enhanced GraphRAG using Prompt-Driven Prompt Execution
arxiv.org·1d
🎲Probabilistic Programming
Flag this post
Loading...Loading more...