mbanerjeepalmer's Top Finds
Learning Complementary Policies for Human-AI Teams
arxiv.org·3d
🎯recommendation systems
What's the stack for going from a fine-tune on vLLM to a simple, paid public API?
reddit.com·1d·Discuss: r/LocalLLaMA
Willpower is exhausting, use content blockers
lesswrong.com·21h
A review of MSUM's AI Innovation Summit: Day Two
lesswrong.com·22h
GrowthHacker: Automated Off-Policy Evaluation Optimization Using Code-Modifying LLM Agents
arxiv.org·3d
🎯recommendation systems
Dynamic Model Selection for Trajectory Prediction via Pairwise Ranking and Meta-Features
arxiv.org·3d
🎯recommendation systems
Vulnerability Inception: How AI Code Assistants Replicate and Amplify Security Flaws
github.com·7h·Discuss: r/LocalLLaMA
People Seem Funny In The Head About Subtle Signals
lesswrong.com·1d
Quoting Ben Stolovitz
simonwillison.net·23h
AI Safety's Berkeley Bubble and the Allies We're Not Even Trying to Recruit
lesswrong.com·1h
Enhancing Federated Learning Privacy with QUBO
arxiv.org·2d
Fine-tuning a chat model to mimic one person
reddit.com·2d·Discuss: r/LocalLLaMA
Oolong: Evaluating Long Context Reasoning and Aggregation Capabilities
arxiv.org·2d
🎯recommendation systems
LookSync: Large-Scale Visual Product Search System for AI-Generated Fashion Looks
arxiv.org·3d
🎯recommendation systems
Incremental Selection of Most-Filtering Conjectures and Proofs of the Selected Conjectures
arxiv.org·3d
🎯recommendation systems
NOMAD - Navigating Optimal Model Application to Datastreams
arxiv.org·3d
🎯recommendation systems
STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack
arxiv.org·3d
🎯recommendation systems