original ↗
lmika.org·3d
Teaching LLM to be Persuasive: Reward-Enhanced Policy Optimization for Alignment frm Heterogeneous Rewards
arxiv.org·13h
A Computational Framework for Interpretable Text-Based Personality Assessment from Social Media
arxiv.org·1d
Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions
arxiv.org·13h
Excerpts from my neuroscience to-do list
lesswrong.com·20h
Loading...Loading more...