RL Learning with LoRA: A Diverse Deep Dive
kalomaze.bearblog.dev·2d
💻Software Engineering
2 Years of ML vs. 1 Month of Prompting
levs.fyi·1d
💻Software Engineering
Self play and autocurricula in the age of agents
amplifypartners.com·5h
💻Software Engineering
Right-sized AI: Good for business, users, and the planet
web.dev·13h
💻Software Engineering
Show HN: Charl – ML language with native tensors and autograd
charlbase.org·16h
💪Fitness
Andrej Karpathy on LLM cognitive deficits
lesswrong.com·8h
💻Software Engineering
Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models
arxiv.org·49m
💪Fitness
Inclusion of Role into Named Entity Recognition and Ranking
arxiv.org·49m
💻Software Engineering
Generalization in Representation Models via Random Matrix Theory: Application to Recurrent Networks
arxiv.org·1d
💻Software Engineering
DiagnoLLM: A Hybrid Bayesian Neural Language Framework for Interpretable Disease Diagnosis
arxiv.org·49m
💻Software Engineering
A Diffusion Model to Shrink Proteins While Maintaining Their Function
arxiv.org·49m
💪Fitness
TriShGAN: Enhancing Sparsity and Robustness in Multivariate Time Series Counterfactuals Explanation
arxiv.org·49m
💪Fitness
Improving Region Representation Learning from Urban Imagery with Noisy Long-Caption Supervision
arxiv.org·49m
💪Fitness
Time-Warping Control: Taming Complex Systems with AI
dev.to·2h
💻Software Engineering
A two-stage semi-supervised domain generalization network for fault diagnosis under unknown working conditions
sciencedirect.com·1d
💻Software Engineering
Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs
arxiv.org·49m
💻Software Engineering
Annotation-Efficient Universal Honesty Alignment
paperium.net·20h
💻Software Engineering
Explicit Knowledge-Guided In-Context Learning for Early Detection of Alzheimer's Disease
arxiv.org·49m
💪Fitness
TYrPPG: Uncomplicated and Enhanced Learning Capability rPPG for Remote Heart Rate Estimation
arxiv.org·49m
💪Fitness