🧠 Language Models - jinkai_lau · Scour

Language Models: Does the brain really know what word is coming next? 🧠LLMs

elifesciences.org·3d

Fine-Tuning: the series ⚙️MLOps

GenNA: Conditional generation of nucleotide sequences guided by natural-language annotations 🧠LLMs

biorxiv.org·5d

G-Loss: Graph-Guided Fine-Tuning of Language Models 🧠LLMs

How does Reinforcement Learning Affect Models 🧠LLMs

lesswrong.com·3d

Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning ⚙️MLOps

Full Fine-Tuning ✅Dev Best Practices

·6d

Unsupervised protein language models learn patterns of enzyme function 🗄️Vector Databases

biorxiv.org·6d

Applications of the Transformer Architecture in AI-Assisted English Reading Comprehension 🧠LLMs

Language models know what matters and the foundations of ethics better than you 🧠LLMs

lesswrong.com·3d

Information Extraction from Electricity Invoices with General-Purpose Large Language Models 🧠LLMs

Three Models of RLHF Annotation: Extension, Evidence, and Authority 🧠LLMs

An Empirical Study of Methods for SFTing Opaque Reasoning Models ⚙️MLOps

lesswrong.com·6d

TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models 🧠LLMs

Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models 🧠LLMs

Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models 🧠LLMs

A Dual-Task Paradigm to Investigate Sentence Comprehension Strategies in Language Models 🧠LLMs

Analysing Lightweight Large Language Models for Biomedical Named Entity Recognition on Diverse Ouput Formats 🧠LLMs

Evaluating Temporal Consistency in Multi-Turn Language Models 🧠LLMs

PAINT: Partial-Solution Adaptive Interpolated Training for Self-Distilled Reasoners 🧠LLMs

Log in to enable infinite scrolling