🎛️ Fine-tuning - alexclaydon · Scour

The Magic Correlations: Understanding Knowledge Transfer from Pretraining to Supervised Fine-Tuning

arxiv.org·18h

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

arxiv.org·1d

🎯Vector Search

Image Classification with CNNs – Part 4: Dealing with Variations in Input

dev.to·1h·

Discuss: DEV

📄Document AI

Olmix: A framework for data mixing throughout LM development

allenai.org·6h

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

machinelearning.apple.com·23h

The 4 Parameter-Efficient Fine-Tuning Methods: How to Adapt LLMs 100× Faster

pub.towardsai.net

·3d

Presentation: Building Embedding Models for Large-Scale Real-World Applications

infoq.com

·7h

🎯Vector Search

Beyond the Prompt - Why and How to Fine-tune Your Own Models

devblogs.microsoft.com·2d

📄Document AI

Training Data from Real-World Sources

lightningrod.ai·2d

📄Document AI

The implementation for the drifting model

breno.bearblog.dev·12h

Ai’s Inner Workings Revealed By Model Trained On One Billion Data Points

quantumzeitgeist.com·1d

Scaling LLM Post-Training at Netflix

netflixtechblog.com·14h

📄Document AI

The 5 Distributed Training Methods: How to Train Models Too Large for One GPU

pub.towardsai.net

·7h

One Task at a Time, Even with AI

wakamoleguy.com·7h·

Discuss: Hacker News

Synergistic Enhancement of Requirement-to-Code Traceability: A Framework Combining Large Language Model based Data Augmentation and an Advanced Encoder

chipublib.idm.oclc.org·1d

📋Contract AI

The Role of Supervised Fine-Tuning in AI

hackernoon.com·3d

Category Theory, AI and Jobs

deadneurons.substack.com·8h·

Discuss: Substack

MiniMaxAI/MiniMax-M2.5

huggingface.co·8h·

Discuss: Hacker News, r/LocalLLaMA

📋Contract AI

Researchers propose a self-distillation fix for ‘catastrophic forgetting’ in LLMs

infoworld.com·1d

Mastering Model Adaptation: A Guide to Fine-Tuning on Google Cloud

cloud.google.com·2d

Loading more...