Transformers
Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune
🎯Fine-Tuning Content type: AcademicLazyAttention: Efficient Retrieval-Augmented Generation with Deferred Positional Encoding
🤖AI Content type: AcademicLess-relevant results