💬 Large Language Models - potatoice · Scour

In-Context Function Learning in Large Language Models

arxiv.org·22h

🤖Software Engineering with AI

BalatroBench Benchmarks Large Language Models Playing Balatro

balatrobench.com·16h·Discuss: Hacker News

🤖Software Engineering with AI

Large Language Models for Mortals book

andrewpwheeler.com·2d

🤖Software Engineering with AI

Manifold-Aware Temporal Domain Generalization for Large Language Models

arxiv.org·22h

🧬Computational Neuroscience

Presentation: Building Embedding Models for Large-Scale Real-World Applications

infoq.com·11h

🤖Software Engineering with AI

A History of Large Language Models

gregorygundersen.com·2d

🧬Computational Neuroscience

Data Engineering for Large Models: Architecture, Algorithms & Projects

github.com·1h

🤖Software Engineering with AI

Recursive Language Models: Stop Stuffing the Context Window

nlp.elvissaravia.com·1d

🤖Software Engineering with AI

Gibbs Measures from Deep Shaped Multilayer Perceptrons

link.aps.org·1d

🧬Computational Neuroscience

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

machinelearning.apple.com·1d

🤖Software Engineering with AI

Larger AI Models Are Not Always Better At Remembering Facts, Research Reveals

quantumzeitgeist.com·1d

🧬Computational Neuroscience

facebookresearch/MUSE: A library for Multilingual Unsupervised or Supervised word Embeddings

github.com·10h

🤖Software Engineering with AI

Building an AI Proposal Pipeline: From Call Transcripts to Branded Web Pages with Python and Supabase

dev.to·2h·Discuss: DEV

🤖Software Engineering with AI

Multimodal Large Language Models: Architectures, Training, and Real-World Applications

pub.towardsai.net·5d

🧬Computational Neuroscience

A Survey on Federated Fine-Tuning of Large Language Models

openreview.net·4h·Discuss: Hacker News

🤖Software Engineering with AI

Addendum: Data splitting against information leakage with DataSAIL

nature.com·14h

🧬Computational Neuroscience

Ai’s Inner Workings Revealed By Model Trained On One Billion Data Points

quantumzeitgeist.com·1d

🧬Computational Neuroscience

How AI Generates Brand Names: The Real Pipeline

dev.to·1d·Discuss: DEV

🤖Software Engineering with AI

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

huggingface.co·1d·Discuss: Hacker News

🤖Software Engineering with AI

Olmix: A framework for data mixing throughout LM development

allenai.org·11h

🤖Software Engineering with AI
