🧠 LLM Training - calleum · Scour

Data-Constrained Language Model Pretraining: Improved Regularization and Scaling Laws

⚡LLM Inference Academic

Less-relevant results

ju4nv1e1r4/nlp_engine_inference: An inference engine for NLP models.

⚡LLM Inference Code

github.com · r/rust

Reinventing Entropy

⚙️Systems Programming News Blog

3blue1brown.substack.com · Substack

Mythograph Atelier #1 - Abstract Art That Means Something to You

🕸️axum Blog

huggingface.co

AI Chart Understanding Breakthrough: MIT-IBM Dataset Lets Small Models Beat GPT-4o

⚡LLM Inference

techtimes.com

Fisher-Guided Progressive Parameter Selection for Adaptive Fine-Tuning

⚡LLM Inference Academic

The crash that vanished: control and emergence in a five-model economy

🦀Rust Blog

huggingface.co

Multi-Hop Knowledge Composition is Bound by Pretraining Exposure

⚡LLM Inference Academic

If Claude Fable stops helping you, you'll never know

⚡LLM Inference Blog

jonready.com · Lobsters, Hacker News

Amazing Digital Dentures (a failed project)

📝Long-form Tech Essays Blog

huggingface.co

Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation Model Pretraining

⚡LLM Inference Academic

Room360: Video-to-3D Spatial Reconstruction Platform

🖥️Self-Hosting Blog

huggingface.co

Benchmarking Empirical Privacy Protection for Adaptations of Large Language Models

⚡LLM Inference Academic

Advancing the State-of-the-Art in Empirical Privacy Auditing

λType Theory Academic

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

⚡LLM Inference Blog

huggingface.co

In-Context Learning for Latent Space Bayesian Optimization

⚡LLM Inference Academic

Unifying Local Communications and Local Updates for LLM Pretraining

⚡LLM Inference Academic

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

⚡LLM Inference Blog

huggingface.co · Hacker News

A Unifying Lens on Reward Uncertainty in RLHF

⚡LLM Inference Academic

A Controlled Audit of Pretraining Contamination in Public Medical Vision-Language Benchmarks

📄CS Papers Academic
