Transformers

Feeds to Scour
SubscribedAll
Scoured 106 posts in 5.2 ms

Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune

 ⚙️LLM Fine-tuning  Content type: Academic
arxiv.org·

How LLMs Actually Work: A Friendly Map for Humans • oreoro

 ⚙️LLM Fine-tuning

Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens

 🤖LLM  Content type: Blog
medium.com
·

ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks

 🤖AI  Content type: Blog  Content type: Tutorial

markusheimerl/gpt: A generative pretrained transformer implementation

 🤖AI  Content type: Code
github.com··Hacker News

Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)

 🖼️Lightroom
glidemagazine.com·
Less-relevant results

Context windows in AI: why every token is a budget decision

 💬Prompt Engineering  Content type: Blog
redis.io·

know the mother tongue of your LLMs

 🤖LLM

How Confident Are AI Classifiers About Their Own Confidence?

 🤖AI  Content type: Blog

The Sequence Knowledge #874: Transformers or Not?

 💬Prompt Engineering
substackcdn.com··Substack

We Taught a Model to Speak Legalese. Here’s What Changed.

 🤖AI skills  Content type: Blog
medium.com·

Pathetic pretense

 🌌Astrophotography  Content type: Blog
freethoughtblogs.com·

MLPerf and the rise of latency-aware LLM benchmarking

 🤖LLM
edn.com·

Machine learning from scratch, what to build before using scikit-learn

 🤖AI  Content type: Tutorial
iwtlp.com··DEV

Adventurer becomes first British woman to cross Atlantic by hydrogen balloon

 Premier League  Content type: News
the-independent.com·

UR-BERT: Scaling Text Encoders for Massively Multilingual TTS Through Universal Romanization and Speech Token Prediction

 Gemini  Content type: Academic
arxiv.org·

Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed

 📸Computational Photography
phys.org·

Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model

 🤖AI  Content type: Academic
nature.com·

The Inference Alpha: Maximizing Frontier Models on AMD

 🦙Ollama  Content type: Blog
digitalocean.com·

Don't let the LLM speak, just probe it (8 minute read)

 🤖AI  Content type: Blog
blog.j11y.io·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help