Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
🤖 Transformers
Specific
Attention Mechanism, BERT, GPT, Neural Architecture
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
106
posts in
7.3
ms
Instruction
Finetuning
DeepSeek-R1-8B
Model
Using LoRA and NEFTune
⚙️
LLM Fine-tuning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Instruction Finetuning DeepSeek-R1-8B Model Using LoRA and NEFTune
How LLMs Actually Work: A Friendly Map for Humans • oreoro
⚙️
LLM Fine-tuning
oreoro.github.io
·
5d
5 days ago
·
Hacker News
Actions for How LLMs Actually Work: A Friendly Map for Humans • oreoro
Your
LLM
Isn’t Reading Your Manners — It’s Counting Your Tokens
🤖
LLM
Content type:
Blog
medium.com
·
13h
13 hours ago
Actions for Your LLM Isn’t Reading Your Manners — It’s Counting Your Tokens
ELI5 is a terrible
learning
prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
🤖
AI
Content type:
Blog
Content type:
Tutorial
appliedaihub.org
·
1d
1 day ago
·
r/PromptEngineering
Actions for ELI5 is a terrible learning prompt, here's the structural reason it fails and a 4-level replacement that actually sticks
markusheimerl/gpt
: A generative pretrained
transformer
implementation
🤖
AI
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for markusheimerl/gpt: A generative pretrained transformer implementation
Visual Artist and Percussionist Bob
Bert
(Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
🖼️
Lightroom
glidemagazine.com
·
1d
1 day ago
Actions for Visual Artist and Percussionist Bob Bert (Sonic Youth, Pussy Galore) Talks Experimenting With Sounds on Debut Solo Album ‘Beach Bongo Bloodbath’ (INTERVIEW)
Less-relevant results
Context windows in AI: why every token is a budget decision
💬
Prompt Engineering
Content type:
Blog
redis.io
·
21h
21 hours ago
Actions for Context windows in AI: why every token is a budget decision
know the mother tongue of your LLMs
🤖
LLM
mothertoken.inigoimaz.com
·
2d
2 days ago
·
Hacker News
Actions for know the mother tongue of your LLMs
How Confident Are AI Classifiers About Their Own Confidence?
🤖
AI
Content type:
Blog
gmcirco.github.io
·
3d
3 days ago
·
Hacker News
Actions for How Confident Are AI Classifiers About Their Own Confidence?
The Sequence Knowledge #874:
Transformers
or Not?
💬
Prompt Engineering
substackcdn.com
·
2d
2 days ago
·
Substack
Actions for The Sequence Knowledge #874: Transformers or Not?
We Taught a
Model
to Speak Legalese. Here’s What Changed.
🤖
AI skills
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for We Taught a Model to Speak Legalese. Here’s What Changed.
Pathetic pretense
🌌
Astrophotography
Content type:
Blog
freethoughtblogs.com
·
2d
2 days ago
Actions for Pathetic pretense
MLPerf and the rise of latency-aware
LLM
benchmarking
🤖
LLM
edn.com
·
6d
6 days ago
Actions for MLPerf and the rise of latency-aware LLM benchmarking
Machine
learning
from scratch, what to build before using scikit-learn
🤖
AI
Content type:
Tutorial
iwtlp.com
·
1d
1 day ago
·
DEV
Actions for Machine learning from scratch, what to build before using scikit-learn
Adventurer becomes first British woman to cross Atlantic by hydrogen balloon
⚽
Premier League
Content type:
News
the-independent.com
·
3d
3 days ago
Actions for Adventurer becomes first British woman to cross Atlantic by hydrogen balloon
UR-BERT
: Scaling Text
Encoders
for Massively
Multilingual
TTS Through Universal Romanization and Speech Token Prediction
✨
Gemini
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for UR-BERT: Scaling Text Encoders for Massively Multilingual TTS Through Universal Romanization and Speech Token Prediction
The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again
⚙️
LLM Fine-tuning
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for The Memory Problem is Solved: How Google’s Memory Caching Makes RNNs Smart Again
Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented
CNN-transformer
model
🤖
AI
Content type:
Academic
nature.com
·
6d
6 days ago
Actions for Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model
Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed
📸
Computational Photography
phys.org
·
2d
2 days ago
Actions for Breaking tunnel vision, imaging AI lifts fluorescence image restoration accuracy and speed
The Inference Alpha: Maximizing Frontier
Models
on AMD
🦙
Ollama
Content type:
Blog
digitalocean.com
·
1d
1 day ago
Actions for The Inference Alpha: Maximizing Frontier Models on AMD
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help