How running a single EC2 with just Gunicorn silently capped my app — and what it taught me about real scaling
dev.to·2d·
Discuss: DEV
☁️Cloudflare Workers
Preview
Report Post

For a long time, my backend looked healthy.

CPU wasn’t maxed. Memory wasn’t fully used.

Yet users were getting slow responses… and deployments felt risky.

That was my first real lesson:

Infrastructure can fail you quietly before it fails loudly.

🚧 The setup that worked… until it didn’t

I was hosting a production Django app on a single EC2 instance, running behind Gunicorn.

On paper, the instance had enough RAM and decent specs.

In reality, the application never came close to using what I was paying for.

Why?

Because Gunicorn doesn’t automatically scale with your instance.

It only uses what you explicitly allow it to use.

I had:

One Gunicorn service

A small number of workers

One port

No real load balancing

Restart-based deployments

It worked fine for early traffi…

Similar Posts

Loading similar posts...

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help