**Real-time Distributed Training of Time-Series Model on Str
dev.to·2d·
Discuss: DEV
📈Time Series
Preview
Report Post

Real-time Distributed Training of Time-Series Model on Streaming Data

Objective:

Develop a distributed AI/ML system that enables real-time training of a time-series model on streaming data in a cloud-scale environment.

Constraints:

  1. The system must handle 10 million IoT devices generating 1000 bytes of sensor data per device every second. This data is streamed into the training platform via WebSockets.
  2. The model architecture is a custom-designed graph neural network (GNN) with 5 million parameters and an average training iteration time of 30 seconds on a single NVIDIA V100 GPU.
  3. The system requires real-time predictions with a latency of less than 5 seconds for new, unseen IoT data points.
  4. Training must occur in parallel on multiple AWS EC2 p3.16xlarge instances ...

Similar Posts

Loading similar posts...

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help