Multimodal Universe Dataset Streaming
royce.bearblog.dev·10h
Preview
Report Post

Overview

Written by Mike Smith

Astronomical datasets are growing exponentially in size and complexity, with modern surveys like JWST, LSST, and others capturing multi-wavelength observations of billions of celestial objects, creating rich multimodal datasets that are invaluable for training machine learning models (and for discovering new astronomy!). The Multimodal Universe (MMU) is a ~100TB dataset of cross-matchable astronomical objects compiled by astronomers working across different astronomical disciplines. It is one of the most comprehensive multimodal astronomical datasets available, but its size creates a significant barrier to entry: downloading and hosting 100TB locally is not feasible for many researchers.

While Hugging…

Similar Posts

Loading similar posts...