Leveraging Machine Learning and Large Language Models for Automated Image Clustering and Description in Legal Discovery
arxiv.org·1h

Abstract: The rapid increase in digital image creation and retention presents substantial challenges for legal discovery, digital archiving, and content management. Corporations and legal teams must organize, analyze, and extract meaningful insights from large image collections under strict time pressure, which makes manual review impractical and costly. These demands have intensified interest in automated methods that can efficiently organize and describe large-scale image datasets. This paper presents a systematic investigation of automated cluster description generation through the integration of image clustering, image captioning, and large language models (LLMs). We apply K-means clustering to group images into 20 visually coherent clu…
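
As a rough illustration of the clustering step the abstract mentions, the sketch below runs plain K-means (Lloyd's algorithm) over image feature vectors. It is a minimal sketch under stated assumptions, not the paper's implementation: the abstract does not say how image features are extracted, so the vectors are assumed to be precomputed, and the function names (`kmeans`, `squared_distance`, `mean_vector`) and parameters (`iterations`, `seed`) are hypothetical. The toy call uses `k=2` for readability; the paper reports 20 clusters.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def kmeans(features, k, iterations=50, seed=0):
    """Group feature vectors into k clusters with Lloyd's algorithm.

    `features` is assumed to hold one precomputed vector per image
    (hypothetical input; the feature extractor is not specified here).
    Returns (labels, centroids).
    """
    rng = random.Random(seed)
    # Initialize centroids from k distinct randomly chosen data points.
    centroids = [list(v) for v in rng.sample(features, k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in features
        ]
        if new_labels == labels:
            break  # Assignments stopped changing, so the run has converged.
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, label in zip(features, labels) if label == c]
            if members:  # Keep the old centroid if a cluster is empty.
                centroids[c] = mean_vector(members)
    return labels, centroids


# Toy data standing in for image embeddings: two well-separated groups.
features = [
    [0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
    [10.0, 10.0], [10.0, 11.0], [11.0, 10.0],
]
labels, centroids = kmeans(features, k=2)
```

With real image embeddings the same call would take `k=20` to mirror the cluster count in the abstract. The resulting cluster labels could then be used to group per-image captions before asking an LLM for one description per cluster, which is the integration the abstract outlines.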