Authors:Valentine Olanubi (1), Phineas Agar (1), Brendan Ames (2) ((1) University of Alabama, Department of Mathematics, (2) University of Southampton, School of Mathematical Sciences)
Abstract:We consider the densest submatrix problem, which seeks the submatrix of fixed size of a given binary matrix that contains the most nonzero entries. This problem is a natural generalization of fundamental problems in combinatorial optimization, e.g., the densest subgraph, maximum cli…
Authors:Valentine Olanubi (1), Phineas Agar (1), Brendan Ames (2) ((1) University of Alabama, Department of Mathematics, (2) University of Southampton, School of Mathematical Sciences)
Abstract:We consider the densest submatrix problem, which seeks the submatrix of fixed size of a given binary matrix that contains the most nonzero entries. This problem is a natural generalization of fundamental problems in combinatorial optimization, e.g., the densest subgraph, maximum clique, and maximum edge biclique problems, and has wide application the study of complex networks. Much recent research has focused on the development of sufficient conditions for exact solution of the densest submatrix problem via convex relaxation. The vast majority of these sufficient conditions establish identification of the densest submatrix within a graph containing exactly one large dense submatrix hidden by noise. The assumptions of these underlying models are not observed in real-world networks, where the data may correspond to a matrix containing many dense submatrices of varying sizes. We extend and generalize these results to the more realistic setting where the input matrix may contain \emph{many} large dense subgraphs. Specifically, we establish sufficient conditions under which we can expect to solve the densest submatrix problem in polynomial time for random input matrices sampled from a generalization of the stochastic block model. Moreover, we also provide sufficient conditions for perfect recovery under a deterministic adversarial. Numerical experiments involving randomly generated problem instances and real-world collaboration and communication networks are used empirically to verify the theoretical phase-transitions to perfect recovery given by these sufficient conditions.
| Subjects: | Optimization and Control (math.OC); Machine Learning (cs.LG) |
| Cite as: | arXiv:2601.03946 [math.OC] |
| (or arXiv:2601.03946v1 [math.OC] for this version) | |
| https://doi.org/10.48550/arXiv.2601.03946 arXiv-issued DOI via DataCite (pending registration) |
Submission history
From: Brendan Ames [view email] [v1] Wed, 7 Jan 2026 14:02:25 UTC (2,395 KB)