Database Research needs an Abstract Relational Query Language

View PDF HTML (experimental)

Abstract:For decades, SQL has been the default language for composing queries, but it is increasingly used as an artifact to be read and verified rather than authored. With Large Language Models (LLMs), queries are increasingly machine-generated, while humans read, validate, and debug them. This shift turns relational query languages into interfaces for back-and-forth communication about intent, which will lead to a rethinking of relational language design, and more broadly, relational interface design. We argue that this rethinking needs support from an Abstract Relational Query Language (ARQL): a semantics-first reference metalanguage that separates query intent from user…

View PDF HTML (experimental)

Abstract:For decades, SQL has been the default language for composing queries, but it is increasingly used as an artifact to be read and verified rather than authored. With Large Language Models (LLMs), queries are increasingly machine-generated, while humans read, validate, and debug them. This shift turns relational query languages into interfaces for back-and-forth communication about intent, which will lead to a rethinking of relational language design, and more broadly, relational interface design. We argue that this rethinking needs support from an Abstract Relational Query Language (ARQL): a semantics-first reference metalanguage that separates query intent from user-facing syntax and makes underlying relational patterns explicit and comparable across user-facing languages. An ARQL separates a query into (i) a relational core (the compositional structure that determines intent), (ii) modalities (alternative representations of that core tailored to different audiences), and (iii) conventions (orthogonal environment-level semantic parameters under which the core is interpreted, e.g., set vs. bag semantics, or treatment of null values). Usability for humans or machines then depends less on choosing a particular language and more on choosing an appropriate modality. Comparing languages becomes a question of which relational patterns they support and what conventions they choose. We introduce Abstract Relational Calculus (ARC), a strict generalization of Tuple Relational Calculus (TRC), as a concrete instance of ARQL. ARC comes in three modalities: (i) a comprehension-style textual notation, (ii) an Abstract Language Tree (ALT) for machine reasoning about meaning, and (iii) a diagrammatic hierarchical graph (higraph) representation for humans. ARC provides the missing vocabulary and acts as a Rosetta Stone for relational querying.


Comments:	CIDR 2026. 16th Annual Conference on Innovative Data Systems Research (CIDR ’26). January 18-21, 2026, Chaminade, USA. 16 pages, 21 figures
Subjects:	Databases (cs.DB); Logic in Computer Science (cs.LO)
Cite as:	arXiv:2512.12957 [cs.DB]
	(or arXiv:2512.12957v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2512.12957 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Wolfgang Gatterbauer [view email] [v1] Mon, 15 Dec 2025 03:44:20 UTC (322 KB)

Submission history

Similar Posts