Abstract: This study aims to establish a unified, systematic, and referable knowledge framework for annotating art image datasets, addressing the ambiguous definitions and inconsistent results that arise when the annotation process lacks common standards. To this end, we construct a hierarchical, systematic art image knowledge graph. The graph is built on the composition principles of art images and incorporates the Structured Theory of Visual Knowledge proposed by Academician Yunhe Pan in On Visual Knowledge, which states that visual knowledge must express spatial forms and dynamic relationships precisely through “prototype-category” and “hierarchical structure”. It draws on an in-depth review of Chinese and Western art theories and a pioneering integration of the Chinese cultural perspective. The knowledge graph deconstructs the core visual language of art images; the unique spatial theory and symbolic system of Chinese painting are compared with, and supplemented by, Western art theories. The graph converts qualitative artistic concepts into a clear, structured framework. It conforms to the human cognitive law that “visual knowledge takes precedence over verbal knowledge” and provides an interpretable visual knowledge foundation, one that supports inference, for AI art generation and cross-cultural art analysis. It ensures the high quality and consistency of annotated data, offering key support for art intelligence research in the AI 2.0 era.
| Comments: | 24 pages, in Chinese language |
| Subjects: | Human-Computer Interaction (cs.HC) |
| MSC classes: | 68T07 |
| ACM classes: | H.5.2; J.5 |
| Cite as: | arXiv:2511.03585 [cs.HC] |
| | (or arXiv:2511.03585v1 [cs.HC] for this version) |
| | https://doi.org/10.48550/arXiv.2511.03585 (arXiv-issued DOI via DataCite, pending registration) |
Submission history
From: Zejian Li
[v1] Wed, 5 Nov 2025 16:04:54 UTC (977 KB)