Study on the Correlation between Tang Dynasty Literary Creation and Socio-Political Background Based on Big Data Analysis Techniques
DOI:
https://doi.org/10.70917/ijcisim-2026-0150Keywords:
LDA; K-Means++; TF-IDF; Apriori algorithm; Tang Dynasty literary creationAbstract
Every major change in society has a profound impact on literary creation. The rapid development of big data technology provides new ideas and methods for literary research. In this paper, LDA topic model and K-Means++ text clustering algorithm are used to mine and divide the keywords respectively. The data processing processes such as word division, stop word deletion, and text feature word extraction are performed to analyze the relevance of the text data. The study is based on the database of “Tang Dynasty Literature Chronicle Map Platform” for text mining, and the keywords mined can be roughly divided into six types: ideology and culture, social class, political events, literary style, literary themes, and literary style. Combined with the perplexity index can LDAvis visualization analysis, determine the number of themes is 2 when the difference between the themes is large. Based on Apriori algorithm for association rule analysis, Tang Dynasty literary creation has a strong correlation with political events, social class, ideology and culture.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Chi Zhang

This work is licensed under a Creative Commons Attribution 4.0 International License.