Study on the Correlation between Tang Dynasty Literary Creation and Socio-Political Background Based on Big Data Analysis Techniques

Authors

  • Chi Zhang College of Liberal Arts, Harbin Normal University, Harbin, Heilongjiang, 150080, China

DOI:

https://doi.org/10.70917/ijcisim-2026-0150

Keywords:

LDA; K-Means++; TF-IDF; Apriori algorithm; Tang Dynasty literary creation

Abstract

Every major change in society has a profound impact on literary creation. The rapid development of big data technology provides new ideas and methods for literary research. In this paper, LDA topic model and K-Means++ text clustering algorithm are used to mine and divide the keywords respectively. The data processing processes such as word division, stop word deletion, and text feature word extraction are performed to analyze the relevance of the text data. The study is based on the database of “Tang Dynasty Literature Chronicle Map Platform” for text mining, and the keywords mined can be roughly divided into six types: ideology and culture, social class, political events, literary style, literary themes, and literary style. Combined with the perplexity index can LDAvis visualization analysis, determine the number of themes is 2 when the difference between the themes is large. Based on Apriori algorithm for association rule analysis, Tang Dynasty literary creation has a strong correlation with political events, social class, ideology and culture.

Downloads

Download data is not yet available.

Downloads

Published

2026-01-22

How to Cite

Chi Zhang. (2026). Study on the Correlation between Tang Dynasty Literary Creation and Socio-Political Background Based on Big Data Analysis Techniques. International Journal of Computer Information Systems and Industrial Management Applications, 18, 14. https://doi.org/10.70917/ijcisim-2026-0150

Issue

Section

Original Articles