Exploring the Application of the Apriori Algorithm in Knowledge Mining for Linguistic Data within Chinese Studies

Authors

  • Du Gan, PhD Candidate, Faculty of Humanities, Kasetsart University, Bangkok, 10900, Thailand.
  • Kanokporn Numtong, Associate Professor, Faculty of Humanities, Kasetsart University, Bangkok, 10900, Thailand.
  • Hao Li, PhD Candidate, Faculty of Humanities, Kasetsart University, Bangkok, 10900, Thailand.
  • Songyu Jiang, Dr., Rattanakosin International College of Creative Entrepreneurship, Rajamangala University of Technology Rattanakosin, 73170, Thailand.

Keywords

Apriori Algorithm, Linguistic Patterns, Cultural Nuances, Temporal Evolution, Computational Linguistics.

Abstract

This study applies the Apriori algorithm to analyse linguistic patterns, syntactic structures, and thematic clusters in Chinese studies data drawn from various genres. It aims to identify recurring linguistic elements in order to shed light on the dynamic nature of the Chinese language across different contexts and time periods. The Apriori algorithm is used to identify frequent itemsets and to establish associations between linguistic constructs in large datasets spanning more than 20 years. By analysing co-occurrence patterns, syntactic tendencies, and thematic categorisations, the study examines the complexity of the Chinese language, including its evolution over time, regional word choices, and cultural nuances. Thematic clusters and sensory associations are used to relate language to culture. The findings on Chinese language patterns and their cultural implications contribute data-driven evidence to computational linguistics and linguistic theory. They suggest that computational models should take cultural and historical context into account for more comprehensive language processing. The theoretical implications help researchers understand the relationship between language evolution and culture, while the practical implications can inform improvements to language technology tools. The conclusions support further research in computational linguistics, cultural studies, and theory-based holistic approaches to language analysis and application.
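
As a rough illustration of how frequent itemsets and associations between linguistic items can be mined, the following minimal Python sketch runs a textbook version of Apriori on a toy corpus. It is not the authors' implementation: the tokenised sentences, the function names `apriori` and `association_rules`, and the thresholds (minimum support 0.5, minimum confidence 0.9) are all hypothetical and chosen only for demonstration.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset meeting min_support."""
    sets = [frozenset(t) for t in transactions]
    n = len(sets)
    # Level-1 candidates: every distinct item on its own.
    candidates = {frozenset([item]) for t in sets for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Keep only the candidates whose support reaches the threshold.
        level = {}
        for cand in candidates:
            support = sum(1 for t in sets if cand <= t) / n
            if support >= min_support:
                level[cand] = support
        frequent.update(level)
        # Join frequent k-itemsets into (k+1)-candidates, then prune any
        # candidate that has an infrequent k-subset (downward closure).
        k += 1
        candidates = set()
        for a, b in combinations(list(level), 2):
            union = a | b
            if len(union) == k and all(
                frozenset(sub) in level for sub in combinations(union, k - 1)
            ):
                candidates.add(union)
    return frequent


def association_rules(frequent, min_confidence):
    """Return (antecedent, consequent, support, confidence) tuples."""
    rules = []
    for itemset, support in frequent.items():
        if len(itemset) < 2:
            continue
        for r in range(1, len(itemset)):
            for ante in combinations(itemset, r):
                ante = frozenset(ante)
                # Every subset of a frequent itemset is itself frequent,
                # so its support is already stored in `frequent`.
                confidence = support / frequent[ante]
                if confidence >= min_confidence:
                    rules.append((ante, itemset - ante, support, confidence))
    return rules


# Hypothetical toy corpus: each "transaction" is one tokenised sentence.
sentences = [
    ["春天", "花", "香"],
    ["春天", "花", "雨"],
    ["花", "香", "茶"],
    ["春天", "花", "香", "茶"],
    ["雨", "茶"],
]

frequent = apriori(sentences, min_support=0.5)
strong = association_rules(frequent, min_confidence=0.9)

for ante, cons, support, confidence in strong:
    print(sorted(ante), "->", sorted(cons), round(support, 2), round(confidence, 2))
```

In this sketch each sentence is treated as an unordered set of word tokens, so only co-occurrence is captured; modelling syntactic order or time periods would require encoding that information into the items themselves, which is a design choice left open here.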

Published

2024-05-02