JONGROK Khmer Language Corpora

Powered by the JONGROK Multi-Agent Text Extraction System — your digital granary for freely accessible Khmer-language data.

Loading...

Our Most Used Datasets

This section provides a list of the most frequently accessed datasets by users. Explore and download these datasets for your analysis.

Loading corpora...

Khmer Language Analysis Tools

Specialized tools designed for the unique characteristics of the Khmer language.

Word Frequency Analysis

Track usage patterns of Khmer words and phrases across different time periods and text types.

Khmer Collocation Finder

Discover words that frequently appear together in Khmer texts and analyze their relationships.

Text Comparison

Compare language usage across different Khmer corpora, genres, or time periods.

Khmer Script Support

Our platform fully supports the Khmer script with specialized rendering and analysis capabilities.

  • Full Khmer Unicode support
  • Subscript consonant handling
  • Khmer-specific search algorithms
  • Diacritic-aware processing
ភាសាខ្មែរ

Sample Khmer text analysis:

ការសិក្សាភាសាខ្មែរតាមរយៈទិន្នន័យ

(Khmer language study through data)

Start Exploring Khmer Language Data

Join researchers, linguists, and language enthusiasts using MATES for Khmer corpus analysis.