JONGROK Khmer Language Corpora
Powered by the JONGROK Multi-Agent Text Extraction System — your digital granary for freely accessible Khmer-language data.
Our Most Used Datasets
These are the datasets our users access most often. Explore and download them for your own analysis.
Khmer Language Analysis Tools
Specialized tools designed for the unique characteristics of the Khmer language.
Word Frequency Analysis
Track usage patterns of Khmer words and phrases across different time periods and text types.
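As a rough illustration of what a frequency comparison across periods involves, here is a minimal sketch. It is not the platform's implementation. It assumes the text has already been segmented into words (Khmer is written without spaces between words, so segmentation is a separate step), and the `samples` data and the function name `word_frequencies` are hypothetical.

```python
from collections import Counter

def word_frequencies(tokens):
    """Return each token's frequency per 1,000 tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {word: 1000 * count / total for word, count in counts.items()}

# Hypothetical, pre-segmented samples keyed by time period.
samples = {
    "2010s": ["ភាសា", "ខ្មែរ", "ភាសា", "ទិន្នន័យ"],
    "2020s": ["ទិន្នន័យ", "ទិន្នន័យ", "ភាសា", "ខ្មែរ"],
}

# Normalized frequencies make periods of different sizes comparable.
by_period = {period: word_frequencies(tokens) for period, tokens in samples.items()}

for period, freqs in by_period.items():
    print(period, sorted(freqs.items(), key=lambda kv: -kv[1]))
```

Normalizing to a per-1,000 rate rather than using raw counts keeps the comparison meaningful when the periods or text types differ in size.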
Khmer Collocation Finder
Discover words that frequently appear together in Khmer texts and analyze their relationships.
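One common way to score how strongly two adjacent words are associated is pointwise mutual information (PMI). The sketch below is an assumption about how such a finder could work, not a description of this tool. It takes a pre-segmented token list, and `bigram_pmi` and `min_count` are illustrative names.

```python
import math
from collections import Counter

def bigram_pmi(tokens, min_count=1):
    """Score adjacent word pairs by PMI: log2(p(a,b) / (p(a) * p(b)))."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)
    scores = {}
    for (a, b), count in bigrams.items():
        if count < min_count:
            continue  # rare pairs give unreliable PMI values
        p_ab = count / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        scores[(a, b)] = math.log2(p_ab / (p_a * p_b))
    return scores

# Hypothetical pre-segmented input.
tokens = ["ភាសា", "ខ្មែរ", "និង", "ភាសា", "ខ្មែរ", "ទិន្នន័យ"]
for pair, score in sorted(bigram_pmi(tokens, min_count=2).items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 3))
```

PMI tends to overrate pairs that occur only once or twice, which is why the sketch exposes a `min_count` threshold.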
Text Comparison
Compare language usage across different Khmer corpora, genres, or time periods.
Khmer Script Support
Our platform fully supports the Khmer script with specialized rendering and analysis capabilities.
- Full Khmer Unicode support
- Subscript consonant handling
- Khmer-specific search algorithms
- Diacritic-aware processing
Sample Khmer text analysis:
ការសិក្សាភាសាខ្មែរតាមរយៈទិន្នន័យ
(Khmer language study through data)
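To make subscript-consonant and diacritic handling concrete, the sketch below splits Khmer text into orthographic clusters. In Unicode, a subscript consonant is encoded as U+17D2 (KHMER SIGN COENG) followed by the consonant. The pattern is a deliberately simplified assumption, not the platform's algorithm and not a full model of Khmer syllable structure. The names `clusters` and `strip_marks` are hypothetical.

```python
import re

COENG = "\u17D2"               # KHMER SIGN COENG: introduces a subscript consonant
BASE = "\u1780-\u17B3"         # consonants and independent vowels
MARKS = "\u17B6-\u17D1\u17DD"  # dependent vowels and diacritic signs

# A base letter, followed by any mix of subscripts (COENG + base) and marks.
CLUSTER = re.compile(f"[{BASE}](?:{COENG}[{BASE}]|[{MARKS}])*")

def clusters(text):
    """Split text into simplified orthographic clusters."""
    return CLUSTER.findall(text)

def strip_marks(text):
    """Remove dependent vowels and signs, e.g. for a loose, mark-insensitive match."""
    return re.sub(f"[{MARKS}]", "", text)

sample = "ការសិក្សាភាសាខ្មែរតាមរយៈទិន្នន័យ"
for cluster in clusters(sample):
    # Show each cluster with the code points it contains.
    print(cluster, [f"U+{ord(ch):04X}" for ch in cluster])
```

Treating the cluster, rather than the single code point, as the unit keeps a subscript consonant attached to its base letter during search and highlighting.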
Start Exploring Khmer Language Data
Join the researchers, linguists, and language enthusiasts who use the JONGROK Multi-Agent Text Extraction System (MATES) for Khmer corpus analysis.