Each language usually has several important features that are hidden statistically and certain redundancy. These features can be utilized to perform text compression that is suitable for the optimal use of resources. This study proposes an analysis the potency of the efficiency and compression rate of Cirebon language script using the Binary Huffman algorithm. The analysis of the potency is based on the entropy of the Cirebon language script. The study begins with an analysis of the Cirebon language script to calculate the probability of each symbol. These probabilities are used to calculate the value of entropy. The results showed that the entropy of the Cirebon language script was 3.976 bits per symbol, with an expected code length of 4.02 bits per symbol. Then, the estimated efficiency of compression with the Binary Huffman Code is 98.89% and the compression rate is 0.80402. These results can be considered in the use of Cirebon language compression for the telecommunications transmission process.
Entropy is a statistical parameter that measures how much average information is generated for each symbol in a text. Each language usually has several important features that are hidden statistically and certain redundancy. These features can be utilized to form appropriate text compression tools for optimal use of resources. This study proposes an analysis of the entropy of the Cirebon language text for text compression using the Ternary Huffman Code algorithm. This entropy value then becomes the reference level of the Cirebon script compression level. The probability of each symbol in the Cirebon Regional Text is used to calculate the entropy value. The result shows the entropy of the Cirebon language script was 2.508 bits per symbol, with an expected code length of 2.565 bits per symbol. Estimated compression efficiency with Ternary Huffman Code is 97.77% and compression rate is 0.51308.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.