lz78.txt

来自「PDF文件格式解析库源代码」· 文本代码 · 共 56 行

TXT

56 行

The LZ78 is a dictionary-based compression algorithm that maintains an
explicit dictionary. 

When a symbol that not yet in the dictionary is encountered, the codeword has the
index value 0 and it is added to the dictionary. With this method, the algorithm 
gradually builds up a dictionary. The codewords output by the algorithm consist of 
two elements: an index referring to the longest matching dictionary entry and
the first non-matching symbol.

Encode Algorithm:

word <- NIL;

while (there is input)
{
	symbol <- next symbol from input;
	phrase <- word + symbol;
	
	if (phrase exists in the dictionary)
	{
		word <- phrase;
	}
	else
	{
		output (index(word), symbol);
		add phrase to the dictionary;
		word <- NIL;
	}
}

Decode Algorithm:

(index, symbol) <- read pair from input

if (index = 0)
{
	add symbol to the dictionary
	output symbol
}
else
{
	word <- look up dictionary by index
	phrase <- word + symbol
	add phrase to the dictionary
	output phrase
}


LZ78 has several weaknesses. First of all, the dictionary grows without
bounds. Various methods have been introduced to prevent this, the easiest
being to become either static once the dictionary is full or to throw
away the dictionary and start creating a new one from scratch.

The inclusion of an explicitly
coded symbol into every match may cause the next match to be worse
than it could be if it were allowed to include this symbol.

lz78.txt - 源码说明

本页面展示了「PDF文件格式解析库源代码」中的 lz78.txt 源码文件，采用文本编程语言编写，共 56 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫下载站收录了大量与文件格式相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?