A straightforward logical approach for compressing XML documents is to use the traditional, general-purpose text compression tools. 压缩XML文档的一种简单方法是使用传统的通用文本压缩(TC)工具。
Standard text compression techniques can give dramatic reductions in document size at the cost of additional processing overhead. 标准文本压缩(TC)技术可以极大减少文档的大小,代价是额外的处理开销。
In the structure part, XML tags and attributes are encoded in a dictionary-based fashion before passing the encoding to a back-end general text compression scheme. 在结构部分,在将编码传递到后端一般文本压缩(TC)方案之前,XML标记和属性以基于词典的方式进行编码。
Large-alphabet Chinese text compression using adaptive Markov model and arithmetic coder. 使用调适马可夫模型与算术编码之大字符集中文文本资料压缩。
This group of XML compressors is XML-blind, that is, they treat XML documents as usual plain text documents and thus apply the traditional text compression techniques. 这种XML压缩器是与XML无关的,也就是说,它们将XML文档用作普通的纯文本文件,因此采用的是传统的文本压缩(TC)技术。