⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 koi8_u.py

📁 mallet是自然语言处理、机器学习领域的一个开源项目。
💻 PY
字号:
""" Python Character Mapping Codec for KOI8U.    This character scheme is compliant to RFC2319Written by Marc-Andre Lemburg (mal@lemburg.com).Modified by Maxim Dzumanenko <mvd@mylinux.com.ua>.(c) Copyright 2002, Python Software Foundation."""#"import codecs, koi8_r### Codec APIsclass Codec(codecs.Codec):    def encode(self,input,errors='strict'):        return codecs.charmap_encode(input,errors,encoding_map)    def decode(self,input,errors='strict'):        return codecs.charmap_decode(input,errors,decoding_map)class StreamWriter(Codec,codecs.StreamWriter):    passclass StreamReader(Codec,codecs.StreamReader):    pass### encodings module APIdef getregentry():    return (Codec().encode,Codec().decode,StreamReader,StreamWriter)### Decoding Mapdecoding_map = koi8_r.decoding_map.copy()decoding_map.update({        0x00a4: 0x0454, #       CYRILLIC SMALL LETTER UKRAINIAN IE        0x00a6: 0x0456, #       CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I        0x00a7: 0x0457, #       CYRILLIC SMALL LETTER YI (UKRAINIAN)        0x00ad: 0x0491, #       CYRILLIC SMALL LETTER UKRAINIAN GHE WITH UPTURN        0x00b4: 0x0403, #       CYRILLIC CAPITAL LETTER UKRAINIAN IE        0x00b6: 0x0406, #       CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I        0x00b7: 0x0407, #       CYRILLIC CAPITAL LETTER YI (UKRAINIAN)        0x00bd: 0x0490, #       CYRILLIC CAPITAL LETTER UKRAINIAN GHE WITH UPTURN})### Encoding Mapencoding_map = codecs.make_encoding_map(decoding_map)

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -