cdb_classifier.py

来自「用python实现的邮件过滤器」· Python 代码 · 共 28 行

PY
28
字号
"""A classifier that uses a CDB database.A CDB wordinfo database is quite small and fast but is slow to update.It is appropriate if training is done rarely (e.g. monthly or weekly usingarchived ham and spam).  See mailsort.py for an example application thatuses this classifier."""from spambayes import cdbfrom spambayes.tokenizer import tokenizefrom spambayes.classifier import Classifierclass CdbClassifier(Classifier):    def __init__(self, cdbfile=None):        Classifier.__init__(self)        if cdbfile is not None:            self.wordinfo = cdb.Cdb(cdbfile)    def probability(self, record):        return float(record)    def save_wordinfo(self, db_file):        items = []        for word, record in self.wordinfo.iteritems():            prob = Classifier.probability(self, record)            items.append((word, str(prob)))        cdb.cdb_make(db_file, items)

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?