prob.c

来自「一个C语言写的快速贝叶斯垃圾邮件过滤工具」· C语言代码 · 共 54 行

54 行

/* $Id: prob.c,v 1.11 2005/01/17 03:00:48 relson Exp $ *//*****************************************************************************NAME:   prob.c -- calculate token's spamicityAUTHORS:   David Relson <relson@osagesoftware.com>   Matthias Andree <matthias.andree@gmx.de>******************************************************************************/#include "globals.h"#include "prob.h"double calc_prob(uint good, uint bad, uint goodmsgs, uint badmsgs){    int n = good + bad;    double fw, pw;    /* http://www.linuxjournal.com/article.php?sid=6467 */    /* robs is Robinson's s parameter, the "strength of background info" */    /* robx is Robinson's x parameter, the assumed probability that     * a word we don't have enough info about will be spam */    /* n is the number of messages that contain the word w */    if (n == 0#ifdef EXTRA_DOMAIN_CHECKING	    /* we had this in place while the ignore lists caused the	     * token to have "nan" counts because score.c left the	     * message counts at zero - #ifdef'd out for speed */	    || badmsgs == 0 || goodmsgs == 0#endif	    ) {	/* in these cases, pw would be undefined and return NaN	 * we substitute "we don't know", the x parameter */	fw = robx;    } else {	/* The original version of this code has four divisions.	pw = ((bad / badmsgs) / (bad / badmsgs + good / goodmsgs));	*/	/* This modified version, with 1 division, is considerably% faster. */	pw =   bad * (double)goodmsgs	    / (bad * (double)goodmsgs + good * (double)badmsgs);	fw = (robs * robx + n * pw) / (robs + n);    }    return fw;}

prob.c - 源码说明

本页面展示了「一个C语言写的快速贝叶斯垃圾邮件过滤工具」中的 prob.c 源码文件，采用 C语言编程语言编写，共 54 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫开发者社区收录了大量与C语言相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?