
📄 word.h

📁 Python-based Chinese word segmentation program
#ifndef _WORD_H_
#define _WORD_H_

#include <climits>
#include <cstring>

#include "memory.h"

namespace rmmseg
{
    const int word_embed_len = 4; /* at least 1 char (3 bytes+'\0') */

    struct Word
    {
        unsigned char   nbytes;   /* number of bytes */
        char            length;   /* number of characters */
        unsigned short  freq;
        char            text[word_embed_len];
    };

    /**
     * text: the text of the word.
     * length: number of characters (not bytes).
     * freq: the frequency of the word.
     * nbytes: number of bytes in text; -1 means compute it with strlen.
     */
    inline Word *make_word(const char *text, int length=1,
                           int freq=0, int nbytes=-1)
    {
        if (freq > USHRT_MAX)
            freq = USHRT_MAX;   /* avoid overflow */
        if (nbytes == -1)
            nbytes = static_cast<int>(std::strlen(text));

        /* text[] is over-allocated so the whole string plus '\0' fits */
        Word *w = static_cast<Word *>(pool_alloc(sizeof(Word)
                                                 + nbytes+1
                                                 - word_embed_len));
        /* store the byte count actually copied, not strlen(text), so an
           explicit nbytes stays consistent with the text stored below */
        w->nbytes = static_cast<unsigned char>(nbytes);
        w->length = length;
        w->freq = freq;
        std::strncpy(w->text, text, nbytes);
        w->text[nbytes] = '\0';
        return w;
    }
}

#endif /* _WORD_H_ */
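
The following is a minimal, self-contained sketch of how make_word might be called. It is not part of the original header. It assumes that pool_alloc (declared in "memory.h", which is not shown here) behaves like a plain allocator, so a hypothetical malloc-backed stand-in is used instead; the sketch namespace and the demo function are likewise illustrative names. The example stores a two-character UTF-8 word (6 bytes) and passes a frequency above USHRT_MAX to show the clamping.

```cpp
#include <cassert>
#include <climits>
#include <cstddef>
#include <cstdlib>
#include <cstring>

namespace sketch
{
    const int word_embed_len = 4;

    struct Word
    {
        unsigned char   nbytes;   /* number of bytes */
        char            length;   /* number of characters */
        unsigned short  freq;
        char            text[word_embed_len];
    };

    /* Hypothetical stand-in for pool_alloc from "memory.h" (an assumption). */
    inline void *pool_alloc(std::size_t len)
    {
        return std::malloc(len);
    }

    inline Word *make_word(const char *text, int length=1,
                           int freq=0, int nbytes=-1)
    {
        if (freq > USHRT_MAX)
            freq = USHRT_MAX;   /* clamp so it fits in unsigned short */
        if (nbytes == -1)
            nbytes = static_cast<int>(std::strlen(text));
        /* allocate the struct plus the extra bytes text[] needs */
        Word *w = static_cast<Word *>(pool_alloc(sizeof(Word)
                                                 + nbytes+1
                                                 - word_embed_len));
        w->nbytes = static_cast<unsigned char>(nbytes);
        w->length = static_cast<char>(length);
        w->freq = static_cast<unsigned short>(freq);
        std::memcpy(w->text, text, nbytes);
        w->text[nbytes] = '\0';
        return w;
    }

    /* Builds one word, checks its fields, and returns 0 on success. */
    inline int demo()
    {
        const char *text = "\xe4\xb8\xad\xe6\x96\x87"; /* UTF-8, 2 chars, 6 bytes */
        Word *w = make_word(text, 2, 100000);
        int ok = w->nbytes == 6
              && w->length == 2
              && w->freq == USHRT_MAX
              && std::strcmp(w->text, text) == 0;
        std::free(w);   /* valid only because the stand-in uses malloc */
        return ok ? 0 : 1;
    }
}
```

Because text is declared with word_embed_len bytes but allocated with nbytes+1, words longer than 3 bytes extend past the declared array into the extra space requested from the allocator. With the real pool allocator, individual words would presumably not be freed one by one as in this sketch.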
