
📄 tokenizer.h

Supervised learning methods with incremental learning. Includes several different classification algorithms.
#ifndef TOKENIZER_H
#define TOKENIZER_H

#include <stdio.h>
#include "languages.h"
#include "stemmer.h"
#include "stopword.h"

typedef struct tokenizer_ tokenizer;

tokenizer *tokenizer_new (const char *str);
tokenizer *tokenizer_alpha_new (void);
tokenizer *tokenizer_ws_new (void);
tokenizer *tokenizer_ngram_new (void);
tokenizer *tokenizer_null_new (void);

int
tokenizer_set_minmax (tokenizer *tok, int min, int max);

void
tokenizer_set_stopwords (tokenizer *tok, word_stopper *ws);

void
tokenizer_set_stemmer (tokenizer *tok, stemmer_functions *sf);

void
tokenizer_set_languages (tokenizer *tok, languages *langs);

int
tokenizer_set_language (tokenizer *tok, const char *lang);

void
tokenizer_set_text (tokenizer *tok, const char *text, int size,
		    const char *code);

const char *
tokenizer_next_token (tokenizer *tok);

int
tokenizer_save (tokenizer *tok, FILE *f);

#endif
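
The header only declares the interface, so the behavior of each function is not documented here. The following is a minimal, self-contained sketch of how a whitespace tokenizer behind these declarations might work. It is a stand-in, not the library's implementation. The struct layout, the 0 / -1 return convention of tokenizer_set_minmax, the reading of min and max as token-length bounds, the NULL return at end of input, and the ignored `code` argument are all assumptions made for illustration.

```c
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for the opaque type declared in tokenizer.h.
   The real struct layout is not shown in the header. */
typedef struct tokenizer_ tokenizer;

struct tokenizer_ {
    const char *text;   /* borrowed input buffer, not copied */
    int size;           /* number of bytes in text */
    int pos;            /* current read offset */
    int min, max;       /* assumed: accepted token length range */
    char buf[256];      /* storage for the token last returned */
};

/* Assumed behavior: allocate a whitespace tokenizer with permissive limits. */
tokenizer *tokenizer_ws_new(void)
{
    tokenizer *tok = calloc(1, sizeof *tok);
    if (tok != NULL) {
        tok->min = 1;
        tok->max = (int) sizeof tok->buf - 1;
    }
    return tok;
}

/* Assumed convention: 0 on success, -1 if the range is invalid. */
int tokenizer_set_minmax(tokenizer *tok, int min, int max)
{
    if (min < 1 || max < min || max > (int) sizeof tok->buf - 1)
        return -1;
    tok->min = min;
    tok->max = max;
    return 0;
}

/* The `code` (character set) argument is ignored in this sketch. */
void tokenizer_set_text(tokenizer *tok, const char *text, int size,
                        const char *code)
{
    (void) code;
    tok->text = text;
    tok->size = size;
    tok->pos = 0;
}

/* Return the next whitespace-delimited token whose length lies in
   [min, max], or NULL once the input is exhausted (assumed). */
const char *tokenizer_next_token(tokenizer *tok)
{
    while (tok->text != NULL && tok->pos < tok->size) {
        int start, len;

        /* Skip leading whitespace. */
        while (tok->pos < tok->size
               && isspace((unsigned char) tok->text[tok->pos]))
            tok->pos++;

        /* Scan to the end of the current word. */
        start = tok->pos;
        while (tok->pos < tok->size
               && !isspace((unsigned char) tok->text[tok->pos]))
            tok->pos++;

        len = tok->pos - start;
        if (len >= tok->min && len <= tok->max) {
            memcpy(tok->buf, tok->text + start, (size_t) len);
            tok->buf[len] = '\0';
            return tok->buf;
        }
        /* Token outside the length range: drop it and keep scanning. */
    }
    return NULL;
}
```

Under these assumptions a caller would create the tokenizer with tokenizer_ws_new, optionally restrict token lengths with tokenizer_set_minmax, hand over a buffer with tokenizer_set_text, and then call tokenizer_next_token in a loop until it returns NULL. The returned pointer refers to the tokenizer's internal buffer in this sketch, so it is only valid until the next call. The header declares no destructor, so the sketch relies on a plain free.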
