gstemmer.h

来自「一个由Mike Gashler完成的机器学习方面的includes neural」· C头文件 代码 · 共 49 行

H
49
字号
#ifndef __GSTEMMER_H__#define __GSTEMMER_H__#define GSTEMMER_MAX_WORD_SIZE 64struct stemmer;// This class just wraps the Porter Stemmer.// It finds the stems of words.  Examples:// "cats"->"cat"// "dogs"->"dog"// "fries"->"fri"// "fishes"->"fish"// "pies"->"pi"// "lovingly"->"lovingli"// "candy"->"candi"// "babies"->"babi"// "bus"->"bu"// "busses"->"buss"// "women"->"women"// "hasty"->"hasti"// "hastily"->"hastili"// "fly"->"fly"// "kisses"->"kiss"// "goes"->"goe"// "brought"->"brought"// As you can see the stems aren't always real words, but// that's okay as long as it produces the same stem for words// that have the same etymological roots. Even then it still// isn't perfect (notice it got "bus" wrong), but it should// still improve analysis somewhat in many cases.class GStemmer{protected:	struct stemmer* m_pPorterStemmer;	char m_szBuf[GSTEMMER_MAX_WORD_SIZE];public:	GStemmer();	~GStemmer();	// Pass in a word (you don't need to lowercase or null-terminate it) and this	// will return its stem. The buffer it returns is only valid until the next time	// you call GetStem.	const char* GetStem(const char* szWord, int nLen);};#endif // __GSTEMMER_H__

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?