⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 standardanalyzertest2.java

📁 通过JavaCC构建Lucene标准分析器、过滤器。其中有“停止词”分析器
💻 JAVA
字号:
package ch12;

import java.io.IOException;
import java.io.StringReader;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.SimpleAnalyzer;
import org.apache.lucene.analysis.StopAnalyzer;
import org.apache.lucene.analysis.Token;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.standard.StandardAnalyzer;

public class StandardAnalyzerTest2 {

	/**
	 * @param args
	 */

	public StandardAnalyzerTest2() {
	}

	public static void main(String[] args) {
		// TODO Auto-generated method stub

		Analyzer a = new StandardAnalyzer();

//		StringReader sr = new StringReader("123,456" + "CNN"
//				+ "abc123@cba.com.cn" + "Microsoft" + "I		think ..."
//				+ "192.168.0.1" + "中国驰名商标");

//		 StringReader sr = new StringReader("123,456");

//		 StringReader sr = new StringReader("CNN"+":"+"Cable News Network");

//		 StringReader sr = new StringReader("abc123@cba.com.cn");

//		 StringReader sr = new StringReader("I think ...,");

		 StringReader sr = new StringReader("192.168.0.1");

		// StringReader sr = new StringReader("中国驰名商标");

		TokenStream ts = a.tokenStream(sr);

		try {
			int i = 0;
			Token t = ts.next();
			while (t != null) {
				i++;
				System.out.println("Line" + i + ":" + t.termText());
				t = ts.next();
			}
		} catch (IOException e) {
			// TODO Auto-generated catch block
			e.printStackTrace();
		}
	}
}

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -