⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 stopwords.java

📁 一个很不错的词频统计程序,目前只支持英文,中文的本人正在修改中.改好后上传给大家分享
💻 JAVA
📖 第 1 页 / 共 2 页
字号:
package edu.udo.cs.wvtool.external;

import java.util.Hashtable;

/**
 * Class that can test whether a given string is a stop word. Lowercases all
 * words before the test.
 * 
 * @author Eibe Frank (eibe@cs.waikato.ac.nz)
 * @version 1.0
 */
public class Stopwords {

    /** The hashtable containing the list of stopwords */
    private static Hashtable m_Stopwords = null;

    static {

        if (m_Stopwords == null) {
            m_Stopwords = new Hashtable();
            Double dummy = new Double(0);

            m_Stopwords.put("a", dummy);
            m_Stopwords.put("abaft", dummy);
            m_Stopwords.put("aboard", dummy);
            m_Stopwords.put("about", dummy);
            m_Stopwords.put("above", dummy);
            m_Stopwords.put("across", dummy);
            m_Stopwords.put("afore", dummy);
            m_Stopwords.put("aforesaid", dummy);
            m_Stopwords.put("after", dummy);
            m_Stopwords.put("again", dummy);
            m_Stopwords.put("against", dummy);
            m_Stopwords.put("agin", dummy);
            m_Stopwords.put("ago", dummy);
            m_Stopwords.put("aint", dummy);
            m_Stopwords.put("albeit", dummy);
            m_Stopwords.put("all", dummy);
            m_Stopwords.put("almost", dummy);
            m_Stopwords.put("alone", dummy);
            m_Stopwords.put("along", dummy);
            m_Stopwords.put("alongside", dummy);
            m_Stopwords.put("already", dummy);
            m_Stopwords.put("also", dummy);
            m_Stopwords.put("although", dummy);
            m_Stopwords.put("always", dummy);
            m_Stopwords.put("am", dummy);
            m_Stopwords.put("american", dummy);
            m_Stopwords.put("amid", dummy);
            m_Stopwords.put("amidst", dummy);
            m_Stopwords.put("among", dummy);
            m_Stopwords.put("amongst", dummy);
            m_Stopwords.put("an", dummy);
            m_Stopwords.put("and", dummy);
            m_Stopwords.put("anent", dummy);
            m_Stopwords.put("another", dummy);
            m_Stopwords.put("any", dummy);
            m_Stopwords.put("anybody", dummy);
            m_Stopwords.put("anyone", dummy);
            m_Stopwords.put("anything", dummy);
            m_Stopwords.put("are", dummy);
            m_Stopwords.put("aren't", dummy);
            m_Stopwords.put("around", dummy);
            m_Stopwords.put("as", dummy);
            m_Stopwords.put("aslant", dummy);
            m_Stopwords.put("astride", dummy);
            m_Stopwords.put("at", dummy);
            m_Stopwords.put("athwart", dummy);
            m_Stopwords.put("away", dummy);
            m_Stopwords.put("b", dummy);
            m_Stopwords.put("back", dummy);
            m_Stopwords.put("bar", dummy);
            m_Stopwords.put("barring", dummy);
            m_Stopwords.put("be", dummy);
            m_Stopwords.put("because", dummy);
            m_Stopwords.put("been", dummy);
            m_Stopwords.put("before", dummy);
            m_Stopwords.put("behind", dummy);
            m_Stopwords.put("being", dummy);
            m_Stopwords.put("below", dummy);
            m_Stopwords.put("beneath", dummy);
            m_Stopwords.put("beside", dummy);
            m_Stopwords.put("besides", dummy);
            m_Stopwords.put("best", dummy);
            m_Stopwords.put("better", dummy);
            m_Stopwords.put("between", dummy);
            m_Stopwords.put("betwixt", dummy);
            m_Stopwords.put("beyond", dummy);
            m_Stopwords.put("both", dummy);
            m_Stopwords.put("but", dummy);
            m_Stopwords.put("by", dummy);
            m_Stopwords.put("c", dummy);
            m_Stopwords.put("can", dummy);
            m_Stopwords.put("cannot", dummy);
            m_Stopwords.put("can't", dummy);
            m_Stopwords.put("certain", dummy);
            m_Stopwords.put("circa", dummy);
            m_Stopwords.put("close", dummy);
            m_Stopwords.put("concerning", dummy);
            m_Stopwords.put("considering", dummy);
            m_Stopwords.put("cos", dummy);
            m_Stopwords.put("could", dummy);
            m_Stopwords.put("couldn't", dummy);
            m_Stopwords.put("couldst", dummy);
            m_Stopwords.put("d", dummy);
            m_Stopwords.put("dare", dummy);
            m_Stopwords.put("dared", dummy);
            m_Stopwords.put("daren't", dummy);
            m_Stopwords.put("dares", dummy);
            m_Stopwords.put("daring", dummy);
            m_Stopwords.put("despite", dummy);
            m_Stopwords.put("did", dummy);
            m_Stopwords.put("didn't", dummy);
            m_Stopwords.put("different", dummy);
            m_Stopwords.put("directly", dummy);
            m_Stopwords.put("do", dummy);
            m_Stopwords.put("does", dummy);
            m_Stopwords.put("doesn't", dummy);
            m_Stopwords.put("doing", dummy);
            m_Stopwords.put("done", dummy);
            m_Stopwords.put("don't", dummy);
            m_Stopwords.put("dost", dummy);
            m_Stopwords.put("doth", dummy);
            m_Stopwords.put("down", dummy);
            m_Stopwords.put("during", dummy);
            m_Stopwords.put("durst", dummy);
            m_Stopwords.put("e", dummy);
            m_Stopwords.put("each", dummy);
            m_Stopwords.put("early", dummy);
            m_Stopwords.put("either", dummy);
            m_Stopwords.put("em", dummy);
            m_Stopwords.put("english", dummy);
            m_Stopwords.put("enough", dummy);
            m_Stopwords.put("ere", dummy);
            m_Stopwords.put("even", dummy);
            m_Stopwords.put("ever", dummy);
            m_Stopwords.put("every", dummy);
            m_Stopwords.put("everybody", dummy);
            m_Stopwords.put("everyone", dummy);
            m_Stopwords.put("everything", dummy);
            m_Stopwords.put("except", dummy);
            m_Stopwords.put("excepting", dummy);
            m_Stopwords.put("f", dummy);
            m_Stopwords.put("failing", dummy);
            m_Stopwords.put("far", dummy);
            m_Stopwords.put("few", dummy);
            m_Stopwords.put("first", dummy);
            m_Stopwords.put("five", dummy);
            m_Stopwords.put("following", dummy);
            m_Stopwords.put("for", dummy);
            m_Stopwords.put("four", dummy);
            m_Stopwords.put("from", dummy);
            m_Stopwords.put("g", dummy);
            m_Stopwords.put("gonna", dummy);
            m_Stopwords.put("gotta", dummy);
            m_Stopwords.put("h", dummy);
            m_Stopwords.put("had", dummy);
            m_Stopwords.put("hadn't", dummy);
            m_Stopwords.put("hard", dummy);
            m_Stopwords.put("has", dummy);
            m_Stopwords.put("hasn't", dummy);
            m_Stopwords.put("hast", dummy);
            m_Stopwords.put("hath", dummy);
            m_Stopwords.put("have", dummy);
            m_Stopwords.put("haven't", dummy);
            m_Stopwords.put("having", dummy);
            m_Stopwords.put("he", dummy);
            m_Stopwords.put("he'd", dummy);
            m_Stopwords.put("he'll", dummy);
            m_Stopwords.put("her", dummy);
            m_Stopwords.put("here", dummy);
            m_Stopwords.put("here's", dummy);
            m_Stopwords.put("hers", dummy);
            m_Stopwords.put("herself", dummy);
            m_Stopwords.put("he's", dummy);
            m_Stopwords.put("high", dummy);
            m_Stopwords.put("him", dummy);
            m_Stopwords.put("himself", dummy);
            m_Stopwords.put("his", dummy);
            m_Stopwords.put("home", dummy);
            m_Stopwords.put("how", dummy);
            m_Stopwords.put("howbeit", dummy);
            m_Stopwords.put("however", dummy);
            m_Stopwords.put("how's", dummy);
            m_Stopwords.put("i", dummy);
            m_Stopwords.put("id", dummy);
            m_Stopwords.put("if", dummy);
            m_Stopwords.put("ill", dummy);
            m_Stopwords.put("i'm", dummy);
            m_Stopwords.put("immediately", dummy);
            m_Stopwords.put("important", dummy);
            m_Stopwords.put("in", dummy);
            m_Stopwords.put("inside", dummy);
            m_Stopwords.put("instantly", dummy);
            m_Stopwords.put("into", dummy);
            m_Stopwords.put("is", dummy);
            m_Stopwords.put("isn't", dummy);
            m_Stopwords.put("it", dummy);
            m_Stopwords.put("it'll", dummy);
            m_Stopwords.put("it's", dummy);
            m_Stopwords.put("its", dummy);
            m_Stopwords.put("itself", dummy);
            m_Stopwords.put("i've", dummy);
            m_Stopwords.put("j", dummy);
            m_Stopwords.put("just", dummy);
            m_Stopwords.put("k", dummy);
            m_Stopwords.put("l", dummy);
            m_Stopwords.put("large", dummy);
            m_Stopwords.put("last", dummy);
            m_Stopwords.put("later", dummy);
            m_Stopwords.put("least", dummy);
            m_Stopwords.put("left", dummy);
            m_Stopwords.put("less", dummy);
            m_Stopwords.put("lest", dummy);
            m_Stopwords.put("let's", dummy);
            m_Stopwords.put("like", dummy);
            m_Stopwords.put("likewise", dummy);
            m_Stopwords.put("little", dummy);
            m_Stopwords.put("living", dummy);
            m_Stopwords.put("long", dummy);
            m_Stopwords.put("m", dummy);
            m_Stopwords.put("many", dummy);
            m_Stopwords.put("may", dummy);
            m_Stopwords.put("mayn't", dummy);
            m_Stopwords.put("me", dummy);
            m_Stopwords.put("mid", dummy);
            m_Stopwords.put("midst", dummy);
            m_Stopwords.put("might", dummy);
            m_Stopwords.put("mightn't", dummy);
            m_Stopwords.put("mine", dummy);
            m_Stopwords.put("minus", dummy);
            m_Stopwords.put("more", dummy);
            m_Stopwords.put("most", dummy);
            m_Stopwords.put("much", dummy);
            m_Stopwords.put("must", dummy);
            m_Stopwords.put("mustn't", dummy);
            m_Stopwords.put("my", dummy);
            m_Stopwords.put("myself", dummy);
            m_Stopwords.put("n", dummy);
            m_Stopwords.put("near", dummy);
            m_Stopwords.put("'neath", dummy);
            m_Stopwords.put("need", dummy);
            m_Stopwords.put("needed", dummy);

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -