⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 textutilstest.java

📁 Heritrix是一个开源,可扩展的web爬虫项目。Heritrix设计成严格按照robots.txt文件的排除指示和META robots标签。
💻 JAVA
字号:
/* TextUtilsTest.java * * $Id: TextUtilsTest.java 5107 2007-05-01 00:21:52Z gojomo $ * * Created Tue Jan 20 14:17:59 PST 2004 * * Copyright (C) 2004 Internet Archive. * * This file is part of the Heritrix web crawler (crawler.archive.org). * * Heritrix is free software; you can redistribute it and/or modify * it under the terms of the GNU Lesser Public License as published by * the Free Software Foundation; either version 2.1 of the License, or * any later version. * * Heritrix is distributed in the hope that it will be useful, * but WITHOUT ANY WARRANTY; without even the implied warranty of * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the * GNU Lesser Public License for more details. * * You should have received a copy of the GNU Lesser Public License * along with Heritrix; if not, write to the Free Software * Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA  02111-1307  USA */package org.archive.util;import java.util.regex.Matcher;import junit.framework.Test;import junit.framework.TestCase;import junit.framework.TestSuite;/** * JUnit test suite for TextUtils *  * @author gojomo * @version $ Id$ */public class TextUtilsTest extends TestCase {    /**     * Create a new TextUtilsTest object     *      * @param testName     *            the name of the test     */    public TextUtilsTest(final String testName) {        super(testName);    }    /**     * run all the tests for TextUtilsTest     *      * @param argv     *            the command line arguments     */    public static void main(String argv[]) {        junit.textui.TestRunner.run(suite());    }    /**     * return the suite of tests for MemQueueTest     *      * @return the suite of test     */    public static Test suite() {        return new TestSuite(TextUtilsTest.class);    }    public void testMatcherRecycling() {        String pattern = "f.*";        Matcher m1 = TextUtils.getMatcher(pattern,"foo");        assertTrue("matcher against 'foo' problem", m1.matches());        TextUtils.recycleMatcher(m1);        Matcher m2 = TextUtils.getMatcher(pattern,"");        assertFalse("matcher against '' problem", m2.matches());        assertTrue("matcher not recycled",m1==m2);        // now verify proper behavior without recycling        Matcher m3 = TextUtils.getMatcher(pattern,"fuggedaboutit");        assertTrue("matcher against 'fuggedaboutit' problem",m3.matches());        assertFalse("matcher was recycled",m3==m2);    }        public void testGetFirstWord() {        final String firstWord = "one";        String tmpStr = TextUtils.getFirstWord(firstWord + " two three");        assertTrue("Failed to get first word 1 " + tmpStr,            tmpStr.equals(firstWord));        tmpStr = TextUtils.getFirstWord(firstWord);        assertTrue("Failed to get first word 2 " + tmpStr,            tmpStr.equals(firstWord));           }        public void testUnescapeHtml() {        final String abc = "abc";        CharSequence cs = TextUtils.unescapeHtml("abc");        assertEquals(cs, abc);        final String backwards = "aaa;lt&aaa";        cs = TextUtils.unescapeHtml(backwards);        assertEquals(cs, backwards);        final String ampersand = "aaa&aaa";        cs = TextUtils.unescapeHtml(ampersand);        assertEquals(cs, ampersand);        final String encodedAmpersand = "aaa&aaa";        cs = TextUtils.unescapeHtml(encodedAmpersand);        assertEquals(cs, ampersand);        final String encodedQuote = "aaa'aaa";        cs = TextUtils.unescapeHtml(encodedQuote);        assertEquals(cs, "aaa'aaa");        final String entityQuote = "aaa"aaa";        cs = TextUtils.unescapeHtml(entityQuote);        assertEquals(cs, "aaa\"aaa");    }        public void testUnescapeHtmlWithDanglingAmpersand() {        final String mixedEncodedAmpersand1 = "aaa&aaa&aaa";        CharSequence cs = TextUtils.unescapeHtml(mixedEncodedAmpersand1);        assertEquals(cs,"aaa&aaa&aaa");        final String mixedEncodedAmpersand2 = "aaa&aaa&aaa&aaa";        cs = TextUtils.unescapeHtml(mixedEncodedAmpersand2);        assertEquals(cs,"aaa&aaa&aaa&aaa");    } }

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -