
📄 stripsessioncfidstest.java

📁 A web crawler; it works best when combined with Lucene
💻 JAVA
package org.archive.crawler.url.canonicalize;

import org.apache.commons.httpclient.URIException;
import org.archive.net.UURIFactory;

import junit.framework.TestCase;

/**
 * Checks that StripSessionCFIDs removes the ColdFusion session
 * parameters (CFID and CFTOKEN) from a URL's query string.
 */
public class StripSessionCFIDsTest extends TestCase {
    // URLs that carry CFID/CFTOKEN, with and without trailing parameters.
    private static final String [] INPUTS = {
        "http://a.b.c/boo?CFID=1169580&CFTOKEN=48630702" +
            "&dtstamp=22%2F08%2F2006%7C06%3A58%3A11",
        "http://a.b.c/boo?CFID=12412453&CFTOKEN=15501799" +
            "&dt=19_08_2006_22_39_28",
        "http://a.b.c/boo?CFID=14475712" +
            "&CFTOKEN=2D89F5AF-3048-2957-DA4EE4B6B13661AB" +
            "&r=468710288378&m=forgotten",
        "http://a.b.c/boo?CFID=16603925" +
            "&CFTOKEN=2AE13EEE-3048-85B0-56CEDAAB0ACA44B8",
        "http://a.b.c/boo?CFID=4308017&CFTOKEN=63914124" +
            "&requestID=200608200458360%2E39414378"
    };

    // Expected canonical form of each input, in the same order.
    private static final String [] OUTPUTS = {
        "http://a.b.c/boo?dtstamp=22%2F08%2F2006%7C06%3A58%3A11",
        "http://a.b.c/boo?dt=19_08_2006_22_39_28",
        "http://a.b.c/boo?r=468710288378&m=forgotten",
        "http://a.b.c/boo?",
        "http://a.b.c/boo?requestID=200608200458360%2E39414378"
    };

    public void testCanonicalize() throws URIException {
        for (int i = 0; i < INPUTS.length; i++) {
            String result = (new StripSessionCFIDs(INPUTS[i])).
                canonicalize(INPUTS[i], UURIFactory.getInstance(INPUTS[i]));
            // JUnit convention: expected value first, actual value second.
            assertEquals(OUTPUTS[i], result);
        }
    }
}
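The test above only shows the inputs and expected outputs, not how the stripping is done. Below is a minimal, self-contained sketch of one way to get the same results with a single case-insensitive regular expression. The class name CfidStripSketch, the method strip, and the pattern are assumptions made for illustration; this is not the StripSessionCFIDs implementation, and it skips the UURI handling that the real rule receives.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the Heritrix StripSessionCFIDs class.
class CfidStripSketch {
    // Group 1: everything before "cfid=". Then the cfid/cftoken pair,
    // which is dropped. Group 2 (optional): whatever follows the next "&".
    private static final Pattern CF_SESSION = Pattern.compile(
        "^(.+)(?:cfid=[^&]+&cftoken=[^&]+)(?:&(.*))?$",
        Pattern.CASE_INSENSITIVE);

    // Returns the URL without its CFID/CFTOKEN pair, or the URL
    // unchanged when the pair is not present.
    static String strip(String url) {
        Matcher m = CF_SESSION.matcher(url);
        if (!m.matches()) {
            return url;
        }
        String rest = m.group(2);
        return rest == null ? m.group(1) : m.group(1) + rest;
    }

    public static void main(String[] args) {
        System.out.println(
            strip("http://a.b.c/boo?CFID=12412453&CFTOKEN=15501799"
                + "&dt=19_08_2006_22_39_28"));
    }
}
```

The sketch assumes CFID comes immediately before CFTOKEN, as in every test input; a URL with the two parameters in another order or separated by other parameters is returned unchanged.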
