⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 chunktaghandleradaptertest.java

📁 一个自然语言处理的Java开源工具包。LingPipe目前已有很丰富的功能
💻 JAVA
字号:
package com.aliasi.test.unit.corpus;import com.aliasi.chunk.ChunkFactory;import com.aliasi.chunk.Chunking;import com.aliasi.chunk.ChunkingImpl;import com.aliasi.corpus.ChunkTagHandlerAdapter;import com.aliasi.test.unit.BaseTestCase;import com.aliasi.tokenizer.IndoEuropeanTokenizerFactory;import com.aliasi.tokenizer.TokenizerFactory;public class ChunkTagHandlerAdapterTest extends BaseTestCase {    static TokenizerFactory TOK_FACTORY = new IndoEuropeanTokenizerFactory();    public void testPredicates() {    assertTrue(ChunkTagHandlerAdapter.isBeginTag("B-FOO"));    assertTrue(ChunkTagHandlerAdapter           .isBeginTag(ChunkTagHandlerAdapter.BEGIN_TAG_PREFIX                   + "FOO"));    assertFalse(ChunkTagHandlerAdapter.isBeginTag("I-FOO"));    assertFalse(ChunkTagHandlerAdapter.isBeginTag("a"));    assertTrue(ChunkTagHandlerAdapter.isInTag("I-BAR"));    assertTrue(ChunkTagHandlerAdapter           .isInTag(ChunkTagHandlerAdapter.IN_TAG_PREFIX+"BAR"));    assertFalse(ChunkTagHandlerAdapter.isInTag("B-BAR"));    assertFalse(ChunkTagHandlerAdapter.isInTag("b"));    assertTrue(ChunkTagHandlerAdapter.isOutTag("O"));    assertTrue(ChunkTagHandlerAdapter           .isOutTag(ChunkTagHandlerAdapter.OUT_TAG));    assertFalse(ChunkTagHandlerAdapter.isOutTag("I-FOO"));    assertFalse(ChunkTagHandlerAdapter.isOutTag("c"));    }    public void testToChunkingBIO() {    String[] tokens = new String[0];    String[] whitespaces = new String[] { " " };    String[] tags = new String[0];    Chunking chunkingExpected = new ChunkingImpl(" ");    assertFullEquals(chunkingExpected,             ChunkTagHandlerAdapter             .toChunkingBIO(tokens,whitespaces,tags));    String[] tokens2 = new String[] { "John", "J", ".", "Smith",                     "lives", "in", "Washington", "." };    String[] whitespaces2 = new String[] { "", " ", "", " ", " ",                          " ", " ", "", "" };    String[] tags2 = { "B-PER", "I-PER", "I-PER", "I-PER",              "O", "O", "B-LOC", "O" };    ChunkingImpl chunkingExpected2         = new ChunkingImpl("John J. Smith lives in Washington.");    chunkingExpected2.add(ChunkFactory.createChunk(0,13,"PER"));    chunkingExpected2.add(ChunkFactory.createChunk(23,33,"LOC"));    assertFullEquals(chunkingExpected2,             ChunkTagHandlerAdapter             .toChunkingBIO(tokens2,whitespaces2,tags2));    }    public void testAdapter() {    CollectingChunkHandler chunkCollector = new CollectingChunkHandler();    ChunkTagHandlerAdapter adapter         = new ChunkTagHandlerAdapter(chunkCollector);        String[] tokens2 = new String[] { "John", "J", ".", "Smith",                     "lives", "in", "Washington", "." };    String[] whitespaces2 = new String[] { "", " ", "", " ", " ",                          " ", " ", "", "" };    String[] tags2 = { "B-PER", "I-PER", "I-PER", "I-PER",              "O", "O", "B-LOC", "O" };    ChunkingImpl chunkingExpected2         = new ChunkingImpl("John J. Smith lives in Washington.");    chunkingExpected2.add(ChunkFactory.createChunk(0,13,"PER"));    chunkingExpected2.add(ChunkFactory.createChunk(23,33,"LOC"));    adapter.handle(tokens2,whitespaces2,tags2);    chunkCollector.assertEquals(new Chunking[] { chunkingExpected2 });    }}

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -