multitextextractor.java

来自「随书的代码」· Java 代码 · 共 60 行

JAVA
60
字号
import org.xml.sax.*;import java.util.List;public class MultiTextExtractor implements ContentHandler {  private List documents;    // This field is deliberately not initialized in the  // constructor. It is initialized for each document parsed, not  // for each object constructed.  private StringBuffer currentDocument;    public MultiTextExtractor(List documents) {        if (documents == null) {      throw new NullPointerException(       "Documents list must be non-null");    }    this.documents = documents;     }  // Initialize the per-document data structures  public void startDocument() {        currentDocument = new StringBuffer();      }    // Flush and commit the per-document data structures  public void endDocument() {        String text = currentDocument.toString();    documents.add(text);      }      // Update the per-document data structures  public void characters(char[] text, int start, int length) {    currentDocument.append(text, start, length);         }        // do-nothing methods  public void setDocumentLocator(Locator locator) {}  public void startPrefixMapping(String prefix, String uri) {}  public void endPrefixMapping(String prefix) {}  public void startElement(String namespaceURI, String localName,   String qualifiedName, Attributes atts) {}  public void endElement(String namespaceURI, String localName,   String qualifiedName) {}  public void ignorableWhitespace(char[] text, int start,    int length) {}  public void processingInstruction(String target,    String data) {}  public void skippedEntity(String name) {}}

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?