首页 › 资源下载 › Java编程 › 随书的代码 › 源码查看

domtextextractor.java

来自「随书的代码」· Java 代码 · 共 75 行

JAVA

75 行

import javax.xml.parsers.*;import org.w3c.dom.*;import org.xml.sax.SAXException;import java.io.IOException;public class DOMTextExtractor {  public void processNode(Node node) {        if (node instanceof Text) {      Text text = (Text) node;      String data = text.getData();      System.out.println(data);    }      }  // note use of recursion  public void followNode(Node node) {        processNode(node);    if (node.hasChildNodes()) {      NodeList children = node.getChildNodes();      for (int i = 0; i < children.getLength(); i++) {        followNode(children.item(i));      }     }      }  public static void main(String[] args) {    if (args.length <= 0) {      System.out.println("Usage: java DOMTextExtractor URL");      return;    }        String url = args[0];        try {      DocumentBuilderFactory factory        = DocumentBuilderFactory.newInstance();      DocumentBuilder parser = factory.newDocumentBuilder();      // If expandEntityReferences isn't turned off, there      //  won't be any entity reference nodes in the DOM tree      factory.setExpandEntityReferences(false);            // Read the document      Document document = parser.parse(url);             // Process the document      DOMTextExtractor extractor = new DOMTextExtractor();      extractor.followNode(document);    }    catch (SAXException e) {      System.out.println(url + " is not well-formed.");    }    catch (IOException e) {       System.out.println(       "Due to an IOException, the parser could not check " + url      );     }    catch (FactoryConfigurationError e) {       System.out.println("Could not locate a factory class");     }    catch (ParserConfigurationException e) {       System.out.println("Could not locate a JAXP parser");     }       } // end main}

domtextextractor.java - 源码说明

本页面展示了「随书的代码」中的 domtextextractor.java 源码文件，采用 Java 编程语言编写，共 75 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫开发者社区收录了大量与Java相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?