⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 domtextextractor.java

📁 随书的代码
💻 JAVA
字号:
import javax.xml.parsers.*;import org.w3c.dom.*;import org.xml.sax.SAXException;import java.io.IOException;public class DOMTextExtractor {  public void processNode(Node node) {        if (node instanceof Text) {      Text text = (Text) node;      String data = text.getData();      System.out.println(data);    }      }  // note use of recursion  public void followNode(Node node) {        processNode(node);    if (node.hasChildNodes()) {      NodeList children = node.getChildNodes();      for (int i = 0; i < children.getLength(); i++) {        followNode(children.item(i));      }     }      }  public static void main(String[] args) {    if (args.length <= 0) {      System.out.println("Usage: java DOMTextExtractor URL");      return;    }        String url = args[0];        try {      DocumentBuilderFactory factory        = DocumentBuilderFactory.newInstance();      DocumentBuilder parser = factory.newDocumentBuilder();      // If expandEntityReferences isn't turned off, there      //  won't be any entity reference nodes in the DOM tree      factory.setExpandEntityReferences(false);            // Read the document      Document document = parser.parse(url);             // Process the document      DOMTextExtractor extractor = new DOMTextExtractor();      extractor.followNode(document);    }    catch (SAXException e) {      System.out.println(url + " is not well-formed.");    }    catch (IOException e) {       System.out.println(       "Due to an IOException, the parser could not check " + url      );     }    catch (FactoryConfigurationError e) {       System.out.println("Could not locate a factory class");     }    catch (ParserConfigurationException e) {       System.out.println("Could not locate a JAXP parser");     }       } // end main}

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -