textextractionutility

来自「随书的代码」· 代码 · 共 30 行

TXT
30
字号
import org.w3c.dom.*;import org.w3c.dom.traversal.*;public class TextExtractor {  public static String getText(Node node) {        if (node == null) return "";        // Set up the iterator    Document doc = node.getOwnerDocument();    DocumentTraversal traversable = (DocumentTraversal) doc;    int whatToShow      = NodeFilter.SHOW_TEXT | NodeFilter.SHOW_CDATA_SECTION;    NodeIterator iterator = traversable.createNodeIterator(node,      whatToShow, null, true);         // Extract the text    StringBuffer result = new StringBuffer();    Node current;    while ((current = iterator.nextNode()) != null) {      result.append(current.getNodeValue());    }    return result.toString();      }}

⌨️ 快捷键说明

复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?