📄 changes.txt
bin/parser.cmd, bin/sitecapturer, bin/sitecapturer.cmd, bin/stringextractor.bat, bin/stringextractor.cmd, bin/thumbelina.bat, bin/thumbelina.cmd, bin/translate.bat, bin/translate.cmd, src/org/htmlparser/Attribute.java, src/org/htmlparser/Node.java, src/org/htmlparser/NodeFactory.java, src/org/htmlparser/PrototypicalNodeFactory.java, src/org/htmlparser/Remark.java, src/org/htmlparser/StringNodeFactory.java, src/org/htmlparser/Tag.java, src/org/htmlparser/Text.java, src/org/htmlparser/beans/BeanyBaby.java, src/org/htmlparser/beans/FilterBean.java, src/org/htmlparser/beans/HTMLLinkBean.java, src/org/htmlparser/beans/HTMLTextBean.java, src/org/htmlparser/beans/LinkBean.java, src/org/htmlparser/beans/StringBean.java, src/org/htmlparser/beans/package.html, src/org/htmlparser/filters/AndFilter.java, src/org/htmlparser/filters/CssSelectorNodeFilter.java, src/org/htmlparser/filters/HasAttributeFilter.java, src/org/htmlparser/filters/HasChildFilter.java, src/org/htmlparser/filters/HasParentFilter.java, src/org/htmlparser/filters/HasSiblingFilter.java, src/org/htmlparser/filters/LinkRegexFilter.java, src/org/htmlparser/filters/LinkStringFilter.java, src/org/htmlparser/filters/NodeClassFilter.java, src/org/htmlparser/filters/NotFilter.java, src/org/htmlparser/filters/OrFilter.java, src/org/htmlparser/filters/RegexFilter.java, src/org/htmlparser/filters/TagNameFilter.java, src/org/htmlparser/http/ConnectionManager.java, src/org/htmlparser/http/ConnectionMonitor.java, src/org/htmlparser/http/Cookie.java, src/org/htmlparser/http/package.html, src/org/htmlparser/nodeDecorators/AbstractNodeDecorator.java, src/org/htmlparser/nodeDecorators/DecodingNode.java, src/org/htmlparser/nodeDecorators/EscapeCharacterRemovingNode.java, src/org/htmlparser/nodeDecorators/NonBreakingSpaceConvertingNode.java, src/org/htmlparser/nodeDecorators/package.html, src/org/htmlparser/nodes/AbstractNode.java, src/org/htmlparser/nodes/RemarkNode.java, src/org/htmlparser/nodes/TagNode.java, 
    src/org/htmlparser/nodes/TextNode.java, src/org/htmlparser/nodes/package.html,
    src/org/htmlparser/parserapplications/filterbuilder/FilterBuilder.java,
    src/org/htmlparser/scanners/CompositeTagScanner.java,
    src/org/htmlparser/tags/BaseHrefTag.java, src/org/htmlparser/tags/BodyTag.java,
    src/org/htmlparser/tags/CompositeTag.java, src/org/htmlparser/tags/DoctypeTag.java,
    src/org/htmlparser/tags/FormTag.java, src/org/htmlparser/tags/FrameSetTag.java,
    src/org/htmlparser/tags/FrameTag.java, src/org/htmlparser/tags/HeadTag.java,
    src/org/htmlparser/tags/ImageTag.java, src/org/htmlparser/tags/JspTag.java,
    src/org/htmlparser/tags/LabelTag.java, src/org/htmlparser/tags/LinkTag.java,
    src/org/htmlparser/tags/MetaTag.java, src/org/htmlparser/tags/OptionTag.java,
    src/org/htmlparser/tags/ScriptTag.java, src/org/htmlparser/tags/SelectTag.java,
    src/org/htmlparser/tags/TableRow.java, src/org/htmlparser/tags/TableTag.java,
    src/org/htmlparser/tags/TextareaTag.java, src/org/htmlparser/tags/TitleTag.java,
    src/org/htmlparser/tags/package.html,
    src/org/htmlparser/tests/lexerTests/KitTest.java,
    src/org/htmlparser/tests/lexerTests/LexerTests.java:

    Documentation revamp part one.
    Deprecated node decorators.
    Added doSemanticAction for Text and Comment nodes.
    Added missing sitecapturer scripts.
    Fixed DOS batch files to work when called from any location.

2005-04-06 06:27 derrickoswald

    * build.xml, docs/release.txt, docs/samples.html:

    End user experience issues:
        remove multiple wiki files in zip
        fix sample application links
        change readme.txt to use Windows line endings
        change copyright date

2005-04-06 06:20 derrickoswald

    * docs/contributors.html, src/org/htmlparser/filters/LinkRegexFilter.java,
    src/org/htmlparser/filters/LinkStringFilter.java:

    Add link pattern filters submitted by John Derrick.

2005-04-04 20:48 derrickoswald

    * src/org/htmlparser/: NodeFilter.java, Parser.java, package.html,
    parserapplications/SiteCapturer.java:

    Update javadocs.
    Enable SiteCapturer to handle resource names containing spaces.

Integration Build 1.5 - 20050313
--------------------------------

2005-03-13 09:51 derrickoswald

    * src/org/htmlparser/: lexer/Lexer.java, lexer/Page.java, lexer/Source.java,
    lexerapplications/tabby/Tabby.java, scanners/ScriptDecoder.java,
    tests/lexerTests/TagTests.java, util/IteratorImpl.java:

    Bug #1121401 No Parsing with yahoo!
    By default nio.charset.CharsetDecoder replaces characters it cannot
    represent in the current encoding with zero, which was the value
    returned by the page when the Stream reached EOF.
    This changes the Page return value to (char)Source.EOF (-1) when the
    end of stream is encountered.

2005-03-12 16:39 derrickoswald

    * src/org/htmlparser/beans/: BeanyBaby.java, LinkBean.java:

    Fix bean example, stop sharing connections.

2005-03-12 15:27 derrickoswald

    * build.xml, lib/commons-logging.jar:

    Bug #1018884 'compile' ant task from build.xml messes up ./src directory
    Added optional "classes" property to build.xml.
    This directory is where class files are put. It defaults to src.
    To use:
        build -Dclasses=classdir <target>
    where classdir is a peer directory to src.
    Removed unused commons-logging.jar while I was in there.

2005-03-12 12:53 derrickoswald

    * src/org/htmlparser/: lexer/Lexer.java, scanners/ScriptScanner.java,
    tests/scannersTests/ScriptScannerTest.java:

    Add STRICT flag to ScriptScanner to revert to legacy handling of
    broken ETAGO (</). If STRICT is true, scan according to HTML
    specification, else if false, scan with quote smart state machine
    which heuristically yields the correct parse.

2005-03-12 08:39 derrickoswald

    * src/org/htmlparser/: tests/visitorsTests/UrlModifyingVisitorTest.java,
    util/NodeList.java:

    RFE #1160345 NodeList.visitAllNodesWith
    Added visitAllNodesWith to the NodeList class.
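
The Page end-of-stream change for bug #1121401 in this build can be illustrated with a small sketch. It does not use the htmlparser classes: the class EofSentinelSketch and the method getCharacter are hypothetical names, and only the EOF value of -1 comes from the entry. It shows why (char) -1 is a safer end marker than zero: a NUL character inside the text no longer looks like end of stream.

```java
public class EofSentinelSketch {
    /** End-of-stream marker; the value -1 is taken from the changelog entry. */
    static final int EOF = -1;

    /**
     * Hypothetical stand-in for a page reader: returns the character at
     * index, or (char) EOF once the index runs past the end of the text.
     */
    static char getCharacter(String text, int index) {
        return index < text.length() ? text.charAt(index) : (char) EOF;
    }

    public static void main(String[] args) {
        // The text contains a NUL, which a zero-valued EOF marker would
        // have mistaken for the end of the stream.
        String text = "a\u0000b";
        int count = 0;
        for (int i = 0; getCharacter(text, i) != (char) EOF; i++) {
            count++;
        }
        System.out.println(count);            // prints 3
        System.out.println((int) (char) EOF); // prints 65535
    }
}
```

Casting -1 to char gives U+FFFF, a Unicode noncharacter, so it is unlikely to collide with real page content.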

2005-03-12 07:52 derrickoswald

    * src/org/htmlparser/: beans/StringBean.java, tests/utilTests/AllTests.java,
    tests/utilTests/NonEnglishTest.java:

    Bug #1161137 Non English Character web page
    Reinitialize the string buffer after encoding change exception processing.

2005-03-12 06:52 derrickoswald

    * src/org/htmlparser/http/ConnectionManager.java:

    Bug #1160010 NullPointerException in addCookies
    Add test for null expiry date.

Integration Build 1.5 - 20050306
--------------------------------

2005-03-06 21:18 derrickoswald

    * src/org/htmlparser/: lexer/Lexer.java, lexer/Page.java,
    scanners/ScriptScanner.java, scanners/StyleScanner.java,
    tests/scannersTests/ScriptScannerTest.java:

    Bug #1104627 Parser Crash reading javascript
    Bug #1024045 StringBean crashes on an URL
    Bug #1021925 StyleTag with missing linefeed prevents page from parsing
    Corrected operation with script and style scanners to recognize the
    ETAGO when parsing CDATA -- see
    http://www.w3.org/TR/html4/appendix/notes.html#notes-specifying-data.
    Original solution to bug #741769 ScriptScanner doesn't handle quoted
    </script> tags, was erroneous; it should have been recognized as
    faulty HTML.
    Several test cases changed to follow this advice:
    "Authors should therefore escape "</" within the content."

2005-03-06 16:46 derrickoswald

    * src/org/htmlparser/: lexer/InputStreamSource.java,
    tests/lexerTests/LexerTests.java:

    Bug #1044707 mark()/reset() issues
    Added wrapping with a org.htmlparser.lexer.Stream if markSupported
    returns false on the InputStream passed to InputStreamSource constructor.
    Added better error message when reset fails in setEncoding().

2005-03-04 10:57 derrickoswald

    * src/org/htmlparser/parserapplications/filterbuilder/FilterBuilder.java:

    Bug #1153508 CVS sources do not compile
    Repaired sources so it would compile with Java 1.4.

2005-02-14 19:41 derrickoswald

    * src/org/htmlparser/lexer/InputStreamSource.java:

    Bug #1056438 Byte Order Mark
    Not a solution, just a better error message.
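
The mark()/reset() fix for bug #1044707 above follows a common pattern: check markSupported() and wrap the stream when it returns false. The sketch below is only an illustration under assumptions. It substitutes java.io.BufferedInputStream for org.htmlparser.lexer.Stream, and the names MarkSupportSketch, ensureMarkSupported and unmarkable are hypothetical, not part of the library.

```java
import java.io.BufferedInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

public class MarkSupportSketch {
    /**
     * Returns the stream unchanged if it already supports mark/reset,
     * otherwise wraps it. BufferedInputStream is an assumed stand-in
     * for the wrapper class named in the changelog entry.
     */
    static InputStream ensureMarkSupported(InputStream in) {
        return in.markSupported() ? in : new BufferedInputStream(in);
    }

    /** Hypothetical stream without mark/reset support, for demonstration. */
    static InputStream unmarkable(final byte[] data) {
        return new InputStream() {
            private int pos = 0;

            @Override
            public int read() {
                return pos < data.length ? (data[pos++] & 0xff) : -1;
            }
        };
    }

    public static void main(String[] args) throws IOException {
        InputStream raw = unmarkable("abc".getBytes(StandardCharsets.US_ASCII));
        InputStream in = ensureMarkSupported(raw);
        in.mark(16);           // remember the current position
        int first = in.read();
        in.reset();            // rewind, as a change of encoding would require
        int again = in.read();
        System.out.println((char) first + " " + (char) again); // prints "a a"
    }
}
```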

2005-02-14 18:54 derrickoswald

    * docs/contributors.html:

    Add David Anderson to contributors list.

2005-02-14 18:49 derrickoswald

    * src/org/htmlparser/parserapplications/SiteCapturer.java:

    Implement suggested change for bug #1061869 Crashing when trying to
    capture link to XLS document
    checking for null from getContentType().

Integration Build 1.5 - 20050213
--------------------------------

2005-02-13 15:49 derrickoswald

    * src/org/htmlparser/parserapplications/filterbuilder/:
    images/.xvpics/copy.gif, images/.xvpics/cut.gif, images/.xvpics/delete.gif,
    wrappers/images/.xvpics/AndFilter.gif,
    wrappers/images/.xvpics/HasAttributeFilter.gif,
    wrappers/images/.xvpics/HasChildFilter.gif,
    wrappers/images/.xvpics/HasParentFilter.gif,
    wrappers/images/.xvpics/HasSiblingFilter.gif,
    wrappers/images/.xvpics/NodeClassFilter.gif,
    wrappers/images/.xvpics/NotFilter.gif,
    wrappers/images/.xvpics/OrFilter.gif,
    wrappers/images/.xvpics/RegexFilter.gif,
    wrappers/images/.xvpics/StringFilter.gif,
    wrappers/images/.xvpics/StringFilter2.gif,
    wrappers/images/.xvpics/TagNameFilter.gif:

    FilterBuilder remove mistakenly dropped files.

2005-02-13 15:43 derrickoswald

    * src/org/htmlparser/parserapplications/filterbuilder/:
    Filter.java, FilterBuilder.java, HtmlTreeCellRenderer.java,
    HtmlTreeModel.java, SubFilterList.java,
    images/about.gif, images/copy.gif, images/cut.gif, images/delete.gif,
    images/new.gif, images/open.gif, images/paste.gif, images/save.gif,
    wrappers/AndFilterWrapper.java, wrappers/HasAttributeFilterWrapper.java,
    wrappers/HasChildFilterWrapper.java, wrappers/HasParentFilterWrapper.java,
    wrappers/HasSiblingFilterWrapper.java, wrappers/NodeClassFilterWrapper.java,
    wrappers/NotFilterWrapper.java, wrappers/OrFilterWrapper.java,
    wrappers/RegexFilterWrapper.java, wrappers/StringFilterWrapper.java,
    wrappers/TagNameFilterWrapper.java,
    wrappers/images/AndFilter.gif, wrappers/images/HasAttributeFilter.gif,
    wrappers/images/HasChildFilter.gif, wrappers/images/HasParentFilter.gif,
    wrappers/images/HasSiblingFilter.gif, wrappers/images/NodeClassFilter.gif,
    wrappers/images/OrFilter.gif, wrappers/images/RegexFilter.gif,
    wrappers/images/TagNameFilter.gif,
    wrappers/images/.xvpics/AndFilter.gif,
    wrappers/images/.xvpics/HasAttributeFilter.gif,
    wrappers/images/.xvpics/HasChildFilter.gif,
    wrappers/images/.xvpics/HasParentFilter.gif,
    wrappers/images/.xvpics/HasSiblingFilter.gif,
    wrappers/images/.xvpics/NodeClassFilter.gif,
    wrappers/images/.xvpics/NotFilter.gif,
    wrappers/images/.xvpics/OrFilter.gif,
    wrappers/images/.xvpics/RegexFilter.gif,
    wrappers/images/.xvpics/StringFilter.gif,
    wrappers/images/.xvpics/StringFilter2.gif,
    wrappers/images/.xvpics/TagNameFilter.gif,
    wrappers/images/NotFilter.gif, wrappers/images/StringFilter.gif,
    images/.xvpics/delete.gif, images/.xvpics/copy.gif, images/.xvpics/cut.gif,
    layouts/NullLayoutManager.java, layouts/VerticalLayoutManager.java:

    FilterBuilder

2005-02-13 15:36 derrickoswald

    * src/org/htmlparser/filters/AndFilter.java,
    src/org/htmlparser/filters/HasAttributeFilter.java,
    src/org/htmlparser/filters/HasChildFilter.java,
    src/org/htmlparser/filters/HasParentFilter.java,
    src/org/htmlparser/filters/HasSiblingFilter.java,
    src/org/htmlparser/filters/NodeClassFilter.java,
    src/org/htmlparser/filters/NotFilter.java,
    src/org/htmlparser/filters/OrFilter.java,
    src/org/htmlparser/filters/RegexFilter.java,
    src/org/htmlparser/filters/StringFilter.java,
    src/org/htmlparser/filters/TagNameFilter.java,
    src/org/htmlparser/lexerapplications/thumbelina/Thumbelina.java,
    src/org/htmlparser/tags/TableRow.java, src/org/htmlparser/tags/TableTag.java,
    src/org/htmlparser/NodeFilter.java, src/org/htmlparser/Parser.java,
    bin/filterbuilder, bin/filterbuilder.bat, bin/thumbelina, bin/thumbelina.bat,
    src/org/htmlparser/tests/ParserTest.java,
    src/org/htmlparser/tests/visitorsTests/HtmlPageTest.java, build.xml,
    src/org/htmlparser/beans/BeanyBaby.java,
    src/org/htmlparser/beans/FilterBean.java,
    src/org/htmlparser/util/NodeList.java:

    FilterBuilder
    Implemented:
    RFE #1000063 FilterBean
    Task #93153 filter builder tool

2005-01-09 19:43 derrickoswald

    * docs/samples.html:

    Fix link to StringExtractor.

2004-09-24 19:16 derrickoswald

    * docs/contributors.html:

    Update Alberto's contributor info.

2004-09-06 13:19 derrickoswald

    * build.xml:

    Provide for building with JDK 1.5 by adding source="1.3" to javac tasks.

2004-09-06 13:12 derrickoswald

    * src/org/htmlparser/: tags/MetaTag.java, lexer/Page.java,
    tests/ParserTest.java: