⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 modifications

📁 c4.5的源码决策树最全面最经典的版本
💻
字号:
Modifications since book was published:--------------------------------------------------------------------------------(1)	17 August 1992:  fixed bug in prunerule.c	In routine Satisfies about line 434:	moved statement		    t->Outcome = -1;	to before the for loop--------------------------------------------------------------------------------(2)	2nd Feb 1993: fixed errors reported by Dick Jackson	c4.5rules.c line 34: changed ';' to ',' 	getnames.c: moved CopyString() declaration to head--------------------------------------------------------------------------------(3)	19th June 1993: fixed error reported by Guillermo Irisarri	ANSI C doesn't like "exit()" with no args in average.c, xval-prep.c--------------------------------------------------------------------------------(4)	5th July 1993: fixed bug in c4.5rules reported by Ray Mooney	SaveRules() was invoked before EvaluateRulesets(), but the latter	can delete globally unhelpful rules.  SaveRules() was moved to	after evaluation of rules on training data	    (Note: this change affects only the use of consultr with the	     saved rules; experimental results are unaltered.)--------------------------------------------------------------------------------(5)	13th July 1993: changed rules.c to improve printing with -s option	When tests on discrete attributes use value groups, the standard	form of test is	    "<attribute> in {<value>, <value>, ...}".	If there is only one value, this should appear as	    "<attribute> = <value>".	This has already been changed in trees; a similar change has now	been made to function PrintCondition() in rules.c--------------------------------------------------------------------------------(6)	28th July 1993: killed very large confusion matrices	confmat.c line 19: added copout if number of classes > 20--------------------------------------------------------------------------------(7)	9th September 1993: fixed problems notified by Mike Jankulak.	* Added checks for reasonable parameter values in c4.5, c4.5rules.	  Check in GetNames() for discrete N: N must be at least 2.	* consult, consultr don't work with attributes of type discrete N !	  Added routines in trees.c to save and restore values of attributes	  of this type when saving / reading trees.	  Modified rules.c to invoke these routines when saving / reading	  rulesets.	  NOTE: old .tree, .unpruned and .rules files must be regenerated	        if they are to be used by the modified programs.--------------------------------------------------------------------------------(8)	3rd November 1993; problem notified by Jason Catlett	c4.5rules prints an incorrect confusion matrix for the training	set when rules are dropped.  Altered testrules.c.--------------------------------------------------------------------------------(9)	21st December 1993; tidying up only	Changed definition of Log() in defns.i so that argument of log()	is guaranteed float.--------------------------------------------------------------------------------(10)    5th February 1994; problem notified by George John	Calculation of Gain in build.c can be negative rather than zero	due to FP rounding.  Changed tests "Gain[Att] >= 0" to	"Gain[Att] > -Epsilon".--------------------------------------------------------------------------------(11)	25th May 1994; problem notified by Ronny Kohavi	Similar problem in info.c with -g option.  Changed test        "ThisGain > 0" to "ThisGain > -Epsilon".--------------------------------------------------------------------------------(12)	30th May 1994; tidying up	Removed explicit Outcomes field from rules.  This simplifies	the code somewhat with little decrease in efficiency.--------------------------------------------------------------------------------(13)	18th July 1994; problem notified by Ronny Kohavi	Average gain evaluated incorrectly when all attributes have	many discrete values.  In build.c, introduced MultiVal to check	for this contingency.--------------------------------------------------------------------------------(14)	18th-20th July 1994; modifications to siftrules.c	(a) Changed coding of exceptions:	    * added cost of encoding total number of errors to cost of	      identifying false positives and false negatives.	    * applied penalty to non-representative theories as described	      in my ML'94 paper.	(b) Introduced a new form of local greedy search for finding	    good subsets when there are more than 10 rules.  This is	    faster than simulated annealing and replaces it as the default:	    simulated annealing is still available via a new option -a.--------------------------------------------------------------------------------***********************  Release 6  July 1994  *********************************--------------------------------------------------------------------------------(15)	11th August 1994; bug reported by KaiMing Ting and Zijian Zheng	In subset.c, DiscrKnownBaseInfo() can be called when KnownItems = 0.	Trapped such calls.--------------------------------------------------------------------------------(16)	21st January 1995; bug reported by Tom Fawcett	Very large trees can cause the short int in TreeSize to overflow.	Changed to int.--------------------------------------------------------------------------------(17)	6th April 1995; bug reported by Ronny Kohavi	Exit status not being set properly.  Modified the following:	    c4.5.c, c4.5rules.c, consult.c, consultr.c.--------------------------------------------------------------------------------(18)	19th April 1995; bug reported by Kim Horn	For very small values of CF less than 0.1%, confidence levels	are computed erratically.  Modified stats.c.--------------------------------------------------------------------------------(19)    June 1995: modifications to siftrules.c (again!)        Scheme described above in 14(a) amended in line with my ML'95	paper, available by anonymous ftp from ftp.cs.su.oz.au, directory	pub/ml, file q.ml95.ps.Z.--------------------------------------------------------------------------------***********************  Release 7  June 1995  *********************************--------------------------------------------------------------------------------(20)	6th July 1995; bug reported by Andrew Taylor	Tree printing can have problems when attribute names are very long.	Modified trees.c.--------------------------------------------------------------------------------(21)	18th October 1995: modifications to contin.c	Altered the calculation of gain for continuous attributes (described	in "Improved Use of Continuous Attributes in C4.5").--------------------------------------------------------------------------------***********************  Release 8  October 1995  ******************************--------------------------------------------------------------------------------(22)	26th Feb 1996; minor glitches reported by Ron Kohavi of SGI	Fn declared extern in rules.c	-lm removed from consult, consultr, xval-prep--------------------------------------------------------------------------------

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -