⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 modifications

📁 数据挖掘c4.5算法(vc语言版本)欢迎大家下载测试!!!!
💻
字号:
Modifications since book was published:

--------------------------------------------------------------------------------
(1)	17 August 1992:  fixed bug in prunerule.c

	In routine Satisfies about line 434:
	moved statement
		    t->Outcome = -1;
	to before the for loop
--------------------------------------------------------------------------------
(2)	2nd Feb 1993: fixed errors reported by Dick Jackson

	c4.5rules.c line 34: changed ';' to ','
 	getnames.c: moved CopyString() declaration to head
--------------------------------------------------------------------------------
(3)	19th June 1993: fixed error reported by Guillermo Irisarri

	ANSI C doesn't like "exit()" with no args in average.c, xval-prep.c
--------------------------------------------------------------------------------
(4)	5th July 1993: fixed bug in c4.5rules reported by Ray Mooney

	SaveRules() was invoked before EvaluateRulesets(), but the latter
	can delete globally unhelpful rules.  SaveRules() was moved to
	after evaluation of rules on training data
	    (Note: this change affects only the use of consultr with the
	     saved rules; experimental results are unaltered.)
--------------------------------------------------------------------------------
(5)	13th July 1993: changed rules.c to improve printing with -s option

	When tests on discrete attributes use value groups, the standard
	form of test is
	    "<attribute> in {<value>, <value>, ...}".
	If there is only one value, this should appear as
	    "<attribute> = <value>".
	This has already been changed in trees; a similar change has now
	been made to function PrintCondition() in rules.c
--------------------------------------------------------------------------------
(6)	28th July 1993: killed very large confusion matrices

	confmat.c line 19: added copout if number of classes > 20
--------------------------------------------------------------------------------
(7)	9th September 1993: fixed problems notified by Mike Jankulak.

	* Added checks for reasonable parameter values in c4.5, c4.5rules.
	  Check in GetNames() for discrete N: N must be at least 2.

	* consult, consultr don't work with attributes of type discrete N !
	  Added routines in trees.c to save and restore values of attributes
	  of this type when saving / reading trees.
	  Modified rules.c to invoke these routines when saving / reading
	  rulesets.

	  NOTE: old .tree, .unpruned and .rules files must be regenerated
	        if they are to be used by the modified programs.
--------------------------------------------------------------------------------
(8)	3rd November 1993; problem notified by Jason Catlett

	c4.5rules prints an incorrect confusion matrix for the training
	set when rules are dropped.  Altered testrules.c.
--------------------------------------------------------------------------------
(9)	21st December 1993; tidying up only

	Changed definition of Log() in defns.i so that argument of log()
	is guaranteed float.
--------------------------------------------------------------------------------
(10)    5th February 1994; problem notified by George John

	Calculation of Gain in build.c can be negative rather than zero
	due to FP rounding.  Changed tests "Gain[Att] >= 0" to
	"Gain[Att] > -Epsilon".
--------------------------------------------------------------------------------
(11)	25th May 1994; problem notified by Ronny Kohavi

	Similar problem in info.c with -g option.  Changed test
        "ThisGain > 0" to "ThisGain > -Epsilon".
--------------------------------------------------------------------------------
(12)	30th May 1994; tidying up

	Removed explicit Outcomes field from rules.  This simplifies
	the code somewhat with little decrease in efficiency.
--------------------------------------------------------------------------------
(13)	18th July 1994; problem notified by Ronny Kohavi

	Average gain evaluated incorrectly when all attributes have
	many discrete values.  In build.c, introduced MultiVal to check
	for this contingency.
--------------------------------------------------------------------------------
(14)	18th-20th July 1994; modifications to siftrules.c

	(a) Changed coding of exceptions:
	    * added cost of encoding total number of errors to cost of
	      identifying false positives and false negatives.
	    * applied penalty to non-representative theories as described
	      in my ML'94 paper.
	(b) Introduced a new form of local greedy search for finding
	    good subsets when there are more than 10 rules.  This is
	    faster than simulated annealing and replaces it as the default:
	    simulated annealing is still available via a new option -a.
--------------------------------------------------------------------------------

***********************  Release 6  July 1994  *********************************

--------------------------------------------------------------------------------
(15)	11th August 1994; bug reported by KaiMing Ting and Zijian Zheng

	In subset.c, DiscrKnownBaseInfo() can be called when KnownItems = 0.
	Trapped such calls.
--------------------------------------------------------------------------------
(16)	21st January 1995; bug reported by Tom Fawcett

	Very large trees can cause the short int in TreeSize to overflow.
	Changed to int.
--------------------------------------------------------------------------------
(17)	6th April 1995; bug reported by Ronny Kohavi

	Exit status not being set properly.  Modified the following:
	    c4.5.c, c4.5rules.c, consult.c, consultr.c.
--------------------------------------------------------------------------------
(18)	19th April 1995; bug reported by Kim Horn

	For very small values of CF less than 0.1%, confidence levels
	are computed erratically.  Modified stats.c.
--------------------------------------------------------------------------------
(19)    June 1995: modifications to siftrules.c (again!)

        Scheme described above in 14(a) amended in line with my ML'95
	paper, available by anonymous ftp from ftp.cs.su.oz.au, directory
	pub/ml, file q.ml95.ps.Z.
--------------------------------------------------------------------------------

***********************  Release 7  June 1995  *********************************

--------------------------------------------------------------------------------
(20)	6th July 1995; bug reported by Andrew Taylor

	Tree printing can have problems when attribute names are very long.
	Modified trees.c.
--------------------------------------------------------------------------------
(21)	18th October 1995: modifications to contin.c

	Altered the calculation of gain for continuous attributes (described
	in "Improved Use of Continuous Attributes in C4.5").
--------------------------------------------------------------------------------

***********************  Release 8  October 1995  ******************************

--------------------------------------------------------------------------------
(22)	26th Feb 1996; minor glitches reported by Ron Kohavi of SGI

	Fn declared extern in rules.c
	-lm removed from consult, consultr, xval-prep
--------------------------------------------------------------------------------

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -