⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 ibm数据生成器所生成文件说明.txt

📁 IBM(原)数据生成器和源代码
💻 TXT
字号:
(1) Associations and Sequential Patterns: 
Code: 
assoc.gen.tar.Z (26,286 bytes) 
Downloading and Compiling Tips 


Usage: 
   gen lit|tax|seq [options]
   gen lit|tax|seq -help     For more detailed list of options

lit: large (frequent) itemsets without taxonomies 
tax: large (frequent) itemsets with taxonomies 
seq: sequential patterns 
Output Format: 
There are two posssible output formats for the data file, based on whether or not the "-ascii" option is specified. 
Binary 
Consists of <CustID, TransID, NumItems, List-Of-Items.> Each of these is a 4-byte integer. 

Ascii 
Each line contains a CustID, TransID, and Item. Each of these take up 10 bytes, for a total of 33 bytes per line. 

Apart from the data file, this program also generates a pattern file. The pattern file has three parts: 
A description of the data. 

A list of items with high weights. (Recall that the weight corresponds to the probability that item will appear in an itemset.) Each line has the item number, followed by the weight. 

A list of the itemsets/sequential patterns with high weight. (Recall that the weight corresponds to the probability that the itemset will appear in a transaction.) Each line has the weight, the expected confidence for rules generated from this itemset, and the itemset. 



⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -