📄 32.txt
字号:
发信人: mining (key), 信区: DataMining
标 题: UCI data set description
发信站: 南京大学小百合站 (Tue Apr 29 13:50:39 2003)
Iris Plant Database
From Fisher, 1936
Documentation: complete
3 classes, 4 numeric attributes, 150 instances
1 class is linearly separable from the other 2, but the other 2 are not line
arly separable from each other (simple database)
Ftp Access
Isolet Spoken Letter Recognition Database
From Ron Cole and Mark Fanty
6238 + 1559 instances, 26 classes (one for each letter)
All attributes are real-valued scaled from -1.0 to 1.0.
No missing values
Ftp Access
Kinship Database
From Hinton 1986 & Quinlan 1989
Relational
24 individuals, 12 relations
104 instances derivable
Case studies have been reported by both authors
Ftp Access
Labor relations Database
From Collective Bargaining Review
Documentation: no statistics
Please see the labor directory for more information
Ftp Access
LED Display Domains
From Classification and Regression Trees book
Documentation: sufficient, but missing statistical information
All attributes are Boolean-valued
Two versions: 7 and 24 attributes
Optimal Baye's rate known for the 10% probability of noise problem
Several ML researchers have used this domain for testing noise tolerancy
We provide here 2 C programs for generating sample databases
Ftp Access
Lenses Database
Donated by Benoit Julien
Small database with few attributes
attributes are either binary- or ternary-valued
3 classes: hard contact lenses, soft contact lenses, or neither
Ftp Access
Letter Recognition Database
From David Slate
Based on various fonts
20,000 instances (712565 bytes) (.Z available)
17 attributes: 1 class (letter category) and 16 numeric (integer)
No missing attribute values
Ftp Access
Liver-disorders Database
BUPA Medical Research Ltd. database donated by Richard S. Forsyth
7 numeric-valued attributes
345 instances (male patients)
Includes cost data (donated by Peter Turney)
Ftp Access
Logic-theorist
Donated by Paul O'Rorke's (described in Machine Learning)
All code for LT
Ftp Access
Lung Cancer Database
Donated by Stefan Aeberhard
32 instances, 57 Attributes (2 classes)
No Attribute Definitions
Ftp Access
Lymphography Database (restricted access)
From Ljubljana Oncology Institute
Documentation: incomplete
CITATION REQUIREMENT: Please use (see the documentation file)
148 instances; 19 attributes; 4 classes; no missing data values
Mechanical Analysis Data
Donated by members of the Universita di Torino
Fault diagnosis problem of electromechanical devices
ENIGMA system application described in proceedings of MLC-1990
Each of the 209 instances is described by a different set of components
PUMPS DATA SET
Newer version of above dataset with domain theory and results
Ftp Access
Meta-data Database
Donated by J.Gama
Meta-Data was used in order to give advice about which classification method
is appropriate for a particular dataset (taken from the results of the Stat
log project).
528 instances; 22 attributes; numeric prediction; missing values
Ftp Access
Mobile Robots Database
Donated by Volker Klingspor, Katharina J. Morik and Anke D. Rieger
Learning Concepts from Sensor Data of a Mobile Robot
Multiple levels of learning (from raw sensor data to high level concepts)
Ftp Access
Molecular Biology Databases
Promoter Gene Sequences Database
Donated by Jude Shavlik; See AAAI-90 Towell, Shavlik, & Noordewier
E. Coli promoter gene sequences (DNA) with partial domain theory
106 instances, each predictor attribute takes on one of four values
50% positive instances
Splice-junction Gene Sequences Database
Donated by Geoffrey Towell, Noordewier, & Shavlik
categories "ei" and "ie" include every "split-gene" for primates in Genbank
64.1
non-splice examples taken from sequences known not to include a splicing sit
e
3190 instances with classes "ei" (25%), "ie" (25%) and Neither (50%)
Domain theory included
Protein Secondary Structure Database
Originally created and used by Qian and Sejnowski
From CMU connectionist bench repository
Classifies secondary structure of certain globular proteins
3 classes: alpha-helix, beta-sheet and random-coil
Protein Secondary Structure Domain Theory
Donated and created by Jude Shavlik & Rich Maclin
Imperfect domain theory for Qian and Sejnowski Protein Secondary Structure d
atabase (above)
Closely implements the algorithm of Chou and Fasman
Ftp Access
--
※ 来源:.南京大学小百合站 bbs.nju.edu.cn.[FROM: 202.118.237.14]
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -