⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 mcupsampling.m

📁 一款数据挖掘的软件
💻 M
字号:
% MCUpSampling: implementation for up sampling
%
% Parameters:
% classifier: base classifier 
% para: parameters 
%   1. PosRatio: ratio of positive examples after sampling, default: 10
% X_train: training examples
% Y_train: training labels
% X_test: testing examples
% Y_test: testing labels 
% num_class: number of classes
% class_set: set of class labels such as [1,-1], the first one is the
% positive label
%
% Output parameters:
% Y_compute: the predicted labels
% Y_prob: the prediction confidence in [0,1]
%
% Require functions: 
% ParseParameter, Classify

function  [Y_compute, Y_prob] = MCUpSampling(classifier, para, X_train, Y_train, X_test, Y_test, num_class, class_set)

if (num_class ~= 2), 
    error('Error: The number of classes is larger than 2!');
end;

p = str2num(char(ParseParameter(para, {'-PosRatio'}, {'0.5'})));
sizefactor = p(1);

% If there are no training data, simply pass to the next level
if (isempty(X_train)),
    [Y_compute, Y_prob] = Classify(classifier, X_train, Y_train, X_test, Y_test, num_class, class_set);
    return;
end;

% Collect the positive and negative data
data_neg = X_train(Y_train ~= class_set(1), :);
data_pos = X_train(Y_train == class_set(1), :);
num_positive = size(data_pos, 1);
num_negative = size(data_neg, 1);

% Up sample the positive data
upsize = fix(sizefactor / (1 - sizefactor) * num_negative);
rand_index = fix(rand(1, upsize) * num_positive) + 1;
data_additional = data_pos(rand_index, :);     
X_train = [data_neg; data_additional];

label_neg = ones(size(data_neg, 1), 1) * class_set(2);
label_additional = ones(size(data_additional, 1), 1) * class_set(1);
Y_train = [label_neg; label_additional];       

% Classification
[Y_compute, Y_prob] = Classify(classifier, X_train, Y_train, X_test, Y_test, num_class, class_set);

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -