📄 feat_selbw.m

📁 用Cross validation的方法建立人工神经网络的模型！
💻 M
字号:
function [best_features, removed_features] = feat_selBW (sample, Eps)

%
% function that selects best set of features removing one at a time from a complete set
% INPUT: sample, usual data matrix, last column is binary target
%        Eps, stopping criterion for feature selection (usually small positive value, say 1e-4)
% OUTPUT: best_features, set of selected features
%         removed_features, set of removed features in each iteration (in the first iteration there are
%                           no removed features (NaN value) e.g. removed_features(2) is a feature removed
%                           in the second iteration (of course, features once removed stay removed)
%         sensitivity, estimated sensitivity of prediction for each selected feature
%         specificity, estimated specificity of prediction for each selected feature
%         sensitivity_std, standard deviation of sensitivity for each selected feature
%         specificity_std, standard deviation of specificity for each selected feature
%

% initialize things
accuracy = zeros(1, size(sample, 2) - 1);

% start from the set of all features and calculate accuracy
best_features = [1 : size(sample, 2) - 1];
removed_features = [NaN];
[accuracy(1)] = calculate_prediction_accuracy(sample, best_features);

iteration = 2;
accuracy_last_round = accuracy(1);
eps = accuracy(1);

% until adding features improves accuracy
while eps > Eps & ~isempty(best_features)
    fprintf(1, '\n\nCurrent accuracy: %.2f', 100 * accuracy_last_round);
    fprintf(1, '\nRemoving features...');
    
    tsn = []; tsp = [];
    for i = 1 : length(best_features)
        fprintf(1, '\n    iteration %d/%d removing feature %d', i, length(best_features), best_features(i));

        % remove feature i
        features = setdiff(best_features, best_features(i));

        % caclulate sensitivity and specificity
        [acc(i)] = calculate_prediction_accuracy(sample, features);

        % calculate the accuracy of the i-th added feature
        fprintf(1, ' (%.2f)', 100* acc(i));
    end

    % find best removed feature and overall accuracy
    [accuracy_this_round pos] = max(acc);

    % is there any improvement from the last step, i.e. does the removed feature help
    eps = accuracy_this_round - accuracy_last_round;

    % we do not allow removing features that decrease accuracy
    if eps > Eps
        % add feature to removed features
        removed_features = [removed_features best_features(pos)];
        % remove best feature from the vector of selected features
        best_features = setdiff(best_features, best_features(pos));

        % caclulate accuracy
        accuracy(iteration) =  acc(pos);
        
        % store epsilon (in case anybody needs it)
        epsilon(iteration) = eps;

        iteration = iteration + 1;
    end
    
    accuracy_last_round = accuracy_this_round;
end

return
⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -