function [video, audio] = mmread(filename, frames, time, disableVideo, disableAudio)
% function [video, audio] = mmread(filename, frames, time, disableVideo, disableAudio)
% mmread reads virtually any media file.  If Windows Media Player can play
% it, so should mmread.  It uses the Windows DirectX infrastructure to
% render the media, so other OSs are out of luck.
%
% INPUT
% filename      input file to read (mpg, avi, wmv, asf, wav, mp3, gif, jpg, ...)
% frames        specifies which video frames to capture; default [] captures
%               all frames.  Leave it as [] when specifying time instead.
% time          [startTime stopTime], default [] for all
% disableVideo  disables ALL video capturing, to save memory or time
% disableAudio  disables ALL audio capturing, to save memory or time
%
% OUTPUT
% video is a struct with the following fields:
%   width           width of the video frames
%   height          height of the video frames
%   rate            the frame rate of the video, if it can't be determined
%                   it will be 1.
%   nrFramesTotal   the total number of frames in the movie regardless of
%                   how many were captured.  Unfortunately, this cannot
%                   always be determined.  If it is negative then it
%                   is an estimate based upon the duration and rate
%                   (normally accurate to within 0.1%).  It can be 0,
%                   in which case it could not be determined at all.  If it
%                   is a positive number then it should always be accurate.
%   frames          a struct array with the following fields:
%       cdata       [height X width X 3] uint8 matrices
%       colormap    always empty
%
% audio is a struct with the following fields:
%   nrChannels      the number of channels in the audio stream (1 or 2)
%   rate            sampling rate of the audio, ex. 44100.  If it can't be
%                   determined then it will be 1.
%   bits            bit depth of the samples (8 or 16)
%   data            the real data of the whole audio stream.  This can be
%                   played using wavplay.  If time ranges are specified,
%                   the length of the data may not correspond to the total
%                   time.  This normally happens with movies.  The issue is
%                   that the start of the audio stream is generally counted
%                   at the END of the first frame.  So, time is shifted by
%                   1/framerate.
%   nrFramesTotal   Audio comes in packets or frames when captured, the
%                   division of the audio into frames may or may not make
%                   sense.
%   frames          cell array of uint8s.  Probably not of great use.
%
% If there is no video or audio stream the corresponding structure will be
% empty.
%
% Specifying frames does not affect audio capturing.  If you want only a
% subsection of the audio use the 3rd parameter "time".  Specifying time
% affects both audio and video.  Time is specified in seconds (subsecond
% resolution is supported with fractional numbers ex. 1.125), starting at 0.
% Time is defined as startTime (inclusive) to stopTime (exclusive), or
% using set notation [startTime stopTime).
%
% If there are multiple video or audio streams, then the structure will be
% of length > 1.  For example: audio(1).data and audio(2).data.
%
% Images work, however the frames must be specified.  For some reason
% DirectShow doesn't ever stop when "playing" an image.  So to deal with
% this, I added support so that the processing stops once the last
% specified frame is captured instead of waiting until the media completes.
%
% EXAMPLES
% [video, audio] = mmread('chimes.wav'); % read whole wav file
% wavplay(audio.data,audio.rate);
%
% video = mmread('mymovie.mpg'); % read whole movie
% movie(video.frames);
%
% video = mmread('mymovie.mpg',1:10); %get only the first 10 frames
%
% video = mmread('mymovie.mpg',[],[0 3.5]); %read the first 3.5 seconds of the video
%
% [video, audio] = mmread('chimes.wav',[],[0 0.25]); %read the first 0.25 seconds of the wav
% [video, audio] = mmread('chimes.wav',[],[0.25 0.5]); %read 0.25 to 0.5 seconds of the wav, there is no overlap with the previous example.
%
% video = mmread('mymovie.mpg',[],[],false,true); %read all frames, disable audio
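%
% A hedged sketch, not from the original author: when a file holds several
% audio streams, audio is a struct array, so each stream can be inspected
% in a loop.  'mymovie.mpg' is the same placeholder file name used above.
% [video, audio] = mmread('mymovie.mpg');
% for k = 1:numel(audio)
%     fprintf('stream %d: %d channel(s) at %g Hz\n', k, audio(k).nrChannels, audio(k).rate);
% end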
%
% Written by Micah Richert

if nargin < 5
    disableAudio = false;
    if nargin < 4
        disableVideo = false;
        if nargin < 3
            time = [];
            if nargin < 2
                frames = [];
            end
        end
    end
end

try
    mexDDGrab('buildGraph',filename);
    if (isempty(time))
        mexDDGrab('setFrames',frames);
    else
        if (numel(time) ~= 2)
            error('time must be a vector of length 2: [startTime stopTime]');
        end
        mexDDGrab('setTime',time(1),time(2));
    end
    if (disableVideo)
        mexDDGrab('disableVideo');
    end
    % skip audio capture when it is disabled or the caller did not ask for it
    if (disableAudio || nargout < 2)
        mexDDGrab('disableAudio');
    end
    mexDDGrab('doCapture');
    
    [nrVideoStreams, nrAudioStreams] = mexDDGrab('getCaptureInfo');
    
    % include 'rate' so an empty result has the same fields as a populated one
    video = struct('width',{},'height',{},'rate',{},'nrFramesTotal',{},'frames',{});
    audio = struct('nrChannels',{},'rate',{},'bits',{},'nrFramesTotal',{},'data',{},'frames',{});
    
    warned = false;
    
    % loop through getting all of the video data from each stream
    for i=1:nrVideoStreams
        [width, height, rate, nrFramesCaptured, nrFramesTotal] = mexDDGrab('getVideoInfo',i-1);
        video(i).width = width;
        video(i).height = height;
        video(i).rate = rate;
        video(i).nrFramesTotal = nrFramesTotal;
        video(i).frames = struct('cdata',cell(1,nrFramesCaptured),'colormap',cell(1,nrFramesCaptured));
        
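        % nrFramesTotal is only exact when positive (0 = unknown, negative = estimate)
        if (nrFramesTotal > 0 && any(frames > nrFramesTotal))
            warning(['Frame(s) ' num2str(frames(frames>nrFramesTotal)) ' exceed the number of frames in the movie.']);
        end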
        
        scanline = ceil(width*3/4)*4; % the scanline size must be a multiple of 4.
        
        for f=1:nrFramesCaptured
            data = mexDDGrab('getVideoFrame',i-1,f-1);
            
            if (numel(data) ~= scanline*height)
                if (numel(data) > 3*width*height)
                    if (~warned)
                        warning('dimensions do not match data size. Guessing badly...');
                        warned = true;
                    end
                    scanline = width*3;
                    data = data(1:3*width*height);
                else
                    error('dimensions do not match data size. Too little data.');
                end
            end
            
            % if there is any extra scanline data, remove it
            data = reshape(data,scanline,height);
            data = data(1:3*width,:);
            
            % the data ordering is wrong for matlab images, so permute it
            tmp = permute(reshape(data, 3, width, height),[3 2 1]);
            % the images are also upside down and colors were backwards.
            video(i).frames(f).cdata = tmp(end:-1:1,:,3:-1:1);
        end
        
        % if frames are specified then make sure that the order is the same
        if (~isempty(frames) && nrFramesCaptured > 0)
            [uniqueFrames, dummy, frameOrder] = unique(frames);
            if (length(uniqueFrames) > nrFramesCaptured)
                warning('Not all frames specified were captured.  Returning what was captured, but order may be different than specified.');
                remainingFrames = frames(frames<=uniqueFrames(nrFramesCaptured));
                [dummy, dummy, frameOrder] = unique(remainingFrames);
            end
            
            video(i).frames = video(i).frames(frameOrder);
        end
    end
    
    % loop through getting all of the audio data from each stream
    for i=1:nrAudioStreams
        [nrChannels, rate, bits, nrFramesCaptured, nrFramesTotal] = mexDDGrab('getAudioInfo',i-1);
        audio(i).nrChannels = nrChannels;
        audio(i).rate = rate;
        audio(i).bits = bits;
        audio(i).nrFramesTotal = nrFramesTotal;
        audio(i).frames = cell(1,nrFramesCaptured);
        for f=1:nrFramesCaptured
            data = mexDDGrab('getAudioFrame',i-1,f-1);
            audio(i).frames{f} = data;
        end
        % combine the data across frames
        d = double(cat(1,audio(i).frames{:}));
        % reshape and rescale the data so that it is Samples x nrChannels
        % and -1.0 to 1.0.  This should be the same output as wavread.
        audio(i).data = reshape(d/2^(bits-1),nrChannels,length(d)/nrChannels)';
    end
    
    mexDDGrab('cleanUp');
catch err
    % always release the capture graph before propagating the error
    mexDDGrab('cleanUp');
    rethrow(err);
end