http:^^www.cs.wisc.edu^~dyer^cs766^proj.html
来自「This data set contains WWW-pages collect」· HTML 代码 · 共 310 行 · 第 1/2 页
HTML
310 行
<P>Real-world application of machine vision techniques has been limited byseveral factors. Most heuristic algorithms rely on several fudge factorswhich are difficult to tune manually. Many physics-based approached makesimplifying assumptions that significantly reduce performance inreal-world settings. Additionally, these algorithms often require accuratetechnical information about the problem, such as camera parameters orsurface properties. We conjecture that these difficulties can be bypassedby learning the target function directly from examples. Previous researchin this area has primarily focused on window-based approaches, which areinherently scale-dependent. These techniques, while effective for low-level vision, suffer from "the curse of dimensionality" when applied to intermediate-level vision task. We propose a family of scale-independent neural network techniques closely related to pyramids, the discrete Fourier transform and the wavelet transform. We (hope to) show that this methodology can be applied to learn shape-from-shading from a small number of examples. <HR SIZE=5><LI><B>Rebecca Hasti</B><BR><I>Hand Gesture Recognition Using Orientation Histograms</I><P>A quick and efficient method for computer recognition of hand gestures would be useful in a number of situations. For example, a system whichrecognized hand gestures in real-time could be used instead of a mouseto operate a computer. For this project, I plan to implement a gesturerecognition method for static hand gestures using orientation histogramsdescribed by W. Freeman and M. Roth in "Orientation Histograms for HandGesture Recognition," Proc. Int. Workshop on Automatic Face and GestureRecognition, 1995. This method of pattern recognition using orientationhistograms is relatively simple and fast and somewhat insensitive to scene illumination.<HR SIZE=5><LI><B>Kirk Hogenson</B> and <B>Todd Turnidge</B><BR><I>Applications of the Steerable Pyramid</I><P>In "The Design and Use of Steerable Filters," Adelson et al. discuss thecreation of an orientable filter, i.e., a filter that selects a specificdirection. This orientable (or "steerable") filter is capable ofdetecting the response of the image to filtering at any desired orientation, based on the result of a few 'basis' filters.<P>The steerable filter can be extended to select a specific scale as well asorientation, yielding a "steerable pyramid filter." The pyramid isroughly analogous to the Laplacian Pyramid in that each levelcorresponds to information at a different scale in the image. As withorientation, image response at any desired scale can be determined fromthe 'basis' scales (i.e., levels in the pyramid).<P>With such a pyramid, one can accomplish numerous tasks often performed inmachine vision applications, such as edge and contour detection, adaptivenoise reduction, and stereo matching.<P>For our project, we intend to implement such a steerable filter, as well assome its applications. <HR SIZE=5><LI><B>Mike James</B><BR><I>Internal Signature Keying: Improving Robustness of a Snake's Local Edge Finder</I><P>This project proposes to improve the distraction avoidance capabilities ofa snake tracking system's without moving up to the global object level. Theedge localization abilities of the snake are currently achieved at a locallevel, but only look for the strongest edge along a line normal to the snake.When tracking an object boundary the snake will be a closed contour. For a solidobject the pixel values immediately inside the contour are likely to remainconstant. Viewed along a line normal to the boundary this can be viewed as asignature to look for in subsequent search iterations. The local edge finder canthen be set up to look for this signature, as well as the intensity step signifyingthe boundary. It is hoped that this will help the edge finder avoid picking up onpotentially stronger background edges that may fall inside the search window.<HR SIZE=5><LI><B>Ted Perkins</B><BR><I>Solving Single-Image Random-Dot Stereograms (SIRDS) for Depth</I><P>Normal random-dot stereograms work by presenting each eye withtwo separate images; correspondences between the two images allowthe reconstruction of a depth map based on horizontal disparitybetween corresponding patterns of dots. SIRDS combine the imagesfor both eyes into a single image, relying on semi-periodic random dotfields. A program will be written to 'look' at SIRDS and reconstructa depth map. The first stage of the problem will scan for potentialmatches for each pixel, and an iterative relaxation scheme will beused to arrive at a (hopefully) globally coherent solution.<HR SIZE=5><LI><B>Dan Replogle</B><BR><I>Mosaic Construction</I><P>The project will construct a mosaic from a set of two images. The projectwill be modified to handle larger sets of images if time allows. The projectwill use a 2D transformation model, including translation and rotation. Hierarchical matching will be used. Matches will be made first at smaller,subsampled images, then refined. Matches that minimize the sum of squareddifferences will be considered the best. <HR SIZE=5><LI><B>Greg Sharp</B><BR><I>Texture Replication</I><P>In computer generated images, objects and textures which are difficult tomodel are often copied directly from scanned photographs. To a large degree,the success of this technique depends upon the image quality of the objectin the original photograph. For example, objects which are obscured orhave uneven illumination are not good candidates, because information aboutthe shape and/or texture of the object is missing or distorted.<P>In this paper, we present a algorithm to approximate missing or distortedimage information for a textured object. A texture description is recoveredfrom a object by identifying texel properties such as texel size, shape,density and orientation. The region of missing or distorted image informationis then textured by applying a texture replication technique to the region.<HR SIZE=5><LI><B>Mark Smucker</B><BR><I>Implementation and Comparison of Three Global Multi-level Thresholding Techniques</I><P>My intended project is to implement three different algorithms formultilevel thresholding of gray scale images and compare them onvarious images. The three methods I intend to implement areinteresting because of their relative newness and considerabledifferences in approaches. I intend to try and compare them in thestyle done by Lee and Chung for other global thresholding techniquesbesides summarizing and implementing them.<P>The first algorithm I intend to implement comes from the paper, "Afast histogram-clustering approach for multi-level thresholding" byTsai and Chen, which is "computationally fast and efficient" andshould be a good baseline system to test the other two algorithmsagainst since it does not attempt to consider the globalcharacteristics of the gray level distribution. The second algorithmtakes a connectionist approach while the third uses a simulatedannealing approach.<P>Besides the comparison report, one goal of this project is to providecode modules that will allow future CS766 students to experiment withmultilevel thresholding of images.<HR SIZE=5><LI><B>Jon Weyers</B><BR><I>Multiple Baseline Stereo with Non-Calibrated Views</I><P>I will attempt to apply the Multiple Baseline Stereo method(Okutomi and Kanade) to a set of three images, taken from highlyseparated, uncalibrated viewpoints. This differs from the methoddescribed in the paper in two ways. First, I will be testing theperformance of the method using only two baselines, the minimumnumber necessary for meaningful results. Second, I will calculatethe relative lengths of the baselines from "conjugate triples"specified interactively by the user, rather than assuming theabsolute lengths of the baselines are known. This makes the methodapplicable to snapshots taken from an unmounted camera. I willapply the method to images made using the Apple QuickTake camera,displaying results as a gray-level map of relative distances.(The exact distance would require exact knowledge of the lengthof the baselines.)<HR SIZE=5></UL></BODY></HTML>
⌨️ 快捷键说明
复制代码Ctrl + C
搜索代码Ctrl + F
全屏模式F11
增大字号Ctrl + =
减小字号Ctrl + -
显示快捷键?