📄 http:^^nugget.cs.ucla.edu:8001^publications^index.html
字号:
geographic applications, data formats, and storage systems, as well asthe complexity of data and the computational requirements ofscientific queries. In particular, our goal is to provide thenecessary combination of expressiveness, ease of use, flexibility, andefficiency to effectively support the analysis of complexspatio-temporal geoscientific datasets maintained by heterogeneousdata sources in many different formats. Conquest's data model is richyet conceptually simple; it captures some important structural andsemantic properties common to geoscientific data which influence thechoice of query processing strategies, and is flexible enough to serveas the canonical model for a wide variety of scientific andnon-scientific data. Conquest supports a graphical dataflowprogramming environment in which scientists can interactivelymanipulate and visualize scientific data. The extensible parallelquery execution server supports varieties of inter- and intra-operatorparallelism through the use of various support operators. To providea convenient heterogeneous distributed scientific data processingenvironment to scientists, the system supports a set of interfaces toa variety of scientific data sources including several external dataformats and database servers. We also report some early experienceswith benchmarking the performance of the system.<P><HR><P><A NAME="ride_nds_96_oasis"><H3>E. Mesrobian, R.R. Muntz, E. Shek, S. Nittel, M. LaRouche, and M.Krieger,<!WA31><A HREF="http://nugget.cs.ucla.edu:8001/publications/ride_nds_96_oasis.ps.Z">"OASIS: An Open Architecture Scientific Information System"</A>,<!WA32><A HREF="http://www.cs.pitt.edu/ride96/ride96.html">Sixth International Workshop on Research Issues in Data Engineering: Interoperability of Nontraditional Database Systems</A>,New Orleans, Louisiana, Feb. 1996.</H3><P>Motivated by the premise that heterogeneity of software applications andhardware systems is here to stay, we are developingOASIS, a flexible, extensible, and seamless environment for scientific dataanalysis, knowledge discovery, visualization, and collaboration. In thispaper we discuss our OASIS design goals and present the system architectureand major components of our prototype environment.<P><HR><P><A NAME="sisic_95_conquest"><H3>E.C. Shek, E. Mesrobian, and R.R. Muntz,<!WA33><A HREF="http://nugget.cs.ucla.edu:8001/publications/sisic_95_conquest.ps.Z">"Optimization of Access to Heterogeneous Data Repositoriesin a Geoscientific Query Processing System"</A>,<!WA34><A HREF="http://nssdc.gsfc.nasa.gov/sisic/sisic.html">Science Information Systems Interoperability Conference</A>,College Park, MD, Nov. 1995.</H3><P><HR><P><A NAME="sisic_95_oasis"><H3>E. Mesrobian, R.R. Muntz, M. LaRouche, S. Nittel, E. Shek, and M.Krieger,"OASIS: An Open Architecture Scientific Information System",<!WA35><A HREF="http://nssdc.gsfc.nasa.gov/sisic/sisic.html">Science Information Systems Interoperability Conference</A>,College Park, MD, Nov. 1995.</H3><P><HR><P><A NAME="kdd_95"><H3>P. Stolorz, E. Mesrobian, R.R. Muntz, E.C. Shek, J.R. Santos, J. Yi, K. Ng, S.Y. Chien, H. Nakamura, C.R. Mechoso, and J.D. Farrara,<!WA36><A HREF="http://nugget.cs.ucla.edu:8001/publications/kdd_95.ps.Z">"Fast Spatio-Temporal Data Mining of Large Geophysical Datasets"</A>,<!WA37><A HREF="http://www-aig.jpl.nasa.gov:80/kdd95/">The First International Conference on Knowledge Discovery and DataMining</A>,Montreal, Quebec, Canada, Aug 1995. </H3><P>The important scientific challenge of understanding global climatechange is one that clearly requires the application of knowledgediscovery and data mining techniques on a massive scale. Advances inparallel supercomputing technology, enabling high-resolution modeling,as well as in sensor technology, allowing data capture on anunprecedented scale, conspire to overwhelm present-day analysisapproaches. We present here early experiences with a prototypeexploratory data analysis environment, CONQUEST, designed to providecontent-based access to such massive scientific datasets. CONQUEST(CONtent-based Querying in Space and Time) employs a combination ofworkstations and massively parallel processors (MPPs) to minegeophysical datasets possessing a prominent temporal component. It isdesigned to enable complex multi-modal interactive querying andknowledge discovery, while simultaneously coping with theextraordinary computational demands posed the scope of the datasetsinvolved. After outlining a working prototype, we concentrate here onthe description of several associated feature extraction algorithmsimplemented on MPP platforms, together with some typical results.<P><HR><P><A NAME="gis_95"><H3>R.R. Muntz, E. Mesrobian, C.R. Mechoso, D. McCleese, R. Haskins,R. Zurek, and T. Barnett,"Integrating Distributed Object Management into EOS", Geo Info Systems, 5(5):58-59, May 1995.</H3><P><HR><P><A NAME="ieee_pacrim_95"><H3>E. Mesrobian, R.R. Muntz, E.C. Shek, J.R. Santos, J. Yi, K. Ng, S.Y. Chien, C.R. Mechoso, J.D. Farrara, P. Stolorz, and H. Nakamura,<!WA38><A HREF="http://nugget.cs.ucla.edu:8001/publications/ieee_pacrim_95.ps.Z">"Exploratory Data Mining and Analysis Using Conquest"</A>,IEEE Pacific Rim Conference on Communications, Computers, Visualization,and Signal Processing,Victoria, British Columbia, Canada, May 1995.</H3><P>Exploratory data mining and analysis requires an extensible environmentwhich provides facilities for the user-friendly expression and rapidexecution of "scientific queries". In this paper we present theConquest environment and illustrate its use for exploratory data analysisand data mining of spatio-temporal phenomena from geophysical datasets.<P><HR><P><A NAME="sc_94"><H3>E. Mesrobian, R.R. Muntz, E.C. Shek, C.R. Mechoso, J.D. Farrara, J.A. Sphar, and P. Stolorz,<!WA39><A HREF="http://nugget.cs.ucla.edu:8001/publications/sc_94.ps.Z">"Real Time Data Mining, Management, and Visualization of GCM Output"</A>,Supercomputing 94 Poster,Washington, DC, Nov 1994.</H3><P>The output of simulations (e.g., global circulation models)can run into terabytes.The computational cost as well as the cost of storing and retrievingmodel data can be quite high. Recently there have been some effortsto develop on-line visualization capabilities that can be used, forexample, to monitor whether the model is behaving properly.There are however, many other uses for on-line data analysis includingfeature extraction, computational steering of the model, andcontrolled saving of model output (e.g., more frequent samples ofstate information under certain conditions).Each of these applications is a potential client of model output data.We present a software architecture which stresses modularity andflexibility and supports a variety of clients. Some preliminaryperformance numbers are given from a prototype implementation.<P><HR><P><A NAME="cs940039"><H3>E.C. Shek, and R.R. Muntz,<!WA40><A HREF="http://nugget.cs.ucla.edu:8001/publications/cs940039.ps.Z">"The Conquest Modeling Framework for Geoscientific Data"</A>,UCLA CSD Technical Report #940039, Oct 1994.</H3><P>Geoscience studies produce data from various observations,experiments, and simulations at an enormous rate. At UCLA, we aredeveloping the Conquest parallel scientific query processing systemto effectively support the analysis of complex spatio-temporal geoscientific datasets maintained by heterogeneous data sources in many different formats.In this paper, we describe the Conquest data modeling framework whichcaptures some of the important semantics of geoscientific data and common processing paradigms. The central concept behind theConquest data model is that of the field, which is the association ofgeometric cells in a coordinate space with dependent variable values.Cells with different semantics and structures can model a largevariety of traditional and scientific datasets. The properties ofcells also influence the choice of data storage, indexing, and queryoptimization strategies. In addition to providing a flexible datamodel to serve as the canonical model for a wide variety of scientificand non-scientific data, it is important for a system tobe able to efficiently retrieve and operate on scientific data.As a result, we define an algebra for the Conquest datamodel in which queries and operations against scientific data can beconveniently expressed. In addition, we present an overview of theConquest architecture, and some interesting query processing issues related to parallelism, extensibility and heterogeneity.<P><HR><P><A NAME="igarss_94"><H3>E. Mesrobian, E.C. Shek, R.R. Muntz, and W. Cheng,"Quest: An Environment for Content-Based Access to Geoscienitfic Datasets",<!WA41><A HREF="http://stardust.jpl.nasa.gov/igarss/igarss.htm">1994 International Geoscience and Remote Sensing Symposium</A>,Pasadena, CA, Aug. 1994.</H3><P>Recent advances in fine and coarse-grained super computers have enablescientists to create models which, in the past, were computationallyintractable. These advances have also provided scientists with theopportunity to greatly improve the success of their simulation resultsby allowing for finer model resolutions. Similarly, advances in sensor technology have led to instrument suites capable ofcapturing high spatial resolution, multi-spectral data at very highrates (e.g., Earth Observer Satellites (EOS) are expected to generatea terabyte of data per day). Unfortunately, software environments forstorage, retrieval, analysis, interpretation and visualization ofscientific information have not keep pace with their hardware counterparts. We have developed a prototype system called QUEST to providecontent-based query access to massive datasets. QUEST employsworkstations as well as teraFLOP computers to analyze geoscience datain order to produce spatial-temporal features that are used ashigh-level indexes into terabyte datasets. Examples are presented ofthe use of QUEST for the content-based access of Global CirculationModel (GCM) datasets. <P><HR><P><A NAME="aaai_env_tech_94"><H3>E. Mesrobian, R.R. Muntz, E.C. Shek, C.R. Mechoso, J.D. Farrara, and P. Stolorz,<!WA42><A HREF="http://nugget.cs.ucla.edu:8001/publications/aaai_env_tech_94.ps.Z">"QUEST: Content-based Access to Geophysical Databases"</A>,AAAI Workshop on AI Technologies in Environmental Applications,Seattle, WA, Jul-Aug 1994.</H3><P>A major challenge facing geophysical science today is the unavailabilityof high-level analysis tools with which to study the massive amount of data produced by sensors or long simulations of climate models.As part of a NASA HPCC Grand Challenge effort [Mun92], we have developeda prototype environment called QUEST to provide content-based queryaccess to massive datasets used in geophysical applications. QUESTemploys workstations as well as massively parallel processors to producespatio-temporal features that are used as high-level indexes intoterabyte datasets. This paper discusses our continued development ofthe QUEST environment.<P><HR><P><A NAME="ieee_vis_94"><H3>E. Mesrobian, R.R. Muntz, J.R. Santos, E.C. Shek,C.R. Mechoso, J.D. Farrara, and P. Stolorz,<!WA43><A HREF="http://nugget.cs.ucla.edu:8001/publications/ieee_vis_94.ps.Z">"Extracting Spatio-Temporal Patterns from Geoscience Datasets"</A>,IEEE Workshop on Visualization and Machine Vision,Seattle, WA, Jun 1994.</H3><P>A major challenge facing geophysical science today is the unavailabilityof high-level analysis tools with which to study the massive amount ofdata produced by sensors or longsimulations of climate models.We have developed a prototype system called QUEST to provide content-basedaccess to massive datasets. QUEST employs workstations as well as teraFLOPcomputers to analyze geoscience data to produce spatial-temporalfeatures that can be used as high-level indexes.Our first application area is global change climate modeling.In the initial prototype, the first features extracted are cyclonestrajectories from the output of multi-year climate simulations producedby a General Circulation Model.We present an algorithm for cyclone extraction and illustrate the use of cyclone indexes to access subsetsof GCM data for further analysis and visualization.<P><HR width=100% SIZE=2 NOSHADE><P><!-- Text-based index --><center><!WA44><A HREF="http://nugget.cs.ucla.edu:8001/general/">[General]</A><!WA45><A HREF="http://nugget.cs.ucla.edu:8001/projects/">[Projects]</A><!WA46><A HREF="http://nugget.cs.ucla.edu:8001/people/">[People]</A><!WA47><A HREF="http://nugget.cs.ucla.edu:8001/publications/">[Publications]</A><!WA48><A HREF="http://nugget.cs.ucla.edu:8001/presentations/">[Presentations]</A><!WA49><A HREF="http://nugget.cs.ucla.edu:8001/demos/">[Demos]</A><!WA50><A HREF="http://nugget.cs.ucla.edu:8001/misc/">[Related]</A></center><P><address>This page Copyright © 1996; Webmaster www@nugget.cs.ucla.edu<BR> Created: 5/12/95; Last Updated: 11/13/96</address></BODY></HTML>
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -