Ranking Queries on Uncertain Data

Ranking Queries on Uncertain Data

Theoretical and algorithmic results of ranking queries on uncertain data are presented in the last section of this book.

Author: Ming Hua

Publisher: Springer Science & Business Media

ISBN: 1441993800

Category: Computers

Page: 224

View: 413

Uncertain data is inherent in many important applications, such as environmental surveillance, market analysis, and quantitative economics research. Due to the importance of those applications and rapidly increasing amounts of uncertain data collected and accumulated, analyzing large collections of uncertain data has become an important task. Ranking queries (also known as top-k queries) are often natural and useful in analyzing uncertain data. Ranking Queries on Uncertain Data discusses the motivations/applications, challenging problems, the fundamental principles, and the evaluation algorithms of ranking queries on uncertain data. Theoretical and algorithmic results of ranking queries on uncertain data are presented in the last section of this book. Ranking Queries on Uncertain Data is the first book to systematically discuss the problem of ranking queries on uncertain data.
Categories: Computers

Ranked Retrieval in Uncertain and Probabilistic Databases

Ranked Retrieval in Uncertain and Probabilistic Databases

Ranking queries are widely used in data exploration, data analysis and decision making scenarios.

Author: Mohamed A. Soliman

Publisher:

ISBN: OCLC:827755066

Category:

Page: 172

View: 754

Ranking queries are widely used in data exploration, data analysis and decision making scenarios. While most of the currently proposed ranking techniques focus on deterministic data, several emerging applications involve data that are imprecise or uncertain. Ranking uncertain data raises new challenges in query semantics and processing, making conventional methods inapplicable. Furthermore, the interplay between ranking and uncertainty models introduces new dimensions for ordering query results that do not exist in the traditional settings. This dissertation introduces new formulations and processing techniques for ranking queries on uncertain data. The formulations are based on marriage of traditional ranking semantics with possible worlds semantics under widely-adopted uncertainty models. In particular, we focus on studying the impact of tuple-level and attribute-level uncertainty on the semantics and processing techniques of ranking queries. Under the tuple-level uncertainty model, we introduce a processing framework leveraging the capabilities of relational database systems to recognize and handle data uncertainty in score-based ranking. The framework encapsulates a state space model, and efficient search algorithms that compute query answers by lazily materializing the necessary parts of the space. Under the attribute-level uncertainty model, we give a new probabilistic ranking model, based on partial orders, to encapsulate the space of possible rankings originating from uncertainty in attribute values. We present a set of efficient query evaluation algorithms, including sampling-based techniques based on the theory of Markov chains and Monte-Carlo method, to compute query answers. We build on our techniques for ranking under attribute-level uncertainty to support rank join queries on uncertain data. We show how to extend current rank join methods to handle uncertainty in scoring attributes. We provide a pipelined query operator implementation of uncertainty-aware rank join algorithm integrated with sampling techniques to compute query answers.
Categories:

Probabilistic Ranking Techniques in Relational Databases

Probabilistic Ranking Techniques in Relational Databases

Ranking queries are widely used in data exploration, data analysis and decision making scenarios.

Author: Ihab F. Ilyas

Publisher: Morgan & Claypool Publishers

ISBN: 9781608455676

Category: Computers

Page: 63

View: 829

Ranking queries are widely used in data exploration, data analysis and decision making scenarios. While most of the currently proposed ranking techniques focus on deterministic data, several emerging applications involve data that are imprecise or uncertain. Ranking uncertain data raises new challenges in query semantics and processing, making conventional methods inapplicable. Furthermore, the interplay between ranking and uncertainty models introduces new dimensions for ordering query results that do not exist in the traditional settings. This lecture describes new formulations and processing techniques for ranking queries on uncertain data. The formulations are based on marriage of traditional ranking semantics with possible worlds semantics under widely-adopted uncertainty models. In particular, we focus on discussing the impact of tuple-level and attribute-level uncertainty on the semantics and processing techniques of ranking queries. Under the tuple-level uncertainty model, we describe new processing techniques leveraging the capabilities of relational database systems to recognize and handle data uncertainty in score-based ranking. Under the attribute-level uncertainty model, we describe new probabilistic ranking models and a set of query evaluation algorithms, including sampling-based techniques. We also discuss supporting rank join queries on uncertain data, and we show how to extend current rank join methods to handle uncertainty in scoring attributes. Table of Contents: Introduction / Uncertainty Models / Query Semantics / Methodologies / Uncertain Rank Join / Conclusion
Categories: Computers

Rough Sets Fuzzy Sets Data Mining and Granular Computing

Rough Sets  Fuzzy Sets  Data Mining  and Granular Computing

The models of ranking queries on uncertain data demonstrate the challenges in summarizing different query answers in different possible worlds and some ...

Author: Davide Ciucci

Publisher: Springer

ISBN: 9783642412189

Category: Computers

Page: 398

View: 389

This book constitutes the thoroughly refereed conference proceedings of the 14th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, RSFDGrC 2013, held in Halifax, Canada in October 2013 as one of the co-located conference of the 2013 Joint Rough Set Symposium, JRS 2013. The 69 papers (including 44 regular and 25 short papers) included in the JRS proceedings (LNCS 8170 and LNCS 8171) were carefully reviewed and selected from 106 submissions. The papers in this volume cover topics such as inconsistency, incompleteness, non-determinism; fuzzy and rough hybridization; granular computing and covering-based rough sets; soft clustering; image and medical data analysis.
Categories: Computers

Managing and Mining Uncertain Data

Managing and Mining Uncertain Data

Ranking queries on uncertain data: A probabilistic threshold approach. In SIGMOD, 2008. [14] P. Kalnis, G. Ghinita, K. Mouratidis, and D. Papadias.

Author: Charu C. Aggarwal

Publisher: Springer Science & Business Media

ISBN: 9780387096902

Category: Computers

Page: 494

View: 591

Managing and Mining Uncertain Data, a survey with chapters by a variety of well known researchers in the data mining field, presents the most recent models, algorithms, and applications in the uncertain data mining field in a structured and concise way. This book is organized to make it more accessible to applications-driven practitioners for solving real problems. Also, given the lack of structurally organized information on this topic, Managing and Mining Uncertain Data provides insights which are not easily accessible elsewhere. Managing and Mining Uncertain Data is designed for a professional audience composed of researchers and practitioners in industry. This book is also suitable as a reference book for advanced-level students in computer science and engineering, as well as the ACM, IEEE, SIAM, INFORMS and AAAI Society groups.
Categories: Computers

Web Age Information Management

Web Age Information Management

Ge, T., Zdonik, S., Madden, S.: Top-k queries on uncertain data: On score ... Hua, M., Pei, J., Zhang, W., Lin, X.: Ranking queries on uncertain data: ...

Author: Jianyong Wang

Publisher: Springer

ISBN: 9783642385629

Category: Computers

Page: 812

View: 122

This book constitutes the refereed proceedings of the 14th International Conference on Web-Age Information Management, WAIM 2013, held in Beidaihe, China, in June 2013. The 47 revised full papers presented together with 29 short papers and 5 keynotes were carefully reviewed and selected from a total of 248 submissions. The papers are organized in topical sections on data mining; information integration and heterogeneous systems; big data; spatial and temporal databases; information extraction; new hardware and miscellaneous; query processing and optimization; social network and graphs; information retrieval; workflow systems and service computing; recommender systems; security, privacy, and trust; semantic Web and ontology.
Categories: Computers

Handbook of Big Geospatial Data

Handbook of Big Geospatial Data

... 2008 ACM SIGMOD International Conference on Management of Data, pp 213–226 Lian X, Chen L (2008b) Probabilistic ranked queries in uncertain databases.

Author: Martin Werner

Publisher: Springer Nature

ISBN: 9783030554620

Category: Computers

Page: 641

View: 635

This handbook covers a wide range of topics related to the collection, processing, analysis, and use of geospatial data in their various forms. This handbook provides an overview of how spatial computing technologies for big data can be organized and implemented to solve real-world problems. Diverse subdomains ranging from indoor mapping and navigation over trajectory computing to earth observation from space, are also present in this handbook. It combines fundamental contributions focusing on spatio-textual analysis, uncertain databases, and spatial statistics with application examples such as road network detection or colocation detection using GPUs. In summary, this handbook gives an essential introduction and overview of the rich field of spatial information science and big geospatial data. It introduces three different perspectives, which together define the field of big geospatial data: a societal, governmental, and governance perspective. It discusses questions of how the acquisition, distribution and exploitation of big geospatial data must be organized both on the scale of companies and countries. A second perspective is a theory-oriented set of contributions on arbitrary spatial data with contributions introducing into the exciting field of spatial statistics or into uncertain databases. A third perspective is taking a very practical perspective to big geospatial data, ranging from chapters that describe how big geospatial data infrastructures can be implemented and how specific applications can be implemented on top of big geospatial data. This would include for example, research in historic map data, road network extraction, damage estimation from remote sensing imagery, or the analysis of spatio-textual collections and social media. This multi-disciplinary approach makes the book unique. This handbook can be used as a reference for undergraduate students, graduate students and researchers focused on big geospatial data. Professionals can use this book, as well as practitioners facing big collections of geospatial data.
Categories: Computers

Issues in Artificial Intelligence Robotics and Machine Learning 2013 Edition

Issues in Artificial Intelligence  Robotics and Machine Learning  2013 Edition

Also, ranking queries over uncertain data has become a research focus of uncertain data management.” Our news journalists obtained a quote from the research ...

Author:

Publisher: ScholarlyEditions

ISBN: 9781490105970

Category: Computers

Page: 1209

View: 749

Issues in Artificial Intelligence, Robotics and Machine Learning: 2013 Edition is a ScholarlyEditions™ book that delivers timely, authoritative, and comprehensive information about Expert Systems. The editors have built Issues in Artificial Intelligence, Robotics and Machine Learning: 2013 Edition on the vast information databases of ScholarlyNews.™ You can expect the information about Expert Systems in this book to be deeper than what you can access anywhere else, as well as consistently reliable, authoritative, informed, and relevant. The content of Issues in Artificial Intelligence, Robotics and Machine Learning: 2013 Edition has been produced by the world’s leading scientists, engineers, analysts, research institutions, and companies. All of the content is from peer-reviewed sources, and all of it is written, assembled, and edited by the editors at ScholarlyEditions™ and available exclusively from us. You now have a source you can cite with authority, confidence, and credibility. More information is available at http://www.ScholarlyEditions.com/.
Categories: Computers

Database Systems for Advanced Applications

Database Systems for Advanced Applications

... Semantics of ranking queries for probabilistic data and expected ranks. ... X.: Ranking queries on uncertain data: a probabilistic threshold approach.

Author: Sang-goo Lee

Publisher: Springer

ISBN: 9783642290350

Category: Computers

Page: 320

View: 653

This two volume set LNCS 7238 and LNCS 7239 constitutes the refereed proceedings of the 17th International Conference on Database Systems for Advanced Applications, DASFAA 2012, held in Busan, South Korea, in April 2012. The 44 revised full papers and 8 short papers presented together with 2 invited keynote papers, 8 industrial papers, 8 demo presentations, 4 tutorials and 1 panel paper were carefully reviewed and selected from a total of 159 submissions. The topics covered are query processing and optimization, data semantics, XML and semi-structured data, data mining and knowledge discovery, privacy and anonymity, data management in the Web, graphs and data mining applications, temporal and spatial data, top-k and skyline query processing, information retrieval and recommendation, indexing and search systems, cloud computing and scalability, memory-based query processing, semantic and decision support systems, social data, data mining.
Categories: Computers

Querying And Mining Uncertain Data Streams

Querying And Mining Uncertain Data Streams

However, traditional techniques for deterministic data streams cannot be applied to deal with data uncertainty directly due to the exponential growth of possible solution space.This book provides a comprehensive overview of the authors' ...

Author: Cheqing Jin

Publisher: World Scientific

ISBN: 9789813142923

Category: Computers

Page: 164

View: 907

Data uncertainty widely exists in many applications, and an uncertain data stream is a series of uncertain tuples that arrive rapidly. However, traditional techniques for deterministic data streams cannot be applied to deal with data uncertainty directly due to the exponential growth of possible solution space.This book provides a comprehensive overview of the authors' work on querying and mining uncertain data streams. Its contents include some important discoveries dealing with typical topics such as top-k query, ER-Topk query, rarity estimation, set similarity, and clustering.Querying and Mining Uncertain Data Streams is written for professionals, researchers, and graduate students in data mining and its various related fields.
Categories: Computers

On the Move to Meaningful Internet Systems OTM 2016 Conferences

On the Move to Meaningful Internet Systems  OTM 2016 Conferences

Amarilli, A., Amsterdamer, Y., Milo, T.: Uncertainty in crowd data sourcing ... Hua, M., Pei, J., Zhang, W., Lin, X.: Ranking queries on uncertain data: a ...

Author: Christophe Debruyne

Publisher: Springer

ISBN: 9783319484723

Category: Computers

Page: 977

View: 775

This volume constitutes the refereed proceedings of the Confederated International Conferences: Cooperative Information Systems, CoopIS 2016, Ontologies, Databases, and Applications of Semantics, ODBASE 2016, and Cloud and Trusted Computing, C&TC, held as part of OTM 2016 in October 2016 in Rhodes, Greece. The 45 full papers presented together with 16 short papers were carefully reviewed and selected from 133 submissions. The OTM program every year covers data and Web semantics, distributed objects, Web services, databases, information systems, enterprise workow and collaboration, ubiquity, interoperability, mobility,grid and high-performance computing.
Categories: Computers

Scientific and Statistical Database Management

Scientific and Statistical Database Management

Therefore, rank updates for the query (baby) have to be applied, ... A lot of work was performed in the direction of ranking among uncertain data ...

Author: Judith Bayard Cushing

Publisher: Springer

ISBN: 9783642223518

Category: Computers

Page: 602

View: 847

This book constitutes the refereed proceedings of the 23rd International Conference on Scientific and Statistical Database Management, SSDBM 2011, held in Portland, OR, USA, in July 2011. The 26 long and 12 short papers presented together with 15 posters were carefully reviewed and selected from 80 submissions. The topics covered are ranked search; temporal data and queries; workflow and provenance; querying graphs; clustering and data mining; architectures and privacy; and applications and models.
Categories: Computers

Database Systems for Advanced Applications

Database Systems for Advanced Applications

ACM Transaction on Information System 15(1) Hua, M., Pei, J., Zhang, W., Lin, X.: Ranking queries on uncertain data: a probabilistic threshold approach.

Author: Hiroyuki Kitagawa

Publisher: Springer

ISBN: 9783642120268

Category: Computers

Page: 646

View: 832

This two volume set LNCS 5981 and LNCS 5982 constitutes the refereed proceedings of the 15th International Conference on Database Systems for Advanced Applications, DASFAA 2010, held in Tsukuba, Japan, in April 2010. The 39 revised full papers and 16 revised short papers presented together with 3 invited keynote papers, 22 demonstration papers, 6 industrial papers, and 2 keynote talks were carefully reviewed and selected from 285 submissions. The papers of the first volume are organized in topical sections on P2P-based technologies, data mining technologies, XML search and matching, graphs, spatialdatabases, XML technologies, time series and streams, advanced data mining, query processing, Web, sensor networks and communications, information management, as well as communities and Web graphs. The second volume contains contributions related to trajectories and moving objects, skyline queries, privacy and security, data streams, similarity search and event processing, storage and advanced topics, industrial, demo papers, and tutorials and panels.
Categories: Computers

Database Systems for Advanced Applications

Database Systems for Advanced Applications

Probabilistic Inverse Ranking Queries over Uncertain Data Xiang Lian and Lei Chen Hong Kong University of Science and Technology Clear Water Bay, Kowloon, ...

Author: Xiaofang Zhou

Publisher: Springer

ISBN: 9783642008870

Category: Computers

Page: 797

View: 211

This book constitutes the refereed proceedings of the 14th International Conference on Database Systems for Advanced Applications, DASFAA 2009, held in Brisbane, Australia, in April 2009. The 39 revised full papers and 22 revised short papers presented together with 3 invited keynote papers, 9 demonstration papers, 3 tutorial abstracts, and one panel abstract were carefully reviewed and selected from 186 submissions. The papers are organized in topical sections on uncertain data and ranking, sensor networks, graphs, RFID and data streams, skyline and rising stars, parallel and distributed processing, mining and analysis, XML query, privacy, XML keyword search and ranking, Web and Web services, XML data processing, and multimedia.
Categories: Computers

Handbook of Data Quality

Handbook of Data Quality

Hua M, Pei J, Zhang W, Lin X (2008) Ranking queries on uncertain data: a probabilistic threshold approach. In: Proceedings of the ACM SIGMOD international ...

Author: Shazia Sadiq

Publisher: Springer Science & Business Media

ISBN: 9783642362576

Category: Computers

Page: 438

View: 188

The issue of data quality is as old as data itself. However, the proliferation of diverse, large-scale and often publically available data on the Web has increased the risk of poor data quality and misleading data interpretations. On the other hand, data is now exposed at a much more strategic level e.g. through business intelligence systems, increasing manifold the stakes involved for individuals, corporations as well as government agencies. There, the lack of knowledge about data accuracy, currency or completeness can have erroneous and even catastrophic results. With these changes, traditional approaches to data management in general, and data quality control specifically, are challenged. There is an evident need to incorporate data quality considerations into the whole data cycle, encompassing managerial/governance as well as technical aspects. Data quality experts from research and industry agree that a unified framework for data quality management should bring together organizational, architectural and computational approaches. Accordingly, Sadiq structured this handbook in four parts: Part I is on organizational solutions, i.e. the development of data quality objectives for the organization, and the development of strategies to establish roles, processes, policies, and standards required to manage and ensure data quality. Part II, on architectural solutions, covers the technology landscape required to deploy developed data quality management processes, standards and policies. Part III, on computational solutions, presents effective and efficient tools and techniques related to record linkage, lineage and provenance, data uncertainty, and advanced integrity constraints. Finally, Part IV is devoted to case studies of successful data quality initiatives that highlight the various aspects of data quality in action. The individual chapters present both an overview of the respective topic in terms of historical research and/or practice and state of the art, as well as specific techniques, methodologies and frameworks developed by the individual contributors. Researchers and students of computer science, information systems, or business management as well as data professionals and practitioners will benefit most from this handbook by not only focusing on the various sections relevant to their research area or particular practical work, but by also studying chapters that they may initially consider not to be directly relevant to them, as there they will learn about new perspectives and approaches.
Categories: Computers

Database and Expert Systems Applications

Database and Expert Systems Applications

Hua, M., Pei, J., Zhang, W., Lin, X.: Ranking queries on uncertain data: A probabilistic threshold approach. In: SIGMOD (2008) 11.

Author: Pablo García Bringas

Publisher: Springer Science & Business Media

ISBN: 9783642152504

Category: Computers

Page: 504

View: 899

This book constitutes the refereed proceedings of the 21 International Conference on Database and Expert Systems Applications, DEXA 2010, held in Bilbao, Spain, August 30 - September 3, 2010. The 45 revised full papers and 36 short papers were carefully reviewed and selected from 197 submissions. The papers are organized in topical sections on Data Mining Systems, Parallelism and Query Planning, Data Warehousing and Decision Support Systems, Temporal, Spatial and High Dimensional Databases, Data Mining Algorithms, Information Retrieval, Query Processing and Optimization.
Categories: Computers

Issues in Information Science Information Technology Systems and Security 2012 Edition

Issues in Information Science   Information Technology  Systems  and Security  2012 Edition

Uncertain data is inherent in a few important applications. It is far from trivial to extend ranking queries (also known as top-k queries), a popular type ...

Author:

Publisher: ScholarlyEditions

ISBN: 9781481648158

Category: Computers

Page: 88

View: 660

Issues in Information Science—Information Technology, Systems, and Security: 2012 Edition is a ScholarlyEditions™ eBook that delivers timely, authoritative, and comprehensive information about Database Management. The editors have built Issues in Information Science—Information Technology, Systems, and Security: 2012 Edition on the vast information databases of ScholarlyNews.™ You can expect the information about Database Management in this eBook to be deeper than what you can access anywhere else, as well as consistently reliable, authoritative, informed, and relevant. The content of Issues in Information Science—Information Technology, Systems, and Security: 2012 Edition has been produced by the world’s leading scientists, engineers, analysts, research institutions, and companies. All of the content is from peer-reviewed sources, and all of it is written, assembled, and edited by the editors at ScholarlyEditions™ and available exclusively from us. You now have a source you can cite with authority, confidence, and credibility. More information is available at http://www.ScholarlyEditions.com/.
Categories: Computers

Advances in Spatial and Temporal Databases

Advances in Spatial and Temporal Databases

Ranking queries on uncertain data: a probabilistic threshold approach. In: Proc. of ACM SIGMOD, pp. 673–686 (2008) Schurgers, C., et al.

Author: Dieter Pfoser

Publisher: Springer Science & Business Media

ISBN: 9783642229213

Category: Computers

Page: 518

View: 436

This volume constitutes the refereed proceedings of the 12th International Symposium on Spatial and Temporal Databases, SSTD 2011, held in Minneapolis, USA, in August 2011. The 24 revised full papers presented together with one keynote, 8 short papers, and 8 demonstration papers, were thoroughly reviewed and selected from a total of 63 research submissions, 21 vision and challenges submissions and 16 demonstration submissions. The papers are organized in topical sections on knowledge discovery; spatial networks; access methods; moving objects and sensor networks; multidimensional query processing; and temporal and streaming data.
Categories: Computers

Database Systems for Advanced Applications

Database Systems for Advanced Applications

Cormode et al. have listed a set of properties to describe the semantics of ranking queries on uncertain data, namely exact-k, containment, unique-rank, ...

Author: Jeffrey Xu Yu

Publisher: Springer Science & Business Media

ISBN: 9783642201486

Category: Computers

Page: 583

View: 320

This two volume set LNCS 6587 and LNCS 6588 constitutes the refereed proceedings of the 16th International Conference on Database Systems for Advanced Applications, DASFAA 2011, held in Saarbrücken, Germany, in April 2010. The 53 revised full papers and 12 revised short papers presented together with 2 invited keynote papers, 22 demonstration papers, 4 industrial papers, 8 demo papers, and the abstract of 1 panel discussion, were carefully reviewed and selected from a total of 225 submissions. The topics covered are social network, social network and privacy, data mining, probability and uncertainty, stream processing, graph, XML, XML and graph, similarity, searching and digital preservation, spatial queries, query processing, as well as indexing and high performance.
Categories: Computers