AUT LibraryAUT
View Item 
  •   Open Theses & Dissertations
  • Doctoral Theses
  • View Item
  •   Open Theses & Dissertations
  • Doctoral Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Truth Discovery in Streaming Data and Crowdsourcing Applications

Yang, Yi
Thumbnail
View/Open
Thesis - Ch6 Embargoed (2.667Mb)
Permanent link
http://hdl.handle.net/10292/13402
Metadata
Show full metadata
Abstract
[NOTE: Chapter 6 is embargoed until June 16, 2021]

With the development of Internet and cellular network, it becomes much easier for people to receive information from multiple data sources. However, the data from different sources describing the same entity or object is usually conflicting and erroneous. Therefore, it is important to assess the data veracity, resolve the conflicts and extract the trustworthy information among the multi-source data for the downstream applications.

In this thesis, I focus on the truth discovery models to assess data veracity. Truth discovery is an emerging technique that estimates the most trustworthy information (also known as truth) of each object from the multi-source data. Specifically, a truth discovery model is usually an unsupervised learning model that learns the unknown source reliability from the observed multi-source data to better estimate object truth. This thesis advances truth discovery in applications where data is collected from data streams and crowdsourcing applications, specifically studies how to use object correlation in streaming data truth discovery and how to improve the accuracy and efficiency of streaming data truth discovery. For crowdsourcing applications, the thesis presents two truth discovery models that can better model human behaviors in the truth discovery steps. As most truth discovery methods are unsupervised learning models in which the ground truths of objects are unknown, the thesis also discusses how to use a small set of ground truths to guide the source reliability estimation and develops a semi-supervised truth discovery model to better discover object truths.
Keywords
Truth discovery; Crowdsourcing; Source reliability; Label aggregation
Date
2020
Item Type
Thesis
Supervisor(s)
Bai, Quan; Liu, Qing
Degree Name
Doctor of Philosophy
Publisher
Auckland University of Technology

Contact Us
  • Admin

Hosted by Tuwhera, an initiative of the Auckland University of Technology Library

 

 

Browse

Open Theses & DissertationsTitlesAuthorsDateThesis SupervisorDoctoral ThesesTitlesAuthorsDateThesis Supervisor

Alternative metrics

 

Statistics

For this itemFor all Open Theses & Dissertations

Share

 
Follow @AUT_SC

Contact Us
  • Admin

Hosted by Tuwhera, an initiative of the Auckland University of Technology Library