Abstract
Fake news is a major threat to global democracy resulting in diminished trust in government, journalism and civil society. The public popularity of social media and social networks has caused a contagion of fake news where conspiracy theories, disinformation and extreme views flourish. Detection and mitigation of fake news is one of the fundamental problems of our times and has attracted widespread attention. While fact checking websites such as snopes, politifact and major companies such as Google, Facebook, and Twitter have taken preliminary steps towards addressing fake news, much more remains to be done. As an interdisciplinary topic, various facets of fake news have been studied by communities as diverse as machine learning, databases, journalism, political science and many more.
The objective of this tutorial is two-fold. First, we wish to familiarize the database community with the efforts by other communities on combating fake news. We provide a panoramic view of the state-of-the-art of research on various aspects including detection, propagation, mitigation, and intervention of fake news. Next, we provide a concise and intuitive summary of prior research by the database community and discuss how it could be used to counteract fake news. The tutorial covers research from areas such as data integration, truth discovery and fusion, probabilistic databases, knowledge graphs and crowdsourcing from the lens of fake news. Effective tools for addressing fake news could only be built by leveraging the synergistic relationship between database and other research communities. We hope that our tutorial provides an impetus towards such synthesis of ideas and the creation of new ones.
-
H. Allcott and M. Gentzkow. Social media and fake news in the 2016 election.
Journal of economic perspectives
, 31(2):211--36, 2017.
Google Scholar
-
V. Bakir and A. McStay. Fake news and the economy of emotions: Problems, causes, solutions.
Digital Journalism
, 6(2):154--175, 2018.
Google Scholar
Cross Ref
-
S. Bharathi, D. Kempe, and M. Salek. Competitive influence maximization in social networks. In
International workshop on web and internet economics
, pages 306--311. Springer, 2007.
Google Scholar
Digital Library
-
A. Borodin, Y. Filmus, and J. Oren. Threshold models for competitive influence in social networks. In
International workshop on internet and network economics
, pages 539--550. Springer, 2010.
Google Scholar
Digital Library
-
C. Castillo, M. Mendoza, and B. Poblete. Information credibility on twitter. In
WWW
, pages 675--684. ACM, 2011.
Google Scholar
Digital Library
-
G. L. Ciampaglia, P. Shiralkar, L. M. Rocha, J. Bollen, F. Menczer, and A. Flammini. Computational fact checking from knowledge networks.
PloS one
, 10(6):e0128193, 2015.
Google Scholar
Cross Ref
-
S. Cohen, C. Li, J. Yang, and C. Yu. Computational journalism: A call to arms to database researchers. 2011.
Google Scholar
-
X. Dong, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, K. Murphy, T. Strohmann, S. Sun, and W. Zhang. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In
KDD
, pages 601--610. ACM, 2014.
Google Scholar
Digital Library
-
X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: the role of source dependence.
PVLDB
, 2(1):550--561, 2009.
Google Scholar
Digital Library
-
A. Halevy, A. Rajaraman, and J. Ordille. Data integration: the teenage years. In
PVLDB
, pages 9--16. VLDB Endowment, 2006.
Google Scholar
Digital Library
-
W. Hamilton, P. Bajaj, M. Zitnik, D. Jurafsky, and J. Leskovec. Embedding logical queries on knowledge graphs. In
Advances in Neural Information Processing Systems
, pages 2026--2037, 2018.
Google Scholar
Digital Library
-
N. Hassan, G. Zhang, F. Arslan, J. Caraballo, D. Jimenez, S. Gawsane, S. Hasan, M. Joseph, A. Kulkarni, A. K. Nayak, et al. Claimbuster: The first-ever end-to-end fact-checking system.
PVLDB
, 10(12):1945--1948, 2017.
Google Scholar
Digital Library
-
Z. Jin, J. Cao, Y. Zhang, and J. Luo. News verification by exploiting conflicting social viewpoints in microblogs. In
AAAI
, 2016.
Google Scholar
Digital Library
-
J. Kim, B. Tabibian, A. Oh, B. Schölkopf, and M. Gomez-Rodriguez. Leveraging the crowd to detect and reduce the spread of fake news and misinformation. In
WSDM
, pages 324--332. ACM, 2018.
Google Scholar
Digital Library
-
D. M. Lazer, M. A. Baum, Y. Benkler, A. J. Berinsky, K. M. Greenhill, F. Menczer, M. J. Metzger, B. Nyhan, G. Pennycook, D. Rothschild, et al. The science of fake news.
Science
, 359(6380):1094--1096, 2018.
Google Scholar
Cross Ref
-
Y. Li, J. Gao, C. Meng, Q. Li, L. Su, B. Zhao, W. Fan, and J. Han. A survey on truth discovery.
ACM Sigkdd Explorations Newsletter
, 17(2):1--16, 2016.
Google Scholar
Digital Library
-
Y. Lin, Z. Liu, M. Sun, Y. Liu, and X. Zhu. Learning entity and relation embeddings for knowledge graph completion. In
Twenty-ninth AAAI conference on artificial intelligence
, 2015.
Google Scholar
Digital Library
-
N. Mele, D. Lazer, M. Baum, N. Grinberg, L. Friedland, K. Joseph, W. Hobbs, and C. Mattsson. Combating fake news: An agenda for research and action, 2017.
Google Scholar
-
R. Pastor-Satorras and A. Vespignani. Immunization of complex networks.
Physical review E
, 65(3):036104, 2002.
Google Scholar
-
G. Pennycook and D. Rand. Crowdsourcing judgments of news source quality.
SSRN. com
, 2018.
Google Scholar
-
K. Popat, S. Mukherjee, J. Strötgen, and G. Weikum. Where the truth lies: Explaining the credibility of emerging claims on the web and social media. In
WWW
, pages 1003--1012, 2017.
Google Scholar
Digital Library
-
C. Shao, G. L. Ciampaglia, O. Varol, A. Flammini, and F. Menczer. The spread of fake news by social bots.
arXiv preprint arXiv:1707.07592
, pages 96--104, 2017.
Google Scholar
-
B. Shi and T. Weninger. Discriminative predicate path mining for fact checking in knowledge graphs.
Knowledge-based systems
, 104:123--133, 2016.
Google Scholar
Digital Library
-
B. Shi and T. Weninger. Proje: Embedding projection for knowledge graph completion. In
Thirty-First AAAI Conference on Artificial Intelligence
, 2017.
Google Scholar
Digital Library
-
P. Shiralkar, A. Flammini, F. Menczer, and G. L. Ciampaglia. Finding streams in knowledge graphs to support fact checking. In
2017 IEEE International Conference on Data Mining (ICDM)
, pages 859--864. IEEE, 2017.
Google Scholar
Cross Ref
-
K. Shu, H. R. Bernard, and H. Liu. Studying fake news via network analysis: detection and mitigation. In
Emerging Research Challenges and Opportunities in Computational Social Network Analysis and Mining
, pages 43--65. Springer, 2019.
Google Scholar
Cross Ref
-
K. Shu, A. Sliva, S. Wang, J. Tang, and H. Liu. Fake news detection on social media: A data mining perspective.
ACM SIGKDD Explorations Newsletter
, 19(1):22--36, 2017.
Google Scholar
Digital Library
-
E. C. Tandoc Jr, Z. W. Lim, and R. Ling. Defining "fake news" a typology of scholarly definitions.
Digital Journalism
, 6(2):137--153, 2018.
Google Scholar
Cross Ref
-
S. Tschiatschek, A. Singla, M. Gomez Rodriguez, A. Merchant, and A. Krause. Fake news detection in social networks via crowd signals. In
WWW
, pages 517--524. WWW, 2018.
Google Scholar
Digital Library
-
S. Vosoughi, D. Roy, and S. Aral. The spread of true and false news online.
Science
, 359(6380):1146--1151, 2018.
Google Scholar
Cross Ref
-
J. Widom. Trio: A system for integrated management of data, accuracy, and lineage. Technical report, Stanford InfoLab, 2004.
Google Scholar
-
L. Wu and H. Liu. Tracing fake-news footprints: Characterizing social media messages by how they propagate. In
WSDM
, pages 637--645. ACM, 2018.
Google Scholar
Digital Library
-
X. Zhou and R. Zafarani. Fake news: A survey of research, detection methods, and opportunities.
arXiv preprint arXiv:1812.00315
, 2018.
Google Scholar
-
Published in