信息诊断系统设计思路:人工核查、广众参与和人工智能的三合一运用Theorizing an Information Diagnosis System: The Hybrid Uses of Manual Fact-checking, Crowdsourcing and Artificial Intelligence
曾姿颖,黄煜,张引,宋韵雅,周琳
摘要(Abstract):
当前对虚假新闻治理的研究较少出于技术逻辑的思路。真假新闻的界限在实践中通常微妙以及难以辨别,加上人们对真假新闻往往有着不同的见解和解读。本文基于来自香港的实证数据分析以及本地虚假新闻核查平台的初步实践经验,提出"人工核查—广众参与—人工智能"的协同核查模式,以最大限度优化社交平台虚假新闻的治理效果。
关键词(KeyWords): 虚假新闻;信息诊断;人工智能;事实核查
基金项目(Foundation):
作者(Author): 曾姿颖,黄煜,张引,宋韵雅,周琳
DOI: 10.16602/j.gmj.20210003
参考文献(References):
- 黄金兰、林以正、谢亦泰、程威铨(2012):中文版“语文探索与字词计算”词典之建立,《中华心理学刊》,第54卷第2期,185-201页。
- 史安斌(2020年4月24日):史安斌:疫情后真相、后权威和后情感,《环球时报》,获取自https://opinion.huanqiu.com/article/3xxbigEoaAS。
- Berezow,A.(2017).Should we ban fake health news?American Council on Science and Health.Retrieved from https://www.acsh.org/news/2017/10/17/should-we-ban-fake-health-news-11975.
- Boczkowski,P.(2017).Fake news and the future of journalism.NiemanLab.Retrieved from https://www.niemanlab.org/2016/12/fake-news-and-the-future-of-journalism/.
- Burfoot,C.& Baldwin,T.(2009).Automaticsatire detection:Are you having a laugh?In Proceedings of the ACL-IJCNLP 2009 Conference Short Papers (pp.161-164).Suntec,Singapore:Association for Computational Linguistics.
- Castelo,S.,Almeida,T.,Elghafari,A.,Santos,A.,Pham,K.,Nakamura,E.& Freire,J.(2019).A topic-agnostic approach for identifying fake news pages.In Companion Proceedings of the 2019 World Wide Web Conference (pp.975-980).San Francisco USA:ACM.doi:10.1145/3308560.3316739.
- Castillo,C.,Mendoza,M.& Poblete,B.(2011).Information credibility on twitter.In Proceedings of the 20th International Conference on World Wide Web (pp.675-684).Hyderabad,India:ACM.doi:10.1145/1963405.1963500.
- Ciampaglia,G.L.,Shiralkar,P.,Rocha,L.M.,Bollen,J.,Menczer,F.& Flammini,A.(2015).Computational fact checking from knowledge networks.PLoS One,10(10),e0128193.doi:10.1371/journal.pone.0128193.
- de Marneffe,M.C.,MacCartney,B.& Manning,C.D.(2006).Generating typed dependency parses from phrase structure parses.In Proceedings of the 5th International Conference on Language Resources and Evaluation.Genoa,Italy:European Language Resources Association.
- Draief,M.,Kutzkov,K.,Scaman,K.& Vojnovic,M.(2018).KONG:Kernels for ordered-neighborhood graphs.arXiv preprint arXiv:1805.10014.
- Ferreira,W.& Vlachos,A.(2016).Emergent:A novel data-set for stance classification.In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies (pp.1163-1168).San Diego,California:Association for Computational Linguistics.doi:10.18653/v1/N16-1138.
- Goel,S.,Anderson,A.,Hofman,J.& Watts,D.J.(2016).The structural virality of online diffusion.Management Science,62(1),180-196.doi:10.1287/mnsc.2015.2158.
- Gunther,A.C.& Schmitt,K.(2004).Mapping boundaries of the hostile media effect.Journal of Communication,54(1),55-70.doi:10.1111/j.1460-2466.2004.tb02613.x.
- Gurajala,S.,White,J.S.,Hudson,B.,Voter,B.R.& Matthews,J.N.(2016).Profile characteristics of fake Twitter accounts.Big Data & Society,3(2),1-13.doi:10.1177/2053951716674236.
- Hansson,S.,Orru,K.,Siibak,A.,B?ck,A.,Krüger,M.,Gabel,F.& Morsut,C.(2020).Communication-related vulnerability to disasters:A heuristic framework.International Journal of Disaster Risk Reduction,51,101931.doi:10.1016/j.ijdrr.2020.101931.
- Horne,B.D.& Adal?,S.(2017).This just in:Fake news packs a lot in title,uses simpler,repetitive content in text body,more similar to satire than real news.arXiv preprint arXiv:1703.09398.
- Jhu-Jyun,H.,Yen-Heng,T.,Zi-Ying,C.& You-Chuan,Y.(2020).Using RoBERTa and linguistic features to detect fake news.In The 10th International Conference on Frontier Computing (FC2020) (p.444).
- Jones,J.M.(2018a).Americans:Much misinformation,bias,inaccuracy in news.Gallup.Retrieved from https://news.gallup.com/opinion/gallup/235796/americans-misinformation-bias-inaccuracy-news.aspx.
- Jones,J.M.(2018b).U.S.media trust continues to recover from 2016 low.Gallup.Retrieved from https://news.gallup.com/poll/243665/media-trust-continues-recover-2016-low.aspx.
- Ma,J.,Gao,W.,Mitra,P.,Kwon,S.,Jansen,B.J.,Wong,K.F.& Cha,M.(2016).Detecting rumors from microblogs with recurrent neural networks.In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (pp.3818-3824).New York:AAAI Press.
- Ma,J.,Gao,W.& Wong,K.F.(2017).Detect rumors in Microblog posts using propagation structure via kernel learning.In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1:Long Papers) (pp.708-717).Vancouver,Canada:Association for Computational Linguistics.doi:10.18653/v1/P17-1066.
- Ma,J.,Gao,W.,Joty,S.& Wong,K.F.(2019).Sentence-level evidence embedding for claim verification with hierarchical attention networks.In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp.2561-2571).Florence,Italy:Association for Computational Linguistics.
- Mann,W.C.& Thompson,S.A.(1988).Rhetorical structure theory:Toward a functional theory of text organization.Text,8(3),243-281.doi:10.1515/text.1.1988.8.3.243.
- Mitra,T.& Gilbert,E.(2015).CREDBANK:Alarge-scale social media corpus with associated credibility annotations.In Proceedings of the Ninth International AAAI Conference on Web and Social Media (pp.258-267).Palo Alto,California:AAAI Press.
- Nielsen,R.K.& Graves,L.(2017).“News you don't believe”:Audience perspectives on fake news.Reuters Institute for the Study of Journalism Report.Retrieved from https://reutersinstitute.politics.ox.ac.uk/sites/default/files/2017-10/Nielsen&Graves_factsheet_1710v3_FINAL_download.pdf.
- Pan,J.Z.,Pavlova,S.,Li,C.X.,Li,N.X.,Li,Y.M.& Liu,J.S.(2018).Content based fake news detection using knowledge graphs.In International Semantic Web Conference (pp.669-683).Monterey,CA,USA:Springer.
- Popat,K.,Mukherjee,S.,Yates,A.& Weikum,G.(2018).DeClarE:Debunking fake news and false claims using evidence-aware deep learning.arXiv preprint arXiv:1809.06416.
- Qian,F.,Gong,C.Y.,Sharma,K.& Liu,Y.(2018).Neural user response generator:Fake news detection with collective user intelligence.In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (pp.3834-3840).Stockholm,Sweden.
- Rubin,V.L.,Conroy,N.J.& Chen,Y.M.(2015).Towards news verification:Deception detection methods for news discourse.In Proceedings of the Hawaii International Conference on System Sciences.
- Ruchansky,N.,Seo,S.& Liu,Y.(2017).CSI:A hybrid deep model for fake news detection.In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (pp.797-806).Singapore:ACM.
- Shiang-Jiun,C.,Yi-Wei,M.,Cheng-Mou,Chin-Shen,F.& Wei-Liang,W.(2020).Fake news detection on social media based on the propagation behavior,Taiwan Academic Network Conference (Tanent 2020),P1156.
- Shu,K.,Sliva,A.,Wang,S.H.,Tang,J.L.& Liu,H.(2017).Fake news detection on social media:A data mining perspective.arXiv preprint arXiv:1708.01967.
- Shu,K.,Mahudeswaran,D.,Wang,S.H.,Lee,D.& Liu,H.(2019a).FakeNewsNet:A data repository with news content,social context and spatialtemporal information for studying fake news on social media.arXiv preprint ArXiv:1809.01286.
- Shu,K.,Wang,S.H.& Liu,H.(2019b).Beyond news contents:The role of social context for fake news detection.In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (pp.312-320).Melbourne VIC,Australia:ACM.doi:10.1145/3289600.3290994.
- Shu,K.,Bhattacharjee,A.,Alatawi,F.,Nazer,T.,Ding,K.,Karami,M.& Liu,H.(2020).Combating disinformation in a social media age.arXiv preprint arXiv:2007.07388.
- Sunstein,C.R.(2014).On rumors:How falsehoods spread,why we believe them,and what can be done.Princeton:Princeton University Press.
- Thorne,J.,Vlachos,A.,Christodoulopoulos,C.& Mittal,A.(2018).FEVER:A large-scale dataset for Fact Extraction and VERification.arXiv preprint arXiv:1803.05355.
- Tsang,S.J.(2020a).Issue stance and perceived journalistic motives explain divergent audience perceptions of fake news.Journalism.
- Tsang,S.J.(2020b).Motivated fake news perception:The impact of news sources and policy support on audiences' assessment of news fakeness.Journalism & Mass Communication Quarterly.doi:10.1177/1077699020952129.
- Tsfati,Y.& Ariely,G.(2014).Individual and contextual correlates of trust in media across 44 countries.Communication Research,41(6),760-782.doi:10.1177/0093650213485972.
- Vallone,R.P.,Ross,L.& Lepper,M.R.(1985).The hostile media phenomenon:Biased perception and perceptions of media bias in coverage of the Beirut massacre.Journal of Personality and Social Psychology,49(3),577-585.doi:10.1037/0022-3514.49.3.577.
- Vosoughi,S.,Roy,D.& Aral,S.(2018).The spread of true and false news online.Science,359(6380),1146-1151.doi:10.1126/science.aap9559.
- Wang,W.Y.(2017).“Liar,Liar Pants on Fire”:A new benchmark dataset for fake news detection.arXiv preprint arXiv:1705.00648.
- Zhao,J.,Cao,N.,Wen,Z.,Song,Y.L.,Lin,Y.R.& Collins,C.(2014).#FluxFlow:Visual analysis of anomalous information spreading on social media.IEEE Transactions on Visualization and Computer Graphics,20(12),1773-1782.doi:10.1109/TVCG.2014.2346922.
- Zhou,X.Y.& Zafarani,R.(2019).Network-based fake news detection:A pattern-driven approach.ACM SIGKDD Explorations Newsletter,21(2),48-60.doi:10.1145/3373464.3373473.
- Zhou,X.Y.,Jain,A.,Phoha,V.V.& Zafarani,R.(2020).Fake news early detection:A theory-driven model.Digital Threats:Research and Practice,1(2),Article No.:12.doi:10.1145/3377478.
- Zhou,X.Y.& Zafarani,R.(2020).A survey of fake news:Fundamental theories,detection methods,and opportunities.ACM Computing Surveys,53(5),Article No.:109.doi:10.1145/3395046.