Show simple item record

dc.contributor.advisorK.M, George
dc.contributor.authorKammarpally, Prashanth Reddy
dc.date.accessioned2016-09-29T18:40:46Z
dc.date.available2016-09-29T18:40:46Z
dc.date.issued2015-07-01
dc.identifier.urihttps://hdl.handle.net/11244/45270
dc.description.abstractTwitter is a powerful real-time micro-blogging service and it is a platform where users provide and obtain information, called tweets, at a rapid pace. Because of the volume, velocity, and unstructured nature of tweets, Twitter data can be viewed as big data. In this thesis we study the veracity of tweets using oil industry related tweets. Previous research has shown that most of the tweets posted on twitter are truthful. But the same platform (Twitter) is also used often to spread misinformation intentionally or unintentionally. There is no definitive measures to determine the veracity of tweets based on the tweets themselves. So there is a need for better mechanisms to measure levels of accuracy from tweets.In this thesis, we propose three measures to estimate the veracity/accuracy of topics based on analysis of tweets. They are topic diffusion, geographic dispersion, and spam rate. We collect tweets associated to topics. Using the tweets we compute the measures and estimate the veracity of topics. Reliable geographic dispersion data was not available in our data set and hence it is not used in validation process. To validate measures, we verity the tweeted information using official data. For this study we streamed oil industry data. Several topics were identified for our analysis. In the case of each topic, tweets unrelated to the topic are considered noise. After noise elimination, tweets are classified according to company names, then the proposed measures are computed. The results are compared against the verification results. In majority of cases, the estimates of veracity of topics by the proposed measures are confirmed by the verification results.
dc.formatapplication/pdf
dc.languageen_US
dc.rightsCopyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.titleCase Study on Veracity in Twitter Data Using Oil Company Related Tweets
dc.typetext
dc.contributor.committeeMemberNophill, Park
dc.contributor.committeeMemberChan-Tin, Eric
osu.filenameKammarpally_okstate_0664M_14262.pdf
osu.accesstypeOpen Access
dc.description.departmentComputer Science
dc.type.genreThesis


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record