Approach to Quantify Information in Tweets

Ramineni, Ruchishya

dc.contributor.advisor	George, K. M.
dc.contributor.author	Ramineni, Ruchishya
dc.date.accessioned	2019-10-25T20:25:21Z
dc.date.available	2019-10-25T20:25:21Z
dc.date.issued	2019-05-01
dc.identifier.uri	https://hdl.handle.net/11244/321606
dc.description.abstract	Microblogs such as Twitter play an important role in online social communication. Unlike in traditional media, hot topics and emerging news can become popular within a short time on information-spreading platforms like Twitter. Twitter data is now widely analyzed in many professions. For example, sentiment analysis is a popular approach to opinion mining in which the sentiment values of tweets are classified into weighted classes: positive, negative, or neutral. These signed weights may not be the best basis for analysis in all cases. Information diffusion, defined as the passing of information from person to person, is an alternative method of analysis, and research on it mostly focuses on graph-based models. The edges of the network graph are constructed from either retweet status or hashtags, and information flow is modeled as transmission from node to node, where the nodes are users.
dc.description.abstract	Generally speaking, an analysis of tweets quantifies the information inherent in them. In this research, a new approach is proposed to quantify information in tweets as unsigned weights. The approach is suitable when tweets can be interpreted as conveying an unsigned weight contribution to the problem under analysis. The weight computation method presented in this thesis extracts keywords, called tokens, from tweets and associates weights with them. The weights are interpreted as a quantification of information. Two methods are used to identify tokens. The first uses LDA (Latent Dirichlet Allocation), a topic-modeling technique, to determine tokens and their weights. The second is iterative: it starts with a set of anchor words (keywords), computes a similarity measure between the anchor word set and the words in tweets, and adds more words based on a threshold value of similarity. NMF (Non-negative Matrix Factorization) is used to associate weights with tokens. To compute the weight contribution of a tweet, a formula for its potential is used.
dc.format	application/pdf
dc.language	en_US
dc.rights	Copyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.title	Approach to Quantify Information in Tweets
dc.contributor.committeeMember	Johnson, Thomas P.
dc.contributor.committeeMember	Akbas, Esra
osu.filename	Ramineni_okstate_0664M_16237.pdf
osu.accesstype	Open Access
dc.type.genre	Thesis
dc.type.material	Text
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Oklahoma State University
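
Illustrative sketch (not from the thesis): the abstract's second token-identification method, which grows a keyword set from anchor words using a similarity threshold, could look roughly like the code below. The abstract does not name the similarity measure, so this sketch assumes a Jaccard overlap between the tweets containing a candidate word and the tweets containing any current keyword. The function names `jaccard` and `expand_anchors`, the whitespace tokenizer, and the default threshold are all hypothetical.

```python
from collections import defaultdict


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def expand_anchors(tweets, anchors, threshold=0.5, max_rounds=10):
    """Grow a keyword set from anchor words by co-occurrence similarity.

    Assumption (not from the thesis): a word's similarity to the keyword
    set is the Jaccard overlap between the tweets containing that word
    and the tweets containing any current keyword.
    """
    # Map each lowercased word to the indices of the tweets it appears in.
    postings = defaultdict(set)
    for i, text in enumerate(tweets):
        for word in text.lower().split():
            postings[word].add(i)

    keywords = {a.lower() for a in anchors}
    for _ in range(max_rounds):
        # Tweets covered by at least one current keyword.
        covered = set().union(*(postings[k] for k in keywords if k in postings))
        # Candidate words whose similarity meets the threshold.
        added = {
            w for w, docs in postings.items()
            if w not in keywords and jaccard(docs, covered) >= threshold
        }
        if not added:
            break  # No new words qualified, so the set has stabilized.
        keywords |= added
    return keywords


tweets = [
    "flood warning river",
    "flood river rising",
    "flood river",
    "sunny day",
    "sunny picnic day",
]
keywords = expand_anchors(tweets, ["flood"], threshold=0.6)
```

With these toy tweets and the anchor "flood", only "river" co-occurs often enough to join the keyword set at a threshold of 0.6. Lowering the threshold admits rarer co-occurring words, which is the trade-off the threshold controls in this sketch.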

Files in this item

Name: Ramineni_okstate_0664M_16237.pdf
Size: 1.519Mb
Format: PDF

This item appears in the following Collection(s)

OSU Theses [15752]
