Show simple item record

dc.contributor.advisorCrick, Christopher John
dc.contributor.authorKaruparthy, Srikanth
dc.date.accessioned2016-04-15T21:49:24Z
dc.date.available2016-04-15T21:49:24Z
dc.date.issued2015-05-01
dc.identifier.urihttps://hdl.handle.net/11244/33423
dc.description.abstractIn online social networks like Twitter, the users usually get inundated with the continuous stream of short messages or tweets. This problem can be handled using classification. Classification is a supervised data mining technique which involves assigning a label to a set of unlabeled objects. A conventional approach for classifying text or tweets is to extract features from the linguistic content posted by the users. A recurrent problem in classification is feature selection, that is, to decide the best set of features for making a particular classification decision among the infinite possible different sets of features. This process usually involves heuristic approaches that require manual feature selection by experts, which involves guesswork, prior information about the dataset and a great deal of tweaking and experimental validation. To address this problem we propose and employ a non-heuristic machine learning approach which will automatically decide the feature set for a classification task. Our analysis shows that our automated feature selection process for Twitter content classification performs on par with current state-of-the-art approaches which incorporate painstaking, time-consuming human effort to manually and heuristically select a feature set. This approach will improve the timeliness and accessibility of data mining social media data streams.
dc.formatapplication/pdf
dc.languageen_US
dc.publisherOklahoma State University
dc.rightsCopyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.titleNon-heurisitc Machine Learning Apprach for Classifying Twitter Content
dc.typetext
dc.contributor.committeeMemberCline, David
dc.contributor.committeeMemberPark, Nohpill
osu.filenameKaruparthy_okstate_0664M_13788.pdf
osu.accesstypeOpen Access
dc.description.departmentComputer Science
dc.type.genreThesis


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record