Show simple item record

dc.contributor.authorHasan, Quazi Mainulen_US
dc.date.accessioned2011-03-03T21:51:34Z
dc.date.available2011-03-03T21:51:34Z
dc.date.issued2011-03-03
dc.date.submittedJanuary 2010en_US
dc.identifier.otherDISS-10927en_US
dc.identifier.urihttp://hdl.handle.net/10106/5428
dc.description.abstractIn Wikipedia, each article represents an entity. Entity can have different types like person, country, school, science etc. Although Wikipedia encapsulates category information for each page, sometimes it is not sufficient to deduce the type of a page just from its categories. But, incorporating the clear type information in a Wikipedia page is very important for the users, as it will help them to explore the pages in more organized way. Hence, in my thesis, we explore different standard classification techniques, mainly Naïve Bayes and Support Vector Machines and experiment how these techniques can be made more effective for typifying Wikipedia articles by using different feature selection methods. We proposed a method where Wikipedia categories are used as features. Moreover, we combine features to build a meta classifier which outperforms the other standard methods. To compare our methods we calculate the accuracy of different methods and used well known data mining tool "WEKA".en_US
dc.description.sponsorshipLi, Chengkaien_US
dc.language.isoenen_US
dc.publisherComputer Science & Engineeringen_US
dc.titleTypifying Wikipedia Articlesen_US
dc.typeM.S.en_US
dc.contributor.committeeChairLi, Chengkaien_US
dc.degree.departmentComputer Science & Engineeringen_US
dc.degree.disciplineComputer Science & Engineeringen_US
dc.degree.grantorUniversity of Texas at Arlingtonen_US
dc.degree.levelmastersen_US
dc.degree.nameM.S.en_US


Files in this item

Thumbnail


This item appears in the following Collection(s)

Show simple item record