DSpace Repository

Identifying Duplicate Questions on Quora

dc.contributor.advisor Guha, Sumanta
dc.contributor.author Chennu, Akhileshwar
dc.contributor.other Dung, Phan Minh
dc.contributor.other Bohez, Erik L. J.
dc.date.accessioned 2017-12-12T02:53:45Z
dc.date.available 2017-12-12T02:53:45Z
dc.date.issued 2017-12-07
dc.identifier.other AIT
dc.identifier.uri http://www.cs.ait.ac.th/xmlui/handle/123456789/880
dc.description 40 p. en_US
dc.description.abstract Determining whether two questions ask the same thing can be challenging, as word choice and sentence structure may vary significantly. Some natural language processing techniques have had only limited success in separating related questions from duplicate ones. Quora is a valuable platform for users to exchange knowledge, and it also faces this problem of duplicate questions. Because Quora treats the similar-questions problem as important, it wants to provide a good experience for both question seekers and writers. Using a data set of question pairs provided by Quora on Kaggle, we extract features from the data using methods such as common word share, the Jaccard similarity coefficient, cosine similarity, and TF-IDF. After extracting the features, we use machine learning algorithms to build a model from the training data. We then use this model to obtain the final predicted values for the test data set. en_US
dc.description.sponsorship AIT Fellowship en_US
dc.language.iso en_US en_US
dc.publisher AIT en_US
dc.subject Quora en_US
dc.subject Questions en_US
dc.subject Similarity en_US
dc.title Identifying Duplicate Questions on Quora en_US
dc.type Research report en_US
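
The abstract names four pairwise features (common word share, Jaccard similarity coefficient, cosine similarity, TF-IDF) without giving their definitions. The following is a minimal sketch of how such features might be computed for one question pair, using only the Python standard library. The function names, the whitespace tokenizer, the word-share formula, and the smoothed IDF variant are all assumptions made for illustration; the report's actual definitions may differ.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, strip surrounding punctuation (assumed tokenizer)."""
    tokens = (w.strip(".,?!:;\"'()") for w in text.lower().split())
    return [t for t in tokens if t]


def word_share(q1, q2):
    """Share of tokens, counted over both questions, that also occur in the other question."""
    t1, t2 = tokenize(q1), tokenize(q2)
    if not t1 or not t2:
        return 0.0
    s1, s2 = set(t1), set(t2)
    shared = sum(1 for w in t1 if w in s2) + sum(1 for w in t2 if w in s1)
    return shared / (len(t1) + len(t2))


def jaccard(q1, q2):
    """Jaccard similarity coefficient: |A & B| / |A | B| over the token sets."""
    s1, s2 = set(tokenize(q1)), set(tokenize(q2))
    union = s1 | s2
    return len(s1 & s2) / len(union) if union else 0.0


def idf_weights(corpus):
    """Smoothed inverse document frequency per word (one common variant, assumed here)."""
    n = len(corpus)
    df = Counter()
    for doc in corpus:
        df.update(set(tokenize(doc)))
    return {w: math.log((1 + n) / (1 + c)) + 1.0 for w, c in df.items()}


def tfidf_vector(text, idf):
    """Sparse TF-IDF vector as a dict; words unseen in the corpus get weight 0."""
    tf = Counter(tokenize(text))
    return {w: count * idf.get(w, 0.0) for w, count in tf.items()}


def cosine(v1, v2):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v2.get(w, 0.0) for w, x in v1.items())
    n1 = math.sqrt(sum(x * x for x in v1.values()))
    n2 = math.sqrt(sum(x * x for x in v2.values()))
    if n1 == 0.0 or n2 == 0.0:
        return 0.0
    return dot / (n1 * n2)


def pair_features(q1, q2, idf):
    """Feature dict for one question pair (hypothetical feature names)."""
    return {
        "word_share": word_share(q1, q2),
        "jaccard": jaccard(q1, q2),
        "tfidf_cosine": cosine(tfidf_vector(q1, idf), tfidf_vector(q2, idf)),
    }


if __name__ == "__main__":
    # Made-up example questions, not taken from the Quora data set.
    questions = [
        "How do I learn Python?",
        "How can I learn Python?",
        "What is the capital of France?",
    ]
    idf = idf_weights(questions)
    print(pair_features(questions[0], questions[1], idf))
    print(pair_features(questions[0], questions[2], idf))
```

Feature dictionaries like these, computed for every pair in the training data, could then be passed to a classifier of choice; the abstract does not say which machine learning algorithms the report uses.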
