dc.contributor.advisor | Li, Chengkai | |
dc.creator | Shingavi, Ankit Anil | |
dc.date.accessioned | 2019-02-07T15:57:16Z | |
dc.date.available | 2019-02-07T15:57:16Z | |
dc.date.created | 2018-08 | |
dc.date.issued | 2018-09-17 | |
dc.date.submitted | August 2018 | |
dc.identifier.uri | http://hdl.handle.net/10106/27666 | |
dc.description.abstract | Several applications deploy the use of large entity graphs. Given the entirety of its application scope, it is challenging to select a single entity graph for a particular need from numerous data sources. For a comprehensible overview of the entity graph, we may project a preview table for compact representation of an entity graph. Each preview table represents a single entity type in the dataset. We need to find the representative entities for a given entity type from the entity graph to show the coverage of a dataset. In this paper, we propose a method to find representative entities for a given entity type from the entity graph. Each entity of the same type is represented by a multi-dimensional label vector using neighborhood nodes. We apply the k-means clustering algorithm on the generated label vectors of the same entity type. The clustering algorithm divides a set of entities into k disjoint clusters. The nearest entity to the centroid of each cluster is used as the representative entity for the given entity type. We have performed experiments on the Freebase dataset, based off of which, we got diverse and important representative entities for the tv, film and location domain. We can use these representative entities in the generation of preview tables. This helps the data worker understand the coverage of a particular entity type in the dataset. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.subject | Representative entities | |
dc.subject | Entity similarity | |
dc.subject | Graph mining | |
dc.subject | Entity graph | |
dc.title | Finding Representative Entities From Entity Graph By Using Neighborhood Based Entity Similarity | |
dc.type | Thesis | |
dc.degree.department | Computer Science and Engineering | |
dc.degree.name | Master of Science in Computer Science | |
dc.date.updated | 2019-02-07T15:57:17Z | |
thesis.degree.department | Computer Science and Engineering | |
thesis.degree.grantor | The University of Texas at Arlington | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science in Computer Science | |
dc.type.material | text | |