A keyword extraction method from twitter messages represented as graphs

Abilhoa W.D.; De Castro L.N.

dc.contributor.author	Abilhoa W.D.
dc.contributor.author	De Castro L.N.
dc.date.accessioned	2024-03-13T01:00:22Z
dc.date.available	2024-03-13T01:00:22Z
dc.date.issued	2014
dc.description.abstract	Twitter is a microblog service that generates a huge amount of textual content daily. All this content needs to be explored by means of text mining, natural language processing, information retrieval, and other techniques. In this context, automatic keyword extraction is a highly useful task. A fundamental step in text mining consists of building a model for text representation. The vector space model (VSM) is the best-known and most widely used of these models. However, some difficulties and limitations of VSM, such as scalability and sparsity, motivate the proposal of alternative approaches. This paper proposes a keyword extraction method for tweet collections, Twitter Keyword Graph (TKG), that represents texts as graphs and applies centrality measures to find the relevant vertices (keywords). To assess the performance of the proposed approach, three different sets of experiments are performed. The first experiment applies TKG to a text from Time magazine and compares its performance with results reported in the literature. The second set of experiments takes tweets from three different TV shows, applies TKG, and compares it with TF-IDF and KEA, using human classifications as benchmarks. Finally, these three algorithms are applied to tweet sets of increasing size, and their running times are measured and compared. Altogether, these experiments provide a general overview of how TKG can be used in practice, how it performs compared with other standard approaches, and how it scales to larger data instances. The results show that TKG is a novel and robust approach for extracting keywords from texts, particularly from short messages such as tweets. © 2014 Elsevier Inc. All rights reserved.
dc.description.firstpage	308
dc.description.lastpage	325
dc.description.volume	240
dc.identifier.doi	10.1016/j.amc.2014.04.090
dc.identifier.issn	0096-3003
dc.identifier.uri	https://dspace.mackenzie.br/handle/10899/36375
dc.relation.ispartof	Applied Mathematics and Computation
dc.rights	Restricted access
dc.subject.otherlanguage	Centrality measures
dc.subject.otherlanguage	Graph theory
dc.subject.otherlanguage	Keyword extraction
dc.subject.otherlanguage	Knowledge discovery
dc.subject.otherlanguage	Text mining
dc.subject.otherlanguage	Twitter data
dc.title	A keyword extraction method from twitter messages represented as graphs
dc.type	Article
local.scopus.citations	97
local.scopus.eid	2-s2.0-84901273966
local.scopus.subject	Centrality measures
local.scopus.subject	Keyword extraction
local.scopus.subject	Natural language processing
local.scopus.subject	Text mining
local.scopus.subject	Text mining techniques
local.scopus.subject	Text representation
local.scopus.subject	Twitter data
local.scopus.subject	Vector space models
local.scopus.updated	2024-05-01
local.scopus.url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84901273966&origin=inward
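
The abstract describes representing texts as graphs and ranking vertices by a centrality measure. The following is a minimal sketch of that general idea only, not the authors' TKG implementation: the function names (`build_cooccurrence_graph`, `top_keywords`), the choice to link each token to its immediate successor, the use of plain degree centrality, and the omission of preprocessing such as stopword removal are all assumptions made for illustration.

```python
import re
from collections import defaultdict


def build_cooccurrence_graph(messages):
    """Build an undirected graph whose vertices are tokens.

    Assumption for this sketch: an edge links each token to the token
    that immediately follows it in the same message.
    """
    graph = defaultdict(set)
    for message in messages:
        tokens = re.findall(r"[a-z0-9#@']+", message.lower())
        for left, right in zip(tokens, tokens[1:]):
            if left != right:  # skip self-loops from repeated tokens
                graph[left].add(right)
                graph[right].add(left)
    return graph


def top_keywords(messages, k=3):
    """Return the k tokens with the highest degree (number of neighbours).

    Ties are broken alphabetically so the result is deterministic.
    """
    graph = build_cooccurrence_graph(messages)
    ranked = sorted(graph, key=lambda token: (-len(graph[token]), token))
    return ranked[:k]


if __name__ == "__main__":
    sample = [
        "graph methods rank keywords",
        "keywords from graph models",
        "graph centrality finds keywords",
    ]
    print(top_keywords(sample, k=2))
```

Degree is only one possible centrality measure; any other measure could be swapped in by changing the sort key in `top_keywords`.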

Collections

Journal articles
