Gender classification of twitter data based on textual meta-attributes extraction
dc.contributor.author | Filho J.A.B.L. | |
dc.contributor.author | Pasti R. | |
dc.contributor.author | De Castro L.N. | |
dc.date.accessioned | 2024-03-13T00:55:34Z | |
dc.date.available | 2024-03-13T00:55:34Z | |
dc.date.issued | 2016 | |
dc.description.abstract | © Springer International Publishing Switzerland 2016.With the growth of social media in recent years, there has been an increasing interest in the automatic characterization of users based on the informal content they generate. In this context, the labeling of users in demographic categories, such as age, ethnicity, origin and race, among the investigation of other attributes inherent to users, such as political preferences, personality and gender expression, has received a great deal of attention, especially based on Twitter data. The present paper focuses on the task of gender classification by using 60 textual meta-attributes, commonly used on text attribution tasks, for the extraction of gender expression linguistic cues in tweets written in Portuguese. Therefore, taking into account characters, syntax, words, structure and morphology of short length, multi-genre, content free texts posted on Twitter to classify author's gender via three different machine-learning algorithms as well as evaluate the influence of the proposed meta-attributes in this process. | |
dc.description.firstpage | 1025 | |
dc.description.lastpage | 1034 | |
dc.description.volume | 444 | |
dc.identifier.doi | 10.1007/978-3-319-31232-3_97 | |
dc.identifier.issn | 2194-5357 | |
dc.identifier.uri | https://dspace.mackenzie.br/handle/10899/36106 | |
dc.relation.ispartof | Advances in Intelligent Systems and Computing | |
dc.rights | Acesso Restrito | |
dc.subject.otherlanguage | Classification | |
dc.subject.otherlanguage | Extraction | |
dc.subject.otherlanguage | Gender | |
dc.subject.otherlanguage | Machine-learning | |
dc.subject.otherlanguage | Meta-attributes | |
dc.subject.otherlanguage | Portuguese language | |
dc.subject.otherlanguage | Social media | |
dc.subject.otherlanguage | ||
dc.title | Gender classification of twitter data based on textual meta-attributes extraction | |
dc.type | Artigo de evento | |
local.scopus.citations | 21 | |
local.scopus.eid | 2-s2.0-84961629208 | |
local.scopus.subject | Gender | |
local.scopus.subject | Meta-attributes | |
local.scopus.subject | Portuguese languages | |
local.scopus.subject | Social media | |
local.scopus.subject | ||
local.scopus.updated | 2024-05-01 | |
local.scopus.url | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84961629208&origin=inward |