A multi-label, semi-supervised classification approach applied to personality prediction in social media

Lima A.C.E.S.; de Castro L.N.

dc.contributor.author	Lima A.C.E.S.
dc.contributor.author	de Castro L.N.
dc.date.accessioned	2024-03-13T01:01:54Z
dc.date.available	2024-03-13T01:01:54Z
dc.date.issued	2014
dc.description.abstract	Social media allow web users to create and share content on different subjects, exposing their activities, opinions, feelings and thoughts. In this context, online social media have attracted the interest of data scientists seeking to understand behaviours and trends whilst collecting statistics for social sites. One potential application for these data is personality prediction, which aims to understand a user's behaviour within social media. Traditional personality prediction relies on users' profiles, their status updates, the messages they post, etc. Here, a personality prediction system for social media data is introduced that differs from most approaches in the literature in that it works with groups of texts, instead of single texts, and does not take users' profiles into account. The proposed approach also extracts meta-attributes from texts rather than working directly with the content of the messages. The set of possible personality traits is taken from the Big Five model, which allows the problem to be characterised as a multi-label classification task. The problem is then transformed into a set of five binary classification problems, one per trait, and solved by means of a semi-supervised learning approach, owing to the difficulty of annotating the massive amounts of data generated in social media. In our implementation, the proposed system was trained with three well-known machine-learning algorithms, namely a Naïve Bayes classifier, a Support Vector Machine, and a Multilayer Perceptron neural network. The system was applied to predict personality from Tweets taken from three datasets available in the literature, and achieved approximately 83% prediction accuracy, with some personality traits presenting better individual classification rates than others. © 2014 Elsevier Ltd.
dc.description.firstpage	122
dc.description.lastpage	130
dc.description.volume	58
dc.identifier.doi	10.1016/j.neunet.2014.05.020
dc.identifier.issn	1879-2782
dc.identifier.uri	https://dspace.mackenzie.br/handle/10899/36461
dc.relation.ispartof	Neural Networks
dc.rights	Restricted Access
dc.subject.otherlanguage	Big Five
dc.subject.otherlanguage	Multi-label classification
dc.subject.otherlanguage	Personality
dc.subject.otherlanguage	Semi-supervised learning
dc.subject.otherlanguage	Social media
dc.subject.otherlanguage	Twitter
dc.title	A multi-label, semi-supervised classification approach applied to personality prediction in social media
dc.type	Article
local.scopus.citations	89
local.scopus.eid	2-s2.0-84906075994
local.scopus.subject	Big five
local.scopus.subject	Multi-label classifications
local.scopus.subject	Personality
local.scopus.subject	Semi-supervised learning
local.scopus.subject	Social media
local.scopus.subject	Twitter
local.scopus.subject	Algorithms
local.scopus.subject	Artificial Intelligence
local.scopus.subject	Bayes Theorem
local.scopus.subject	Humans
local.scopus.subject	Neural Networks (Computer)
local.scopus.subject	Personality
local.scopus.subject	Social Media
local.scopus.subject	Support Vector Machines
local.scopus.updated	2024-05-01
local.scopus.url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84906075994&origin=inward
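
The abstract describes splitting the multi-label Big Five task into five binary problems and training them with partially labelled data. The sketch below illustrates that general idea only and is not the authors' implementation: it assumes self-training as the semi-supervised scheme (the abstract does not name one), swaps in a toy nearest-centroid classifier for the Naïve Bayes, SVM and MLP learners, and uses made-up two-dimensional "meta-attribute" vectors. All names (`CentroidClassifier`, `self_train`, `fit_binary_relevance`, `predict_traits`, the `margin` threshold) are hypothetical.

```python
from statistics import mean

# Big Five traits; each one becomes an independent binary problem.
TRAITS = ["openness", "conscientiousness", "extraversion",
          "agreeableness", "neuroticism"]


def centroid(rows):
    """Column-wise mean of a list of feature vectors."""
    return [mean(col) for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


class CentroidClassifier:
    """Toy binary classifier (stand-in learner, not from the paper)."""

    def fit(self, X, y):
        # Assumes both classes (0 and 1) occur in the training labels.
        self.c = {lab: centroid([x for x, t in zip(X, y) if t == lab])
                  for lab in (0, 1)}
        return self

    def predict_with_margin(self, x):
        d0, d1 = sq_dist(x, self.c[0]), sq_dist(x, self.c[1])
        return (1 if d1 < d0 else 0), abs(d0 - d1)


def self_train(X_lab, y_lab, X_unlab, margin=1.0, rounds=5):
    """Self-training: pseudo-label confident unlabelled points, then refit."""
    X, y, pool = list(X_lab), list(y_lab), list(X_unlab)
    clf = CentroidClassifier().fit(X, y)
    for _ in range(rounds):
        remaining = []
        for x in pool:
            label, m = clf.predict_with_margin(x)
            if m >= margin:          # confident: adopt the pseudo-label
                X.append(x)
                y.append(label)
            else:                    # uncertain: keep for a later round
                remaining.append(x)
        if len(remaining) == len(pool):
            break                    # nothing new was labelled
        pool = remaining
        clf = CentroidClassifier().fit(X, y)
    return clf


def fit_binary_relevance(X_lab, Y_lab, X_unlab):
    """Train one semi-supervised binary model per trait."""
    return {trait: self_train(X_lab, [row[i] for row in Y_lab], X_unlab)
            for i, trait in enumerate(TRAITS)}


def predict_traits(models, x):
    """Return a 0/1 decision for every trait."""
    return {trait: clf.predict_with_margin(x)[0]
            for trait, clf in models.items()}


# Made-up data: 4 labelled and 3 unlabelled feature vectors.
X_lab = [[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.1]]
Y_lab = [[0, 0, 0, 0, 1], [0, 0, 0, 0, 1],
         [1, 1, 1, 1, 0], [1, 1, 1, 1, 0]]
X_unlab = [[0.1, 0.2], [1.1, 0.9], [0.5, 0.5]]

models = fit_binary_relevance(X_lab, Y_lab, X_unlab)
print(predict_traits(models, [0.05, 0.05]))
```

Because each trait gets its own model, a text group can receive any combination of the five labels, which is what makes the overall task multi-label.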

Collections

Journal articles