Análise adaptativa de fluxos de sentimento

Ismael Santana Silva

Análise adaptativa de fluxos de sentimento

dc.creator	Ismael Santana Silva
dc.date.accessioned	2019-08-13T17:19:45Z
dc.date.accessioned	2025-09-09T00:45:22Z
dc.date.available	2019-08-13T17:19:45Z
dc.date.issued	2012-03-26
dc.description.abstract	In recent years, the sentiment analysis task has attracted the interest of the machine learning researchers. This interest has grown significantly due to the large volume of opinionated content generated and shared via social media. Considering the benefits of to know the sentiment of the population regarding to different topics and entities, the analysis of the generated content by social media, it is a promising and necessary task. Many automatic classification techniques have been used to perform sentiment analysis, however is consensus that the arrival pattern of messages from social media follows the data stream paradigm and the traditional classification techniques are not adequate to address the specific characteristics of the created sentiment stream. Among the challenges to the classification techniques can be highlighted: (1) concept drift (i.e., constant changes in data characteristics, which in this study was approached as sentiment drift), (2) the need of real-time update of the classification model from the most recent messages and (3) the limitation of computing and training resource, which makes the two firsts cited challenges more difficult.We analyze these problems from a proposal of semi-supervised learning. Our algorithm adapts the training set, to the changes in the data, from a self-augmenting training process with the passes of the stream. It uses a small seed of initial training and then classification models are produced in real time using association rules. This strategy keeps the model up-to-date incrementally, so that at any time of the event the model reflects the sentiment that is being transmitted. In order to address the sentiment drift, messages to training are projected on-demand, according to the message content that is being classified. Projection of the training data offers a number of advantages including the ability to quickly detect emerging trends in the information stream. We conducted a case study using the Twitter messages, posted in real time, related to major events in 2010 year. In these experiments the performance of the prediction keeps the same or increases, with the passes of the stream and the inclusion of new messages in the training set. We evaluated the proposed solution in different languages, in cases where the sentiment distribution changed in different way over time and in cases where the initial training seed is rather small.
dc.identifier.uri	https://hdl.handle.net/1843/ESBF-8SVFPH
dc.language	Português
dc.publisher	Universidade Federal de Minas Gerais
dc.rights	Acesso Aberto
dc.subject	Análise de sentimento
dc.subject	Influência (Psicologia) Redes de relações sociais
dc.subject	Computação
dc.subject	Redes de relações sociais
dc.subject.other	Concept Drift
dc.subject.other	Análise de sentimento
dc.subject.other	Aprendizado de máquina
dc.subject.other	Fluxo de dados
dc.subject.other	Mídias Sociais
dc.title	Análise adaptativa de fluxos de sentimento
dc.type	Dissertação de mestrado
local.contributor.advisor-co1	Adriano Alonso Veloso
local.contributor.advisor1	Renato Antonio Celso Ferreira
local.contributor.referee1	Gisele Lobo Pappa
local.contributor.referee1	Nivio Ziviani
local.contributor.referee1	Wagner Meira Junior
local.description.resumo	Nos últimos anos, a tarefa de análise de sentimentos tem atraído o interesse de pesquisadores de aprendizado de máquina. Esse interesse tem crescido significativamente devido o grande volume de conteúdo opinativo gerado e compartilhado através das mídias sociais online. Considerando os benefícios de conhecer o sentimento da população em relação a diferentes tópicos e entidades, a análise deste conteúdo, gerado pelas mídias sociais, se faz uma tarefa promissora e necessária. Muitas técnicas de classificação automática têm sido utilizadas para realizar a análise de sentimento, contudo é consenso que o modelo de chegada de mensagens a partir de mídias sociais segue o paradigma de fluxo de dados e as técnicas de classificação tradicionais não estão adequadas para tratar as características especificas deste fluxo de sentimento que é criado. Entre os desafios impostos às técnicas classificação podemos destacar: (1) o concept drift (i.e., constantes mudanças nas características dos dados, que neste trabalho foi abordado como sentiment drift), (2) a necessidade de atualização em tempo real do modelo de classificação a partir de mensagens mais recentes e (3) a limitação de tempo de computação e dados para treinamento, o que torna ainda mais difícil os dois primeiros desafios citados.
local.publisher.initials	UFMG

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1

Nome:: ismaelsantanasilva.pdf
Tamanho:: 5.04 MB
Formato:: Adobe Portable Document Format

Baixar

Coleções

Pós-Graduação em Ciência da Computação - Dissertações