Uma análise de fatores que influenciam interações entre usuários do twitter

Giovanni Ventorim Comarela

Please use this identifier to cite or link to this item: http://hdl.handle.net/1843/ESBF-8UZHY8

Full metadata record

DC Field	Value	Language
dc.contributor.advisor1	Virgilio Augusto Fernandes Almeida	pt_BR
dc.contributor.referee1	Adriano Alonso Veloso	pt_BR
dc.contributor.referee2	Artur Ziviani	pt_BR
dc.creator	Giovanni Ventorim Comarela	pt_BR
dc.date.accessioned	2019-08-12T12:37:29Z	-
dc.date.available	2019-08-12T12:37:29Z	-
dc.date.issued	2012-06-01	pt_BR
dc.identifier.uri	http://hdl.handle.net/1843/ESBF-8UZHY8	-
dc.description.abstract	In information networks where users send messages to one another, the issue of information overload naturally arises: which are the most important messages? In this work we study the problem of understanding the importance of messages in Twitter. We approach this problem in two stages. First, we perform an extensive characterization of a very large Twitter data set which includes all users, social relations, and messages posted from the beginning of the service up to August 2009. We show evidence that information overload is present: users sometimes have to search through hundreds of messages to find those that are interesting to reply or retweet. We then identify factors that influence user response or retweet probability: previous responses to the same tweeter, the tweeter\\\'s sending rate, the age and some basic text elements of the tweet. In our second stage, we show that some of these factors can be used to improve the ordering of tweets as presented to the user. First, by inspecting user activity over time, we construct a simple on-off model of user behavior that allows us to infer when a user is actively using Twitter. Then, we explore two methods from machine learning for ranking tweets: a Naive Bayes predictor and a Support Vector Machine classifier. We show that it is possible to reorder tweets to increase the fraction of replied or retweeted messages appearing in the first positions of the list by as much as 60%.	pt_BR
dc.description.resumo	Nesta dissertação estuda-se o problema de entender interações entre usuários na rede de informação Twitter. O problema é abordado em duas etapas: primeiro, é realizada uma caracterização extensiva de uma grande coleção de dados, através da qual, identifica-se por exemplo que algumas vezes os usuários passam por centenas de mensagens até encontrarem alguma que tem interesse em interagir. Estes resultados motivam a identificação de fatores que influenciam as probabilidades de respostas e compartilhamento de mensagens no Twitter. Na segunda etapa, utilizando algoritmos de aprendizado de máquina, mostra-se que alguns destes fatores podem ser utilizados para melhorar o mecanismo usual de apresentação de mensagens. Estes algoritmos são avaliados através de estudos de simulação, os quais mostram que a fração de mensagens respondidas e compartilhadas próximas ao topo da lista de mensagens dos usuários cresce em até 60%.	pt_BR
dc.language	Português	pt_BR
dc.publisher	Universidade Federal de Minas Gerais	pt_BR
dc.publisher.initials	UFMG	pt_BR
dc.rights	Acesso Aberto	pt_BR
dc.subject	twitter	pt_BR
dc.subject	Interações	pt_BR
dc.subject	Redes sociais online	pt_BR
dc.subject.other	Influência (Psicologia) Redes de relações sociais	pt_BR
dc.subject.other	Computação	pt_BR
dc.subject.other	Redes de relações sociais	pt_BR
dc.title	Uma análise de fatores que influenciam interações entre usuários do twitter	pt_BR
dc.type	Dissertação de Mestrado	pt_BR
Appears in Collections:	Dissertações de Mestrado

Files in This Item:

File	Description	Size	Format
giovannicomarela.pdf		1.65 MB	Adobe PDF	View/Open

Show simple item record