People2vec: Learning latent representations of users using their social-media activities

By Kathleen Carley, Kumar Sumeet
International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation | July 2018

Citation

Carley, Kathleen., Sumeet, Kumar. People2vec: Learning latent representations of users using their social-media activities International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation .

International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation, 2018

Abstract

In most social network studies, it is assumed that nodes are simple and carry no information, and links are explicit ties such as friendship. Which nodes are in which group is determined as a function of these explicit ties. For example, given a set of random walks through the network, it is possible to learn a vector for each node which contains a latent representation of the node. These latent representations have useful properties that can be easily exploited by statistical models for tasks like identifying groups and inferring implicit links. However, most existing representation learning methods ignore node attributes. In many cases, there is a rich body of information and events associated with nodes that also can be used for node clustering and to infer ties. In social media, e.g., an explicit relationship is friendship, and another is the follower-followee relation. Besides, there are the set of messages passed by the users, as well as, their activities in the form of liking or mentioning. What is needed is a way of collectively using both the explicit ties and this rich body of additional information in learning these latent node representations. Combining such data should enable more effective link inference and grouping strategies. In this research, we propose People2Vec an algorithm to learn representations that takes into account proximity between users due to their social media activities. We validate our model by experiments on two different social-media datasets and find the model to perform better than prior state-of-the-art approaches.

Sumeet Kumar is an Assistant Professor of Information Systems at the Indian School of Business (ISB). He studies problems at the intersection of technology and society. He is interested in analysing user behaviour, quantifying polarisation on online forums , and finding advertisements disguised as regular content on online platforms. His current focus is on identifying implicit or hidden advertisements in videos posted on children’s platforms such as YouTube Kids.

Additionally, Professor Kumar has conducted research in software design and development, with particular emphasis on user experience. He has investigated the use of mobile phone sensors during emergencies to improve situational awareness. His study on the Wireless Emergency Alerts (WEA) service in the United States addressed several issues of critical importance to emergency alerts effectiveness and adoption. Notably, some of his research recommendations was included in the US Federal Communications Commission (FCC) proposed changes to WEA.

He completed his undergraduate education at Indian Institute of Technology (IIT) Kanpur. He holds two Master’s degrees—in Software Engineering and in Machine Learning--both from Carnegie Mellon University, where he also earned his doctorate degree.

More publications by author

Sumeet Kumar