Social media mining

Social Media Mining is the process of representing, analyzing, and extracting actionable patterns from social media data. Social Media Mining, introduces basic concepts and principal algorithms suitable for investigating massive social media data; it discusses theories and methodologies from different disciplines such as computer science, data mining, machine learning, social network analysis, network science, sociology, ethnography, statistics, optimization, and mathematics. It encompasses the tools to formally represent, measure, model, and mine meaningful patterns from large-scale social media data.[1]

Background

As defined by Kaplan and Haenlein,[2] social media is the “group of internet-based applications that build on the ideological and technological foundations of Web 2.0, and that allow the creation and exchange of user-generated content.” There are many categories of social media including, but not limited to, social networking (Facebook or LinkedIn), microblogging (Twitter), photo sharing (Flickr, Photobucket, or Picasa), news aggregation (Google reader, StumbleUpon, or Feedburner), video sharing (YouTube, MetaCafe), livecasting (Ustream or Twitch.tv), virtual worlds (Kaneva), social gaming (World ofWarcraft), social search (Google, Bing, or Ask.com), and instant messaging (Google Talk, Skype, or Yahoo! messenger).

The first social media site was introduced by Geocities in 1994, which allowed users to create their own homepages. The first social networking site, SixDegree.com, was introduced in 1997. Since then, many other social media sites have been introduced, each providing service to millions of people. These individuals form a virtual world in which individuals (social atoms), entities (content, sites, etc.) and interactions (between individuals, between entities, between individuals and entities) coexist. Social norms and human behavior govern this virtual world. By understanding these social norms and models of human behavior and combining them with the observations and measurements of this virtual world, one can systematically analyze and mine social media. Social media mining is the process of representing, analyzing, and extracting meaningful patterns from data in social media, resulting from social interactions. It is an interdisciplinary field encompassing techniques from computer science, data mining, machine learning, social network analysis, network science, sociology, ethnography, statistics, optimization, and mathematics. Social media mining faces grand challenges such as the big data paradox, obtaining sufficient samples, the noise removal fallacy, and evaluation dilemma.

Social media mining represents the virtual world of social media in a computable way, measures it, and designs models that can help us understand its interactions. In addition, social media mining provides necessary tools to mine this world for interesting patterns, analyze information diffusion, study influence and homophily, provide effective recommendations, and analyze novel social behavior in social media.

Research

Research Areas

Publication Venues

Social media mining research articles are published in computer science, social science, and data mining conferences and journals:

Conferences

Conference papers can be found in proceedings of Knowledge Discovery and Data Mining (KDD), World Wide Web (WWW), Association for Computational Linguistics (ACL), Conference on Information and Knowledge Management (CIKM), International Conference on Data Mining (ICDM), Internet Measuring Conference (IMC).

Journals

Social media mining is also present on many data management/database conferences such as the ICDE Conference, SIGMOD Conference and International Conference on Very Large Data Bases

See also

Methods
Application domains
Related topics

References

  1. Zafarani, Reza; Abbasi, Mohammad Ali; Liu, Huan (2014). "Social Media Mining: An Introduction". Retrieved 15 November 2014.
  2. Kaplan, Andreas M.; Haenlein, Michael (2010). "Users of the world, unite! The challenges and opportunities of social media". Business Horizons 53 (1).
  3. Tang, Jiliang; Tang, Jie; Liu, Huan (2014). "Recommendation in Social Media - Recent Advances and New Frontiers". In Proceedings of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.
  4. Tang, Jiliang; Hu, Xia; Liu, Huan (2013). "Social Recommendation: A Review". Social Network Analysis and Mining.
  5. Horowitz, Damon; Kamvar, Sepandar (2013). "The Anatomy of a Large-Scale Social Search Engine". In Proceedings of the 19th international conference on World wide web, pp. 431-440. ACM, 2010.
  6. Hu, Xia; Tang, Lei; Tang, Jiliang; Liu, Huan (2013). "Exploiting Social Relations for Sentiment Analysis in Microblogging". In Proceedings of the 6th ACM International Conference on Web Search and Data Mining.
  7. Hu, Xia; Tang, Jiliang; Gao, Huiji; Liu, Huan (2013). "Unsupervised Sentiment Analysis with Emotional Signals". In Proceedings of the 22nd International World Wide Web Conference.
  8. Hu, Xia; Tang, Jiliang; Zhang, Yanchao; Liu, Huan (2013). "Social Spammer Detection in Microblogging". In Proceedings of the 23rd International Joint Conference on Artificial Intelligence.
  9. Hu, Xia; Tang, Jiliang; Liu, Huan (2014). "Online Social Spammer Detection". In Proceedings of the 28th AAAI Conference on Artificial Intelligence.
  10. Hu, Xia; Tang, Jiliang; Liu, Huan (2014). "Leveraging Knowledge across Media for Spammer Detection in Microblogging". In Proceedings of the 37th Annual ACM SIGIR Conference.
  11. Hu, Xia; Tang, Jiliang; Gao, Huiji; Liu, Huan (2014). "Social Spammer Detection with Sentiment Information". In Proceedings of the IEEE International Conference on Data Mining.
  12. Tang, Jiliang; Liu, Huan (2012). "Feature Selection with Linked Data in Social Media". In Proceedings of SIAM International Conference on Data Mining.
  13. Tang, Jiliang; Liu, Huan (2014). "Feature Selection for Social Media Data". ACM Transactions on Knowledge Discovery from Data (TKDD),8(4) Pages 19:1-19:27.
  14. Tang, Jiliang; Liu, Huan (2012). "Unsupervised Feature Selection for Linked Social Media Data". In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
  15. Tang, Jiliang; Liu, Huan (2014). "Unsupervised Feature Selection for Linked Social Media Data". IEEE Transactions on Knowledge and Data Engineering (TKDE), 26(12): 2914-1927.
  16. Tang, Jiliang; Liu, Huan (2014). "Trust in Social Computing". In Proceedings of the 23rd International World Wide Web Conference.
  17. Tang, Jiliang; Gao, Huiji; Liu, Huan (2012). "mTrust: Discerning Multi-Faceted Trust in a Connected World". the 5th ACM International Conference on Web Search and Data Mining.
  18. Tang, Jiliang; Gao, Huiji; DasSarma, Atish; Liu, Huan (2012). "eTrust: Understanding Trust Evolution in an Online World". In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
  19. Tang, Jiliang; Gao, Huiji; Hu, Xia; Liu, Huan (2013). "Exploiting Homophily Effect for Trust Prediction". the 6th ACM International Conference on Web Search and Data Mining.
  20. Tang, Jiliang; Hu, Xia; Liu, Huan (2014). "Is Distrust the Negation of Trust? The Value of Distrust in Social Media". In Proceedings of ACM Hypertext conference.
  21. Tang, Jiliang; Hu, Xia; Chang, Yi; Liu, Huan (2014). "Predictability of Distrust with Interaction Data". ACM International Conference on Information and Knowledge Management.
  22. Tang, Jiliang; Chang, Shiyu; Aggarwal, Charu; Liu, Huan (2015). "Negative Link Prediction in Social Media". In Proceedings ofACM International Conference on Web Search and Data Mining.
  23. Bruno, Nicola (2011). "Tweet first, verify later? How real-time information is changing the coverage of worldwide crisis events". Oxford: Reuters Institute for the Study of Journalism, University of Oxford 10: 2010–2011.
  24. Sakaki, Takashi; Okazaki, Makoto; Yutaka, Matsuo (2010). "Earthquake shakes Twitter users: real-time event detection by social sensors". Proceedings of the 19th international conference on World wide web: 851–860.
  25. Mendoza, Marcelo; Poblete, Barbara; Castillo, Carlos (2010). "Twitter under crisis: Can we trust what we RT?". Proceedings of the first workshop on social media analytics: 71–79.
  26. Kumar, Shamanth; Barbier, Geoffrey; Abbasi, Mohammad Ali; Liu, Huan (2011). "TweetTracker: An Analysis Tool for Humanitarian and Disaster Relief". The 5th International AAAI Conference on Weblogs and Social Media. Retrieved 1 December 2014.
  27. Kumar, Shamanth; Hu, Xia; Liu, Huan (2014). "A behavior analytics approach to identifying tweets from crisis regions". Proceedings of the 25th ACM conference on Hypertext and social media: 255–260.
  28. Gao, Huiji; Tang, Jiliang; Liu, Huan (2012). "Exploring Social-Historical Ties on Location-Based Social Networks". In Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media.
  29. Gao, Huiji; Tang, Jiliang; Liu, Huan (2012). "Mobile Location Prediction in Spatio-Temporal Context". Nokia Mobile Data Challenge Workshop 2012.
  30. Gao, Huiji; Tang, Jiliang; Liu, Huan (2012). "gSCorr: Modeling Geo-Social Correlations for New Check-ins on Location-Based Social Networks". In Proceedings of the 21st ACM International Conference on Information and Knowledge Management.
  31. Gao, Huiji; Tang, Jiliang; Hu, Xia; Liu, Huan (2013). "Exploring Temporal Effects for Location Recommendation on Location-Based Social Networks". In Proceedings of the 7th ACM Recommender Systems Conference.
  32. Gao, Huiji; Tang, Jiliang; Hu, Xia; Liu, Huan (2014). "Content-Aware Point of Interest Recommendation on Location-Based Social Networks". In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence.
  33. Gao, Huiji; Tang, Jiliang; Liu, Huan (2014). "Personalized Location Recommendation on Location-based Social Networks". In Proceedings of the 8th ACM Recommender Systems Conference.
  34. Barbier, Geoffrey; Feng, Zhuo; Gundecha, Pritam; Liu, Huan (2013). "Provenance Data in Social Media". Synthesis Lectures on Data Mining and Knowledge Discovery.
  35. Gundecha, Pritam; Feng, Zhuo; Liu, Huan (2013). "Seeking Provenance of Information in Social Media". In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management Conference.
  36. Gundecha, Pritam; Barbier, Geoffrey; Tang, Jiliang; Liu, Huan (2014). "User Vulnerability and its Reduction on a Social Networking Site". Journals of Transactions on Knowledge Discovery from Data.

External links

Wikimedia Commons has media related to Social media mining.