A Framework for Mining Instant Messaging Services

Total Page:16

File Type:pdf, Size:1020Kb

A Framework for Mining Instant Messaging Services A Framework for Mining Instant Messaging Services John Resig Ankur Teredesai Data Mining Research Group Department of Computer Science Rochester Institute of Technology {jer5513,amt}@cs.rit.edu Abstract cation protocols (information packets) are very similar Developing a framework for analysis of large scale mass- to one another. communication media such as instant messaging (popularly Online The user’s client is connected to the known as IM) has gone largely unexplored up until this central server and the user is active point. This paper explores various data mining issues and (currently typing or moving the mouse how they relate to Instant Messaging and current Counter- on his computer). Terrorism efforts. Specific topics include user pattern anal- Offline The user’s client is not connected to the ysis, anomaly detection, limited message size based textual messaging server at this time. topic detection, and largely generic social network analysis Idle The user’s client is connected to the in this context. Several interesting questions are posed and central server, but the user is not active. the current framework being developed explores some of the Additionally, how long a user has been possible solutions. idle can be determined from their status. Away The user is logged on but away from 1 Introduction the station. Sometime users specify a The medium of Instant Messaging on the Internet is a text message that can be viewed by any- well-established means by which users can quickly and one who wishes to get more information effectively communicate with one another. Long utilized about where they are or why they are by the public as a quick form of free communication, away. (e.g. ”Out to lunch.”, ”Watching data mining tasks have not been attempted over Instant TV.”) In fact a user can be either idle, Messaging. Additionally, on a corporate or government or active, while an away message is ex- level, people are just beginning to take notice of the plicitly up. potential that IM provides in terms of the type of information that can be collected from these networks. Table 1: Possible user statuses. As shown above an IM Many large software or internet based corporations client can be in one of the above statuses at a given have started Instant Messaging networks of their own, time. generally open to the public after registration, including Time Warner, Yahoo, and Microsoft. Currently, some Most Instant Messaging networks follow a strict of the most popular Instant Messaging networks are run Client-Server model in which a server (or a cluster by some of the aforementioned companies: of servers) is maintained by a service provider who controls traffic coming to and from the server. Users • AOL Instant Messenger who wish to utilize a certain network generally register themselves with the service provider, then download • Yahoo! Instant Messenger a provider-approved client for use on their network. Using this client, users can connect to the central • MSN Instant Messenger server in order to be able to send and receive messages • Various IRC Networks and collect account information. A friend is generally another registered user (the term friend is server- Interestingly enough, even with all the various networks specific, but exists on almost all messaging networks). being developed by corporations for profit, their physi- The concept is that a user may maintain a Buddy List cal structures (client-server architecture) and communi- under which a listing of their immediate friends may Figure 2: The Proposed IM Mining Framework Figure 1: Existing Instant Messaging Network data generated in turn is very useful for data mining to analyze user behavior. However, in order to utilize the exist. Using this, the server then sends a client updates flow of information offered by these networks, a data based upon the statuses of their friends. Once the collection framework need will have to be established. connection process has completed, the server performs This paper proposes one such framework which has all future communication in the form of Update Packets. been developed. Information distributed by the Instant An update packet is sent from the server to a client Messaging networks can be broken down into two simple whenever an action occurs that is associated with him. groups: User status-change and communication-flow For example, when a friend performs a status change (Instant Messages, Chat Rooms). or if a message is being sent to a user’s client. An The first item collected, user status change, can unfortunate consequence of the server maintaining such be achieved relatively simply as the current structure buddy lists is that it can impose restrictions upon the of Instant Messaging networks support the collection maximum number of friends which a user is allowed to process. One interesting feature, previously discussed, maintain (this number is generally around 200). Since of Instant Messaging networks is that of ’Buddy Lists’ a client does not directly communicate with any other - lists of friends of a user. The direct benefit of connected client, and only the server, the server is this feature is the fact that whenever a buddy (a then in charge of disseminating any potentially useful member of a user’s buddy list) performs a status change, information from one client to another. Once such the client is immediately notified of it by the server. piece of critical information is a user’s status. Table Utilizing this feature set, one could set up a client 1 describes a list of possible statuses that a client can of their own, with an arbitrary buddy list, and begin be in. Status is an attribute generally associated with a collecting information about their ’buddies’ resulting user’s client and often indicates how a user responds to actions. This is significant due to the fact that most an Instant Message. Whenever a user’s status changes, Instant Messaging networks don’t require that someone an update packet is relayed by the central server to actually be a friend of another user in order to watch everyone who has the user on their buddy list. their status changes. Another important aspect of communication flow Using this standard model, it is relatively simple within an Instant Messaging network is the traffic of to set up a tracking client whose only job is to collect messages between users. The amount of information re- pertinent information about users that are on its buddy vealed concerning instant messages is generally limited list - aptly named, in this framework, Tracking Client. to the information which is directly related to a user. In order to maintain a tracking client a Tracking Server Such information paths include chat rooms (a group is constructed which manages the actions of its associ- discussion area where multiple people can communicate ated tracking clients. The Tracking Server marshalls with one another stimultaneously) and private Instant communication between an arbitrary number of track- Messages (messages sent directly from one user to an- ing clients and the database server. Whenever a new other). Tracking Client spawns and connects to the Tracking Server the server attempts to determine which Instant 2 Data Collection Messaging users need to be tracked, from a list of po- Between the various information resources provided by tential users. Due to the restrictions imposed by the Instant Messaging networks, there are a number of various Instant Messaging networks as to the size of a valuable resources available to the average user. The user’s buddy list this distributed Tracking Client struc- ture is required in order to be able to track the maxi- mer of 2003. 207 participants were tracked and 55061 mum number of people at any given time. An advan- unique data packets were received and stored. Two sep- tage to this distributed network is that no one client is arate tracking clients were used to collect results, both dependant upon for all tracking efforts or network band- of which aggregated their data to a single database for width usage. Each Tracking Client within the network later retrieval. watches a given number of other clients in order to ver- Figure 3 shows the probability that a given user ify that they are, in fact, still connected to the network was in a certain state over the course of 10 weeks. It - if not then a communication is sent to the tracking can be quickly surmised that most users have the ability server and another client is spawned to cover the users to maintain a fairly steady record from week-to-week. not being tracked by its disabled peer. As information Additionally, due to the polar differences that some of packets come in from the server to each tracking client, the users seem to exhibit, it becomes apparent that the client attempts to determine if the packet should be the concept of User Profiles is an important step to re-transmitted to the server for storage in the central determining a user’s common course of action (more database. information can be found in Section 5.1 User Pattern Another tracking effort that is currently being ex- Analysis). plored is that of monitoring inter-user communication. One resource offered by most Instant Messages networks Timestamp User ID Status (and exclusively by others, see IRC) is that of a public 70242 68 Online chat room. A tracking client has the ability to con- 70303 118 Online nect to one of these rooms as a spectator, simply to 70325 65 Offline view the flow of conversation. Similar to how the server 70447 68 Idle performed by sending data packets concerning a user’s 70453 16 Idle status change, the server will also send packets detail- 70725 98 Offline ing messages being publicly sent from one user to an- 70743 89 Away other within this chat room setting.
Recommended publications
  • SILC-A SECURED INTERNET CHAT PROTOCOL Anindita Sinha1, Saugata Sinha2 Asst
    ISSN (Print) : 2320 – 3765 ISSN (Online): 2278 – 8875 International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering Vol. 2, Issue 5, May 2013 SILC-A SECURED INTERNET CHAT PROTOCOL Anindita Sinha1, Saugata Sinha2 Asst. Prof, Dept. of ECE, Siliguri Institute of Technology, Sukna, Siliguri, West Bengal, India 1 Network Engineer, Network Dept, Ericsson Global India Ltd, India2 Abstract:-. The Secure Internet Live Conferencing (SILC) protocol, a new generation chat protocol provides full featured conferencing services, compared to any other chat protocol. Its main interesting point is security which has been described all through the paper. We have studied how encryption and authentication of the messages in the network achieves security. The security has been the primary goal of the SILC protocol and the protocol has been designed from the day one security in mind. In this paper we have studied about different keys which have been used to achieve security in the SILC protocol. The main function of SILC is to achieve SECURITY which is most important in any chat protocol. We also have studied different command for communication in chat protocols. Keywords: SILC protocol, IM, MIME, security I.INTRODUCTION SILC stands for “SECURE INTERNET LIVE CONFERENCING”. SILC is a secure communication platform, looks similar to IRC, first protocol & quickly gained the status of being the most popular chat on the net. The security is important feature in applications & protocols in contemporary network environment. It is not anymore enough to just provide services; they need to be secure services. The SILC protocol is a new generation chat protocol which provides full featured conferencing services; additionally it provides security by encrypting & authenticating the messages in the network.
    [Show full text]
  • 14 Investigating Applications Applications
    CNIT 121: Computer Forensics 14 Investigating Applications Applications • Not part of the operating system • User applications • Internet browsers, email clients, office suites, chat programs, and more • Service applications • Web servers, database servers, email servers, etc. Application Data • Many different formats • Text, binary, open- and closed-source • Sometimes independent of the operating system • Most commonly relevant • Email clients, web browsers, and instant messaging clients Where is Application Data Stored? Windows • Default application installation directory • C:\Program Files • C:\Program Files (x86) • For 32-bit apps on a 64-bit OS • Default application data directory • C:\Program Data • C:\Users\username\AppData Windows • Registry uninstall information • HKLM\SOFTWARE\Microsoft\CurrentVersion \Uninstall • Value: InstallLocation • On 64-bit Windows, check HKLM\SOFTWARE \Wow6432Node\Microsoft\Windows \CurrentVersion\Uninstall Windows • Default registry configuration data locations • HKLM\SOFTWARE • On 64-bit systems, HKLM\SOFTWARE \Wow6432Node OS X • Default application installation directory • /Applications • Application user data directory • /Users/username/Library/Application Support Linux • Locations vary based on distribution and customizations • Two ways to locate application data • Manually inspect the file system • Query the package manager Filesystem Hierarchy Standard (FHS) Package Managers • RPM (RPM Package Manager) • Used by Red Hat Enterprise Linux (RHEL), CentOS, OpenSUSE, and Fedora • This command shows all
    [Show full text]
  • Lawsuit Claims Facebook Is Reading Your Private Messages
    Lawsuit Claims Facebook Is Reading Your Private Messages MSN news Facebook could be reading your private messages to help advertisers figure out what you want to buy. Facebook could be reading your private messages to help advertisers figure out what you want to buy, so you know there are definitely going to be users (and lawyers) who aren't happy about that. The company was hit this week with a class action lawsuit alleging it is scanning private messages for links and passing the information along to advertisers, in violation of California's Electronic Communications Privacy Act . The suit, brought by Matthew Campbell and Michael Hurley on behalf of all Facebook users, claims that the sneaky practice is extremely lucrative: “Representing to users that the content of Facebook messages is “private” creates an especially profitable opportunity for Facebook, because users who believe they are communicating on a service free from surveillance are likely to reveal facts about themselves that they would not reveal had they known the content was being monitored. Thus, Facebook has positioned itself to acquire pieces of the users’ profiles that are likely unavailable to other data aggregators”. The suit alleges that Facebook uses private messages to "mine user data and profit from those data by sharing them with third parties – namely, advertisers, marketers, and other data aggregators." Campbell and Hurley relied on information from an independent security researcher that shows that Facebook clicked on links sent through private messages. According to the court filing: “Contrary to its representations, “private” Facebook messages are systematically intercepted by the Company in an effort to learn the contents of the users’ communications.
    [Show full text]
  • Lesson Assessment TEACHER VERSION
    SAFETY > RISKY ONLINE RELATIONSHIPS Lesson Assessment TEACHER VERSION 1. True or false: Most people who ask teens to talk about sexual things online are older adults. a) True b) False Answer feedback The correct answer is b, False. It’s more typical that teens receive requests to talk about sexual things online from other teens or from young adults than from older adults. 2. Mimi just signed up for online chat. How can she avoid getting into risky online chat situations? Read the following choices and then select the best answer. a) Avoid chat during evening and night b) Tell others she meets online that she is older than she actually is c) Avoid flirting with people online d) Tell others she meets online that she is already in a relationship Answer feedback The correct answer is c. People who flirt online, especially with strangers, are more likely to deal with unwanted or risky contact. 3. Dana is chatting with someone she only knows online. She starts to feel uncomfortable about what they’re talking about. Which actions could Dana take to make the situation better? Circle all that apply. a) Stop talking to the person altogether b) Ask a friend to talk to the person and pretend to be her c) Don’t do anything d) Log off of chat for a while e) Keep talking with the person anyway f) Change the subject g) Block the user name Answer feedback The correct answers are a, d, f, and g. If you feel uncomfortable while chatting online, you can try to take the conversation in a new direction.
    [Show full text]
  • Ways to Stay in Touch 5 Ways of Well-Being Game
    Ways to Stay in Touch 5 Ways of Well-being Game There are many apps and online ways to say connected with friends and family while being safe and practicing physical distancing. Face time Have an iPhone, iPad or Mac computer? You can use Apple’s built-in FaceTime app to video chat with any of your contacts Google Duo/hang out You can use the Google Duo app on your Android phone, iOS smartphone, tablet or computer to video chat with up to eight of your friends at a time. It uses your phone number and accesses your phone’s contacts list to let you start creating groups. Google Hangouts is great for your computer. You can group chat and video chat. Marco Polo This easy-to-use app combines the best of texting, social media and video chats on one platform. You can’t use this in real time. You can send video messages to individuals or groups that they can watch and respond to at their leisure. Skype Skype messaging, as well as video and audio calling, was created for both one-on-one and group conversations, and you can use it on your mobile phone; PCs and Macs; Xboxes and Alexa. It’s free to send messages, and to conduct audio and video calls with groups of up to 50 people (paying a nominal fee also enables you to call mobile and landline phones or send online text messages). Facebook Messager & WhatsApp You can send instant messages, photos or videos, stickers, GIF files, emojis and more.
    [Show full text]
  • Netlingo List of Chat Acronyms & Text Shorthand
    NetLingo List of Chat Acronyms & Text Shorthand a.k.a. Internet acronyms, text message jargon, abbreviations, initialisms, cyberslang, leetspeak, SMS code, textese With hundreds of millions of people texting regularly, it's no wonder you've seen this cryptic looking code! Commonly used wherever people get online -- including IMing, SMSing, cell phones, Blackberries, PDAs, Web sites, games, newsgroup postings, in chat rooms, on blogs, or on social media -- these abbreviations are used by people around the world to communicate with each other. NetLingo is also tracking a global list of worldwide text terms and international online jargon! • !I have a comment • *$Starbucks • **//it means wink wink, nudge nudge • ,!!!!Talk to the hand • 02Your (or my) two cents worth, also seen as m.02 • 10QThank you • 1174Nude club • 121One to one • 1337Elite -or- leet -or- L337 • 14it refers to the fourteen words • 143I love you • 1432I Love You Too • 14AA41One for All and All for One • 182I hate you • 187it means murder/ homicide • 190 hand • 1dafulit means wonderful • 2it means to, too, two • 20Location • 24/7Twenty Four Seven, as in all the time • 2bTo be • 2B or not 2BTo Be Or Not To Be • 2b@To Be At • 2BZ4UQTToo Busy For You Cutey • 2B~not2BTo be or not to be • 2d4To die for • 2dayToday • 2DLooToodle oo • 2G2B4GToo Good To Be Forgotten • 2G2BTToo Good To Be True • 2moroTomorrow • 2niteTonight • 2QTToo Cute • 2U2To You Too • 303Mom • 4For, Four • 404I haven't a clue • 411Information • 420Marijuana • 459I love you • 4COLFor Crying Out Loud • 4eForever
    [Show full text]
  • Instant Messaging Software Offers a Tantalizing Proposition: Combine the Speed of Real-Time Conversation with the Functionality of E-Mail
    Members Name ID # Andrew Hiew Antoni Kowalczewski X Introduction Instant messaging software offers a tantalizing proposition: Combine the speed of real-time conversation with the functionality of e-mail. Send quick notes to friends and colleagues; attach files, URLs, and even invitations to join group chat without creating large e-mail databases. The promise: Communicate, and collaborate, faster, better, and cheaper than ever before. Instant messaging has undergone some of the most explosive growth of any method of communication in history, and according to a recent Ziff-Davis article (1), “Instant messaging, by some accounts, is growing faster than any communications medium in history. AOL says its three instant-messaging services have nearly 80 million users, and crossed the 50 million mark in less than 2 1/2 years -- compared with five years for the Internet and 13 years for television.” It is plainly obvious that this technology is here to stay – it’s just a matter of time before it becomes as common as a telephone or even email. What makes instant messaging so appealing? X What is it? Presence & Instant Messaging software (or IM for short) allow people to communicate in real time over the internet, at speeds an order of magnitude faster than the next-fastest mode of communication, email. IM software allows users to create and maintain lists of contacts, friends, and acquaintances, and allows them to know exactly when they are using the Internet (or at the very least, when their friends have the IM application running). A quick glance at one’s IM client tells you exactly which of your friends are online, and what their status is – for example, in ICQ, users can be in one of many different modes – for example: Away, Available, Extended Away, Available To Chat, Busy, Do Not Disturb, etc.
    [Show full text]
  • Thinkuknow Parents Guide to Snapchat
    Thinkuknow Parents and Carers guide to Snapchat Snapchat Snapchat is a picture sharing app that has gained considerable popularity with young people. Thinkuknow have created a guide for parents and carers to help them understand the functions and features of Snapchat and ways to help young people stay safe if they are using the app. What is it? Snapchat is a mobile application that allows users to create, edit and share pictures and videos . Images sent on Snapchat are often referred to as ‘snaps’. Snapchat’s tag line is , “life’s more fun when you live in the moment”. Thus, Snapchat focuses on capturing ‘moments’ quickly; users take images and videos and send them instantly. The unique feature of Snapchat is that images disappear with seconds. Snapchat users can specify how many seconds they would like their images to be seen by the recipient(s). This can range from 1 to 10 seconds. After the chosen time, the image will ‘disappear‘ from the recipient’s view. However, this does not necessarily mean that images are permanently deleted. Images can be screen shot by other users or recorded using another device. Snapchat alerts users when a screenshot has been taken, however this is not always 100% accurate. Snapchat has a range of filters and fun effects that can be applied to images and videos. Snapchat users can use ‘geofilters’ – these are overlays that include graphics and picture effects that show where a person is in the world. This uses the phone’s geo-location data. Images and videos can be sent directly to other users via Snapchat.
    [Show full text]
  • PARENT SURVIVAL GUIDE: Social Media Tips & Tricks to Monitor Your Child's Online Activity
    PARENT SURVIVAL GUIDE: Social Media Tips & tricks to monitor your child's online activity M O N I T O R I N G C O N S I D E R A T I O N S F O R P A R E N T S Know who your child is connecting with online and what apps/programs they are accessing Talk about appropriate online behavior expectations and the consequences of inappropriate behavior Request passwords to review accounts on a random basis Hold youth accountable for their behavior online Review your child's posts and the posts of their friends Stay up to date on the latest social media & internet trends; they are quickly outdated Take time to learn the social media apps your kids are using Ask your child about their activity online and have them teach you about what they are doing M O N I T O R I N G A P P S Net Norton Nanny Family Kaspersky Qustodio OurPact FamilyTime Premier Safe Kids Parents can call their phone carrier to ask about available monitoring apps. This is not an exhaustive list of monitoring apps; parents are encouraged to do their own review based on the needs of their families. R E S O U R C E S F O R P A R E N T S To stay informed about social media & internet activity https://smartsocial.com https://www.fosi.org https://www.connectsafely.org/ https://savvycyberkids.org/ Federal Trade Commission - OnGuard Online Department of Human Services - Stop.Think.Connect Internet Matters - Social Media Guides V A U L T A P P S Apps that are used to hide other apps that may be restricted by parents.
    [Show full text]
  • Best Free Talk and Text App for Android
    Best Free Talk And Text App For Android Is Paulo nominal or unearned when curtsies some danders elopes trancedly? Is Barney indeclinable or morbidlyfriskiest after and reversedeventuating Giavani his ignobleness loges so stellately? narrow-mindedly Multijugate and and issuably. longhand Westley overbuild Turn off this free internet connection can often seen an option you agree that they also be in one of space on your number apps enable users. Spice up to call app supports regular landline. Get started with your notes has a notification directly if you through facebook messenger alternatives, tell is perfect for businesses for. Of several reasons why do i am not on their app and video calling minutes are your computer or annoying if you to make calls using multiple free? You can talk to app free talk and best text for android ecosystem in maintaining order yours from asia for anyone in the best chat. Relying on any personal. Combined with a specific consents were under one person you can also includes fewer features does just call. Google voice chat with visual voicemail transcription software which would definitely not. Zello worked with the likes of communication? Start to send emoji and business systems to make calls and telemarketers call a field can we pay! And exclusive deals, you can have conversations are the. Available for several years ago, but live your app free and best talk and clients without internet connection, this app to new technology in disaster such as well. Another audio communication technology site is here do it may disclose your dead skin makes your content of free talk about secret chat with your.
    [Show full text]
  • Workspace Desktop Edition User's Guide
    Workspace Desktop Edition User's Guide Handle A Facebook Interaction 10/2/2021 Handle A Facebook Interaction Handle A Facebook Interaction In this lesson, you will learn how to handle inbound Facebook interactions, how to check the spelling of your messages before you send, and other Facebook-specific functionality. This lesson contains the following sections: • Receiving and Handling an Inbound Facebook Interaction • Facebook-Specific Functionality • Workspace Functionality About Facebook Facebook is a social networking service and website. Users must register before using the site, after which they may create a personal profile, add other users as friends, and exchange messages, including automatic notifications, when they update their profile. Businesses can use Facebook to reach their target customers, and choose their audience by location, age and interests. Businesses can also promote their Facebook page or website, use Facebook's "Like" button to increase their advertising influence, and build a community around their business. A plug-in for Workspace enables it to handle Facebook social media interactions. This lesson contains information about how to handle Facebook interactions. Receiving and Handling an Inbound Facebook Interaction If you are the selected internal target of an inbound Facebook interaction, you receive a notice on your workstation desktop. You can decide to accept or reject the interaction. If you accept it, the Facebook Interaction window is displayed. The Facebook Interaction window contains information about the interaction and the controls that you need to complete the interaction. Lesson: Handling an inbound Facebook interaction Purpose: To handle an inbound Facebook interaction properly. Prerequisites • You are logged in to Workspace (see Lesson: Logging in to Workspace).
    [Show full text]
  • WEAPONS of MASS DISTRACTION: Foreign State-Sponsored Disinformation in the Digital Age
    WEAPONS OF MASS DISTRACTION: Foreign State-Sponsored Disinformation in the Digital Age MARCH 2019 PARK ADVISORS | Weapons of Mass Distraction: Foreign State-Sponsored Disinformation in the Digital Age Authored by Christina Nemr and William Gangware Acknowledgements The authors are grateful to the following subject matter experts who provided input on early drafts of select excerpts: Dr. Drew Conway, Dr. Arie Kruglanski, Sean Murphy, Dr. Alina Polyakova, and Katerina Sedova. The authors also appreciate the contributions to this paper by Andrew Rothgaber and Brendan O’Donoghue of Park Advisors, as well as the editorial assistance provided by Rhonda Shore and Ryan Jacobs. This report was produced with support from the US Department of State’s Global Engagement Center. Any views expressed in this report are those of the authors and do not necessarily reflect the views of the US State Department, Park Advisors, or its subject matter expert consultants. Any errors contained in this report are the authors’ alone. PARK ADVISORS | Weapons of Mass Distraction: Foreign State-Sponsored Disinformation in the Digital Age 0. Table of Contents 01 Introduction and contextual analysis 04 How do we define disinformation? 06 What psychological factors drive vulnerabilities to disinformation and propaganda? 14 A look at foreign state-sponsored disinformation and propaganda 26 Platform-specific challenges and efforts to counter disinformation 39 Knowledge gaps and future technology challenges PARK ADVISORS | Weapons of Mass Distraction: Foreign State-Sponsored Disinformation in the Digital Age 1 Introduction and 1. contextual analysis On July 12, 2014, viewers of Russia’s main state-run television station, Channel One, were shown a horrific story.
    [Show full text]