Nodexl Pro Tutorial: Social Network and Content Analysis with Twitter Network Data – Step by Step
Total Page:16
File Type:pdf, Size:1020Kb
NodeXL Pro Tutorial: Social network and content analysis with Twitter network data – step by step Last updated: Februrary 12th, 2019 About this Tutorial About this Tutorial 1. Getting Started 2. Data import 3. Prepare data This tutorial shows you how you can 4. Group by cluster run a full social network and 5. Calculate metrics 6. Time series analysis content analysis with NodeXL Pro. 7. Text/sentiment analysis While we will use Twitter network 8. Network Top Items 9. Autofill columns data as an example, this approach 10. Customize graph can be applied to any network 11. Save the network map dataset of your choice (depending 12. Automation on the available metadata). Literature/Links 2 If you have any questions, please send us an email: [email protected] More NodeXL Pro Tutorials can be found here: https://www.smrfoundation.org/nodexl/tutorials 1. Getting Started About this Tutorial 1. Before downloading network data, save the file to your machine. It 1. Getting Started is helpful to add the data source and date to the name file e.g.: 2. Data import 3. Prepare data Social Network Analysis Twitter 2019-01-25.xlsx 4. Group by cluster 2. Open the Import Data Options window to select basic options 5. Calculate metrics related to the data import: Data > Import > Import Options 6. Time series analysis 7. Text/sentiment analysis 3. Select the options shown below and click OK. 8. Network Top Items 9. Autofill columns 10. Customize graph 11. Save the network map 12. Automation Literature/Links 3. 3 2. 2. Data Import 2. About this Tutorial 3. 1. Getting Started 1. Open the Twitter Search Network importer: 2. Data import Data > Import > From Twitter Search Network… 3. Prepare data 2. Enter a search query of your choice. 4. Group by cluster 4. 5. Calculate metrics 3. Select Basic network. 6. Time series analysis 7. Text/sentiment analysis 4. Limit # of tweets to 1.000. 8. Network Top Items 5. Click OK. 5. 9. Autofill columns 10. Customize graph 6. Wait for approx. 3-5 minutes. 11. Save the network map 12. Automation 7. Save the file after the data download is finished. The Edges and Vertices spreadsheets now contains Literature/Links network data and additional metadata. Take a look at the worksheets and explore the data. 4 2. Data Import You can build advanced search queries with Twitter Standard Search Operators: About this Tutorial https://developer.twitter.com/en/docs/tweets/rules-and-filtering/overview/standard-operators 1. Getting Started 2. Data import 3. Prepare data Standard search: Tesla 4. Group by cluster 5. Calculate metrics Exact phrase: “Tesla Autopilot“ 6. Time series analysis 7. Text/sentiment analysis Exclude term: Tesla –Nikola 8. Network Top Items Boolean query: electric (car OR vehicle) 9. Autofill columns 10. Customize graph User search: @Tesla 11. Save the network map 12. Automation Search by language: Tesla lang:en (fr/de/nl/…) Search by date: Tesla from:2019-02-05 Literature/Links Tesla until:2018-02-12 (!) Test your query before downloading data: https://twitter.com/search-home 5 2. Data Import When you click on Show Graph/Refresh Graph at the top of the graph window, you can already About this Tutorial observe the connected structure of the downloaded raw data. 1. Getting Started 2. Data import 3. Prepare data 4. Group by cluster 5. Calculate metrics 6. Time series analysis 7. Text/sentiment analysis 8. Network Top Items 9. Autofill columns 10. Customize graph 11. Save the network map 12. Automation Literature/Links 6 3. Prepare Data 1. Click on Data > Prepare Data > Count and Merge Duplicate Edges. About this Tutorial 2. Check the box Count duplicate edges and insert the counts into an Edge Weight column and click OK. 1. Getting Started 3. Navigate to column BA Edge Weight that has just been created in the Edges worksheet. This column will 2. Data import be used later to visulize the strength of connections between the vertices. 3. Prepare data 4. Group by cluster 5. Calculate metrics 6. Time series analysis 7. Text/sentiment analysis 8. Network Top Items 1. 9. Autofill columns 10. Customize graph 11. Save the network map 12. Automation Literature/Links 2. 3. 7 4. Group by Cluster 1. 1. Open the Group by cluster window: Analysis > Groups > About this Tutorial Group by Cluster… 2. Select Clauset-Newman-Moore. 1. Getting Started 2. Data import 3. Select Put all neighborless vertices into one group. 3. Prepare data 4. Group by cluster 4. Click OK and wait. 5. Calculate metrics 5. Take a look the Group Vertices and Group Edges 6. Time series analysis worksheets which have just been created. Further, group 7. Text/sentiment analysis related columns have been added to the Edges and Vertices 8. Network Top Items worksheets. 9. Autofill columns 10. Customize graph 6. Click Refresh Graph. Vertex colors and shapes have been 11. Save the network map added automatically. 12. Automation Literature/Links 2. 3. 4. 8 Learn more about the Clauset-Newman-Moore clustering algorithm: A. Clauset, M. E. J. Newman, and C. Moore (2004): Finding community structure in very large networks. In: Phys. Rev. E 70. 5. Calculate Metrics 1. 1. Open the Graph Metrics window: Analysis > Graph Metrics > Graph Metrics About this Tutorial 2. Check the boxes as seen on the right. 1. Getting Started 2. 2. Data import 3. Click Calculate Metrics and wait. 3. Prepare data 4. Have a look at the newly created worksheet Overall 4. Group by cluster Metrics to analyze the composition of the network. 5. Calculate metrics Also take a look at the Vertices and Groups 6. Time series analysis worksheets where the metrics have been added. 7. Text/sentiment analysis 8. Network Top Items 9. Autofill columns 10. Customize graph 11. Save the network map 12. Automation Literature/Links 3. 9 4. 1. 6. Time Series Analysis 1. Open the Graph Metrics window: Analysis > Graph Metrics > Graph Metrics 3. About this Tutorial 2. Deselect everything but Time Series. 2. 1. Getting Started 2. Data import 3. Click on Options to open the Time Series window. 3. Prepare data 4. Select the options as seen on the right. 4. Group by cluster 5. Calculate metrics 5. Click OK and then Calculate Metrics to create the new 6. Time series analysis spreadsheet Time Series. 4. 7. Text/sentiment analysis 8. Network Top Items 9. Autofill columns 10. Customize graph 11. Save the network map 12. Automation Literature/Links 5. 10 1. 7. Text / Sentiment Analysis NodeXL Pro is shipped with a skip word list as well as a 3. positive and a negative sentiment words list in English About this Tutorial language. These lists can be modified to your needs. 2. 1. Getting Started This feature counts the frequency of words and word 2. Data import pairs in a text column. At the same time any word is 3. Prepare data checked for occurence in one of the sentiment lists. 4. Group by cluster 5. Calculate metrics 1. Open the Graph Metrics window: Analysis > Graph 6. Time series analysis Metrics > Graph Metrics 4. 7. Text/sentiment analysis 2. Deselect everything but Words and Word Pairs. 8. Network Top Items 9. Autofill columns 3. Click on Options to open the Word and Word Pair 10. Customize graph Metrics window. 11. Save the network map 12. Automation 4. Choose column Tweet. 5. Optional: Enter your own list of keywords into List 3. Literature/Links 6. Click OK and then Calculate Metrics. 5. 7. Take a look at the two new spreadsheets Words and Word Pairs that have just been created. Further, sentiment related columns have been added to the 6. 11 Edges and Vertices spreadsheets. 1. 8. Network Top Items This feature summarizes the top contents of the network data by collecting the most frequently occuring URLs, domains, hashtags, 3. About this Tutorial words and word pairs from the Edges worksheet. 1. Getting Started 1. Open the Graph Metrics window: Analysis > Graph Metrics > 2. Data import Graph Metrics 3. Prepare data 4. Group by cluster 2. Deselect everything but Network Top Items. 2. 5. Calculate metrics 3. Click on Options to open the Network Top Items window. 6. Time series analysis 7. Text/sentiment analysis 4. Select Tweet to collect top words and word pairs from the 8. Network Top Items Edges worksheet). 9. Autofill columns 10. Customize graph 5. Click on Add… to open the window below. 5. 11. Save the network map 6. Select the column URLs in Tweet, choose the number of items 12. Automation to get and set the column delimiter to Space. Click OK. Literature/Links 4. 6. 12 8. Network Top Items 7. Repeat step six from the previous page for columns Domains in Tweet and Hashtags in Tweet. About this Tutorial 8. When the Network Top Items window looks like the one on the right, 1. Getting Started click OK and then Calculate Metrics. This step may take a few 2. Data import minutes. 3. Prepare data 4. Group by cluster 9. Have a look at the new worksheet Network Top items. 5. Calculate metrics 6. Time series analysis 7. Text/sentiment analysis 8. 8. Network Top Items 9. Autofill columns 10. Customize graph 11. Save the network map 12. Automation Literature/Links 13 9. 9. Autofill Columns About this Tutorial 1. Getting Started 2. Data import Based on the previous calculations, we will now start 3. Prepare data with the visualization. The Autofill Columns feature 4. Group by cluster helps to fill a number of columns with a few clicks. 5. Calculate metrics 6.