{"id":837,"date":"2015-07-16T10:00:06","date_gmt":"2015-07-16T10:00:06","guid":{"rendered":"http:\/\/localhost\/?p=837"},"modified":"2018-04-30T16:15:57","modified_gmt":"2018-04-30T15:15:57","slug":"displaying-data-analysing-text","status":"publish","type":"post","link":"https:\/\/visualagency.com\/?p=837","title":{"rendered":"Displaying Data: Analysing Text"},"content":{"rendered":"<p>Language is our primary form of communication, as the words we write or speak are used as tools to transfer meaning. \u00a0But what meaning can data visualisation give about words? \u00a0Can data visualisation reveal patterns and insights into large bodies of text or is it used simply as a medium to produce art?<\/p>\n<p>In this post for our ongoing series on Displaying Data, I&#8217;ll be looking at ways bodies of text have been visualised.<\/p>\n<h2>Word Clouds<\/h2>\n<p>Also known as a &#8220;Tag Cloud&#8221;, this form of visualising text displays how frequently words appear in a body of text, by displaying a cluster of words in which\u00a0they are all\u00a0sized in proportion to their frequency. \u00a0You can see an example of this below, where I&#8217;ve used the text Plato&#8217;s Republic as an example:<\/p>\n<p><img decoding=\"async\" class=\"aligncenter\" src=\"https:\/\/visu.al\/wp-content\/uploads\/2015\/07\/wordcloud-01.png\" alt=\"analysing text\" \/><\/p>\n<p>You can see words like &#8220;Yes&#8221;, &#8220;one&#8221; and &#8220;good&#8221; stand out the most, which shows that these words have been used and repeated the most in the book.<\/p>\n<p>While this is the typical format for Word Clouds, they&#8217;re not limited to clusters\/clouds. \u00a0The words can also be arranged in layouts other than a cloud cluster: on horizontal lines, columns or within a shape. Also, the size of the words can be in proportion to another variable assigned to them, not\u00a0just by their frequency. \u00a0A good example of this would be to display a Word Cloud with all the World&#8217;s countries and have the size of the country&#8217;s name in proportion to its population size or GDP.<\/p>\n<p>The downside to Word Clouds are that longer words and words that contain many ascenders and descenders are given more emphasis. \u00a0Also World Clouds are not great for any analytical accuracy and are therefore more for aesthetic use.<\/p>\n<p>Word Clouds are not so popular anymore, as they have a bad reputation of being tacky and cheesy. However, below are some great examples of where Word Clouds have been combined with other methods of displaying data visualisation to produce some interesting results.<\/p>\n<h2 class=\"p1\">US presidential inauguration speeches: how does Obama&#8217;s second compare?<\/h2>\n<p>Working for The Guardian, Santiago Ortiz created\u00a0a visualisation that\u00a0uses all the words said in every US Presidential inauguration speech since Richard Nixon in 1969.<\/p>\n<figure style=\"width: 781px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/www.theguardian.com\/news\/datablog\/interactive\/2013\/jan\/21\/presidential-inauguration-speeches-obama-compared\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/visu.al\/wp-content\/uploads\/2015\/07\/analysing-text.png\" alt=\"analysing text\" width=\"791\" height=\"685\" \/><\/a><figcaption class=\"wp-caption-text\">Source: <a href=\"https:\/\/www.theguardian.com\/news\/datablog\/interactive\/2013\/jan\/21\/presidential-inauguration-speeches-obama-compared\">The Guardian<\/a><\/figcaption><\/figure>\n<p>The colours used are assigned to each\u00a0US President, which you can see in the legend at the bottom. Ortiz has\u00a0combined both a Word Cloud and a 100% Bar Graph to produce this visualisation: the size\u00a0of each word in all speeches are displayed in proportion to their frequency, while a 100% Bar Graph is displayed behind each word and is segmented based on how much each President has said it.<\/p>\n<p>Filters are also in place\u00a0to narrow down results. \u00a0Hovering your mouse over a word will only show the other words connected to it and the frequency it&#8217;s been mentioned by each President is displayed in the legend at the bottom as a mini bar graph. \u00a0If you hover the mouse over each President in the legend, then only their Word Clouds are displayed.<\/p>\n<h2>The Republican Nation Convention<\/h2>\n<p>Another political visualisation here from The New York Times, which has visualised how frequently speakers at the Republican Nation Convention have used specific phrases and words. \u00a0The data are sourced from the Federal News Service and is continuously updated.<\/p>\n<figure style=\"width: 744px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/www.nytimes.com\/interactive\/2012\/08\/28\/us\/politics\/convention-word-counts.html#!\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/visu.al\/wp-content\/uploads\/2015\/07\/Screen-Shot-2015-07-08-at-11.09.47.png\" alt=\"analysing text\" width=\"754\" height=\"484\" \/><\/a><figcaption class=\"wp-caption-text\">Source: <a href=\"https:\/\/www.nytimes.com\/interactive\/2012\/08\/28\/us\/politics\/convention-word-counts.html#!\">The New York Times<\/a><\/figcaption><\/figure>\n<p>This visualisation has combined\u00a0a Word Cloud with a proportional area chart (displaying shapes in proportion to the data amount) and has displayed the number of mentions, allowing to accurate referencing. \u00a0Clicking on a &#8220;word bubble&#8221; will highlight every mention of that word in the transcripts below.<\/p>\n<h2>Word Trees<\/h2>\n<p>In this visualisation method, a tree of phrases is depicts the parallel sequencing of words in a body of text. \u00a0Like a Word Cloud, the size of the words displayed is proportional to their usage. \u00a0Word Trees are useful for showing which words most follow or precede a target word or to show a hierarchy of the terms.<\/p>\n<figure style=\"width: 890px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/hint.fm\/papers\/wordtree_final2.pdf\" target=\"_blank\" rel=\"noopener nofollow\" class=\"broken_link\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/visu.al\/wp-content\/uploads\/2015\/07\/word-tree.png\" alt=\"analysing text\" width=\"900\" height=\"519\" \/><\/a><figcaption class=\"wp-caption-text\">Source: <a href=\"https:\/\/hint.fm\/papers\/wordtree_final2.pdf\" class=\"broken_link\" rel=\"nofollow\">The Wordtree<\/a><\/figcaption><\/figure>\n<h2 class=\"p1\">Understanding Shakespeare<\/h2>\n<p>In this B.A. thesis project, Stephan Thiel introduced a new way of reading drama to help people understand Shakespeare&#8217;s work. \u00a0Thiel produced an interesting variety of new ways in which we consume written narrative work and knowledge through the use of code and data visualisation.<\/p>\n<figure style=\"width: 890px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/www.understanding-shakespeare.com\/index.html\" target=\"_blank\" rel=\"noopener nofollow\" class=\"broken_link\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/visu.al\/wp-content\/uploads\/2015\/07\/understanding-shakespeare.png\" alt=\"analysing text\" width=\"900\" height=\"542\" \/><\/a><figcaption class=\"wp-caption-text\">Source: <a href=\"https:\/\/www.understanding-shakespeare.com\/index.html\" class=\"broken_link\" rel=\"nofollow\">Understanding Shakespeare<\/a><\/figcaption><\/figure>\n<p>In the above example, the major character&#8217;s speeches have been highlighted yellow to illustrate the amount of spoke words they&#8217;ve used, compared to the rest of the play. Also the size of each word is displayed in proportion to its frequency.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Language is our primary form of communication, as the words we write or speak are used as tools to transfer meaning. \u00a0But what meaning can data visualisation give about words? \u00a0Can data visualisation reveal patterns and insights into large bodies of text or is it used simply as a medium to produce art? In this &#8230; <a title=\"Displaying Data: Analysing Text\" class=\"read-more\" href=\"https:\/\/visualagency.com\/?p=837\">Read more<\/a><\/p>\n","protected":false},"author":5,"featured_media":4132,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_mi_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"generate_page_header":"","footnotes":""},"categories":[14],"tags":[81,8,75,91,92,93],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Displaying Data: Analysing Text - VISU.AL AGENCY<\/title>\n<meta name=\"description\" content=\"In this post in our ongoing series of displaying data we&#039;ll be looking at analysing text with data visualisation.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/visualagency.com\/?p=837\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Displaying Data: Analysing Text - VISU.AL AGENCY\" \/>\n<meta property=\"og:description\" content=\"In this post in our ongoing series of displaying data we&#039;ll be looking at analysing text with data visualisation.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/visualagency.com\/?p=837\" \/>\n<meta property=\"og:site_name\" content=\"VISU.AL AGENCY\" \/>\n<meta property=\"article:published_time\" content=\"2015-07-16T10:00:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2018-04-30T15:15:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/visualagency.com\/wp-content\/uploads\/2015\/07\/analysing_text_2.png\" \/>\n\t<meta property=\"og:image:width\" content=\"640\" \/>\n\t<meta property=\"og:image:height\" content=\"279\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Severino Ribecca\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebSite\",\"@id\":\"https:\/\/visualagency.com\/#website\",\"url\":\"https:\/\/visualagency.com\/\",\"name\":\"VISU.AL AGENCY\",\"description\":\"A Digital Growth Agency for the Now Generation\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/visualagency.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-GB\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/visualagency.com\/?p=837#primaryimage\",\"inLanguage\":\"en-GB\",\"url\":\"https:\/\/visualagency.com\/wp-content\/uploads\/2015\/07\/analysing_text_2.png\",\"contentUrl\":\"https:\/\/visualagency.com\/wp-content\/uploads\/2015\/07\/analysing_text_2.png\",\"width\":640,\"height\":279},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/visualagency.com\/?p=837#webpage\",\"url\":\"https:\/\/visualagency.com\/?p=837\",\"name\":\"Displaying Data: Analysing Text - VISU.AL AGENCY\",\"isPartOf\":{\"@id\":\"https:\/\/visualagency.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/visualagency.com\/?p=837#primaryimage\"},\"datePublished\":\"2015-07-16T10:00:06+00:00\",\"dateModified\":\"2018-04-30T15:15:57+00:00\",\"author\":{\"@id\":\"https:\/\/visualagency.com\/#\/schema\/person\/81c7d9e4611a17421e15c988ad30d06d\"},\"description\":\"In this post in our ongoing series of displaying data we'll be looking at analysing text with data visualisation.\",\"breadcrumb\":{\"@id\":\"https:\/\/visualagency.com\/?p=837#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/visualagency.com\/?p=837\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/visualagency.com\/?p=837#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/visualagency.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Displaying Data: Analysing Text\"}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/visualagency.com\/#\/schema\/person\/81c7d9e4611a17421e15c988ad30d06d\",\"name\":\"Severino Ribecca\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/visualagency.com\/#personlogo\",\"inLanguage\":\"en-GB\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/887d6cd0ade62b37fdf11c0e40d40db8?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/887d6cd0ade62b37fdf11c0e40d40db8?s=96&d=mm&r=g\",\"caption\":\"Severino Ribecca\"},\"url\":\"https:\/\/visualagency.com?author_name\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Displaying Data: Analysing Text - VISU.AL AGENCY","description":"In this post in our ongoing series of displaying data we'll be looking at analysing text with data visualisation.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/visualagency.com\/?p=837","og_locale":"en_GB","og_type":"article","og_title":"Displaying Data: Analysing Text - VISU.AL AGENCY","og_description":"In this post in our ongoing series of displaying data we'll be looking at analysing text with data visualisation.","og_url":"https:\/\/visualagency.com\/?p=837","og_site_name":"VISU.AL AGENCY","article_published_time":"2015-07-16T10:00:06+00:00","article_modified_time":"2018-04-30T15:15:57+00:00","og_image":[{"width":640,"height":279,"url":"https:\/\/visualagency.com\/wp-content\/uploads\/2015\/07\/analysing_text_2.png","type":"image\/png"}],"twitter_card":"summary_large_image","twitter_misc":{"Written by":"Severino Ribecca","Estimated reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebSite","@id":"https:\/\/visualagency.com\/#website","url":"https:\/\/visualagency.com\/","name":"VISU.AL AGENCY","description":"A Digital Growth Agency for the Now Generation","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/visualagency.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-GB"},{"@type":"ImageObject","@id":"https:\/\/visualagency.com\/?p=837#primaryimage","inLanguage":"en-GB","url":"https:\/\/visualagency.com\/wp-content\/uploads\/2015\/07\/analysing_text_2.png","contentUrl":"https:\/\/visualagency.com\/wp-content\/uploads\/2015\/07\/analysing_text_2.png","width":640,"height":279},{"@type":"WebPage","@id":"https:\/\/visualagency.com\/?p=837#webpage","url":"https:\/\/visualagency.com\/?p=837","name":"Displaying Data: Analysing Text - VISU.AL AGENCY","isPartOf":{"@id":"https:\/\/visualagency.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/visualagency.com\/?p=837#primaryimage"},"datePublished":"2015-07-16T10:00:06+00:00","dateModified":"2018-04-30T15:15:57+00:00","author":{"@id":"https:\/\/visualagency.com\/#\/schema\/person\/81c7d9e4611a17421e15c988ad30d06d"},"description":"In this post in our ongoing series of displaying data we'll be looking at analysing text with data visualisation.","breadcrumb":{"@id":"https:\/\/visualagency.com\/?p=837#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/visualagency.com\/?p=837"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/visualagency.com\/?p=837#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/visualagency.com\/"},{"@type":"ListItem","position":2,"name":"Displaying Data: Analysing Text"}]},{"@type":"Person","@id":"https:\/\/visualagency.com\/#\/schema\/person\/81c7d9e4611a17421e15c988ad30d06d","name":"Severino Ribecca","image":{"@type":"ImageObject","@id":"https:\/\/visualagency.com\/#personlogo","inLanguage":"en-GB","url":"https:\/\/secure.gravatar.com\/avatar\/887d6cd0ade62b37fdf11c0e40d40db8?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/887d6cd0ade62b37fdf11c0e40d40db8?s=96&d=mm&r=g","caption":"Severino Ribecca"},"url":"https:\/\/visualagency.com?author_name"}]}},"_links":{"self":[{"href":"https:\/\/visualagency.com\/index.php?rest_route=\/wp\/v2\/posts\/837"}],"collection":[{"href":"https:\/\/visualagency.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/visualagency.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/visualagency.com\/index.php?rest_route=\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/visualagency.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=837"}],"version-history":[{"count":0,"href":"https:\/\/visualagency.com\/index.php?rest_route=\/wp\/v2\/posts\/837\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/visualagency.com\/index.php?rest_route=\/wp\/v2\/media\/4132"}],"wp:attachment":[{"href":"https:\/\/visualagency.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=837"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/visualagency.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=837"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/visualagency.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=837"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}