visualisation – QUT Social Media Research Group https://socialmedia.qut.edu.au Wed, 03 May 2017 07:15:11 +0000 en-US hourly 1 https://wordpress.org/?v=6.7.2 Twitter in Australia: How We’ve Grown and What We Talk About https://socialmedia.qut.edu.au/2017/05/03/twitter-in-australia-how-weve-grown-and-what-we-talk-about/ https://socialmedia.qut.edu.au/2017/05/03/twitter-in-australia-how-weve-grown-and-what-we-talk-about/#respond Wed, 03 May 2017 07:15:11 +0000 http://socialmedia.qut.edu.au/?p=1059 There are plenty of assumptions and not a great deal of reliable data about how we use social media. Twitter, for example, is variously accused of being a haven for leftist outrage and a cesspool of alt-right fascists; it is seen as a crucial tool for crisis communication and a place where millennials share photos of their lunch. Surely, these can’t all be true at the same time.

Part of the problem here is that we all design our own filter bubbles, as the journalism researcher Paul Bradshaw has put it: what two random users see of Twitter might be entirely different, depending on what other accounts they choose to follow. If all you ever see is food porn, perhaps you’d care to make some new connections. Or perhaps that’s what you’re there for.

But if we could look beyond our own, personal networks, what would we see? What are the major drivers of Twitter take-up, in Australia and elsewhere? Do we connect around shared interests, shared location, or pre-existing offline relationships? And when, in the eleven-year history of the platform, did these structures form?

These are the questions that guided a new, long-term study of the Australian national Twittersphere that my colleagues and I have undertaken. Drawing on TrISMA, a major multi-institutional facility for social media analytics, we identified some 3.7 million Australian Twitter accounts in existence by early 2016, and captured the 167 million follower/followee connections they have amongst each other.

New Accounts per Day

Twitter took off in Australia in 2009, some three years after its launch, and saw a fairly steady sign-up rate of 1,000-2,000 new accounts between 2010 and 2014. Growth has slowed since then, and this may indicate market saturation. There are a number of obvious spikes in new account sign-ups, too: the series of natural disasters in early 2011 attracts users to the platform who recognise its role in crisis communication, and the political turmoil of 2013 also seems to drive take-up.

A major spike in 2015 appears to coincide with the devastating Nepal earthquake, but we’ve yet to determine why that event would lead to new Twitter accounts being created in Australia.

To focus in on the core parts of the network, we further filtered this to accounts that have at least 1,000 connections in the global Twittersphere, which left us with the 255,000 best-connected accounts. We visualised their network using Gephi’s Force Atlas 2 algorithm, which places accounts close to each other if they share many connections, and further apart if they are only poorly connected.

Australian Twittersphere

The network map shows clear clustering tendencies: dense regions, where many accounts are closely connected, are separated from each other by lower-density spaces. We systematically examined these clusters, and labelled them based on the overarching themes that emerged from an analysis of the account profiles in each cluster. The result is a kind of birds-eye view of the Twitter landscape, from politics to popular culture and from education to sports.

Perhaps surprisingly, accounts connecting around teen culture make up the largest part of this network: 61,000 of our 255,000 accounts are located here. Other major clusters include aspirational accounts (these include self-declared social media gurus, self-improvement and life-coaching practitioners, and others who sought to use Twitter for professional betterment), at 26,000 accounts; sports, with 25,000 accounts (including distinct sub-clusters for cycling and horse racing); and netizens, technologists, and software developers (17,000 accounts).

Shared interests emerge from this as the central drivers of our connections on Twitter: for the most part, we follow others because of the topics they cover, not because they’re from the same city or state or because we already know them offline. An equivalent map for Facebook, where connections are much more strongly based on prior acquaintance, would likely look very different.

We further found that these accounts also arrived on Twitter at very different times: both netizen and aspirational accounts were created very early in the history of the platform. As expected, netizens constituted the vast majority of Australia’s early adopters, with aspirational accounts close behind; fully half of the population in both these clusters had arrived on Twitter by mid-2010. Sports took a year longer, and may well have been helped along by Twitter Australia itself as it reached out to key sporting codes to get their teams and players signed up.

11 New accounts in clusters per month

By contrast, the teen culture accounts arrived a great deal later. It took until mid-2012 until half that cluster’s population had joined – the teen invasion of Twitter represents a secondary adoption event, following the first big influx of Australian users in 2009/10. Here, too, we suspect active encouragement from key bands like One Direction and Five Seconds of Summer as a major driver.

In spite of Twitter’s reputation as a space for political debate and agitation, politics attracts only some 13,000 accounts (including 1,500 that form a separate, staunchly right-wing cluster); there’s a great deal more to Twitter than political argument.

But if all you ever see on Twitter is partisan bickering, there may be a reason: per capita, the political accounts are some of the most active in the Australian Twittersphere. Over their lifetimes, they’ve posted an average of 7.2 tweets per day (and the accounts in the hard right cluster even manage 12.5 per day); in the turbulent first quarter of 2017, those averages are even higher. Most of the other major cluster communities have managed less than half that work rate; historically, only the teen culture accounts have been similarly active.

Twitter is what its users make it, and Australian users have made it a diverse and dynamic place, even if perhaps they’re less aware of each other than they should be. As users, we should step beyond our networks more often, to avoid becoming trapped in our own filter bubbles – and this goes doubly for politicians, journalists, and others who now treat their immediate Twitter networks as an instant source of popular opinion.

And as a company, Twitter too has much work to do to enable its users to experience the full variety of networked communication and culture that the platform has to offer. Changes to how it recommends new accounts to follow, and how it reveals trending topics outside of our existing networks, could help a great deal in combatting the threat of getting stuck in your own filter bubble.

It doesn’t stop there, of course. We can only speculate what the equivalent networks for Facebook, Instagram, or Snapchat would look like, and what they might tell us about how people are using these platforms.

(An edited version of this article was published in The Conversation on 3 May 2017.)

]]>
https://socialmedia.qut.edu.au/2017/05/03/twitter-in-australia-how-weve-grown-and-what-we-talk-about/feed/ 0
Who’s Joining Twitter? A look at 1 million recent IDs https://socialmedia.qut.edu.au/2013/04/05/whos-joining-twitter-a-look-at-1-million-recent-ids/ https://socialmedia.qut.edu.au/2013/04/05/whos-joining-twitter-a-look-at-1-million-recent-ids/#respond Fri, 05 Apr 2013 06:07:44 +0000 http://socialmedia.qut.edu.au/?p=250 Currently, at QUT Social Media HQ, we’re in the process of developing the new version of our Twitter capture software, led by CCI Data Scientist Troy Sadkowsky. During development, we’ve extracted a few interesting datasets, and this blog post is going to examine one of those; a set of one million Twitter IDs. This set was gathered by registering a new Twitter account on 19 March, and then capturing the user profiles of the 1 million Twitter IDs that immediately preceded that; the data being collected several days after the account creation. As it happened, these IDs had creation dates covering a range of 8 hours and, by the time we collected the data, 422,794 individual accounts. The discrepancy between the number of IDs and accounts requires further exploration; while a number of them could be closed accounts, it seems unlikely that Twitter closed almost 600,000 newly opened accounts within a few days. Thus, we are left to wonder if some IDs are never allocated, whether IDs are allocated at the start of the registration process and never activated, or whether something else entirely is going on. Regardless, the 422,794 accounts in 8 hours represents a rate of 833 new accounts per minute. There were some other interesting findings, so on we go..

Registration Engines?

twitter_1mill_full

Firstly, I should mention that the above diagram, and all the others in this blog post, are from Tableau rather than Excel, which we are beginning to use for our analysis. The above graph has Twitter ID on the vertical axis, and Time Created on the horizontal, covering the full range of 1 million IDs and just over 8 hours. As you can see, accounts are being allocated in a more or less linear fashion (implying that old, deleted, account IDs are not recycled), but there appears to be a slight disconnect, in that at the same time account IDs are being allocated in two different ranges. In fact, as you can see by zooming in, there are actually 3..

twitter_1mill_zoom

This graph zooms in on a smaller period of time, between approximately 6:55 and 7:30pm UTC on 18 March. By zooming in, we can see that there are three approximately parallel lines, with the bottom one being out of sync from the top 2 by almost 2000 IDs, or about 11.5 minutes. One idea for the cause of is that Twitter has three separate registration engines allocating IDs, with each engine being allocated a range of IDs periodically, however we are unable to currently verify this; it could also be that there is some caching process before new accounts are added to the database.

It is also worth noting that of these 1 million IDs, there are 1762 accounts for which the API returns profile information, but have no username. One current theory is that these may be deleted and/or banned accounts in which the username is freed for re-use, but Twitter keep the account ID active for internal recordkeeping, however again further work needs to be conducted to confirm this. Given that there were a few days between the accounts being created and the data being collected, 1762 would seem a more reasonable number than 600,000 for banned accounts.

Where are they?

One advantage of Tableau is that it allows us to produce ‘easy’ visualisations of where in the world Twitter users are. There are a couple of different ways of doing this, and they all rely on users volunteering correct information. Of the 422,794 new users, 7,461 had geo-location enabled. A map of these users can be seen below, and this provides a relatively precise measure of the location of these users. What is interesting here, particularly in reference to the diagram that follows, is that both Russia and Canada have virtually no users with geo-location, yet both have a quite substantial number of overall users. By contrast, geo-located users are more concentrated in the United States and Europe, and Mexico and South America are also strongly represented.

twitter_1mill_geomap

 

The second technique we used was to map the users approximate location according to the timezone set in their user profile. As it turned out, this was a relatively tedious process of mapping Twitter timezones (which use a variation of time zone (‘Eastern time’, ‘Pacific Time’), City (‘Melbourne’) and Country (‘Greenland’)). In case anyone repeats that same exercise in the future, I have made a spreadsheet of the conversion available here, which you should be able to import into Tableau in CSV format. There are a few caveats with this data; countries such as The Netherlands and Morocco seem to be over-represented, which we believe to be caused by them being the first available location for popular timezones; for example Amsterdam is the first listed alphabetically for Central European Time, which includes populous countries such as France and Germany. This data also shows large numbers of registrations for the United States and Brazil. It is also worth mentioning that the time span here, approximately 4pm – 1am UTC, would be afternoon and evening in Europe, and noon-9pm in the US, while being late night and early morning in Australia, which may explain the low number of Australians in the dataset.

 

twitter_1mill_heatmap

What do they do?

The three charts below show number of statuses, followers, and following respectively for these users a few days after their account was created. These more or less stand alone, however it is worth noting that for the chart showing total followers I removed 5 data points for the visualisation – these appeared to be accounts of celebrities, and had 75k, 32.5k, 24.9k, 24.5k and 17.5k followers.

 

Status Count vs. Date of Account Creation:

twitter_1mill_statuscount

Followers Count vs. Date of Account Creation (note previous caveat):

twitter_1mill_followerscount

‘Following’ Count vs. Date of Account Creation:

twitter_1mill_friendscount

 

So, that’s who’s joining Twitter — now to think about the 25.2million new accounts that may have been created by the time this post goes live..

]]>
https://socialmedia.qut.edu.au/2013/04/05/whos-joining-twitter-a-look-at-1-million-recent-ids/feed/ 0