Politics, elections and piffle plinking

When analysts are no longer enough.

There’s a cracker of an article titled Rise of the Data Scientist over on what is probably my favourite geek site on all of the internets – Flowing Data (which you really should check out, at least for the visualisations). It links into another article from Dataspora called The Three Sexy Skills of Data Geeks, both of which touch on the argument that conventional data analysis is no longer enough in these crazy data driven times. It tackles an issue that I find myself wrestling with here nearly everyday – how to bridge the gap between data analysis and public understanding of that analysis through data visualisation.

Until recently, an effective data analyst was one that simply knew their way around the theory and mechanics of statistics, had a good grasp of statistical software and where the standout analysts also had a really solid understanding of the practical knowledge of the topic that the data actually came from.

After completing the analysis and publishing it somewhere – usually obscure – the most common way the results of the analysis came to the attention of the wider public was through journalists in the mainstream media – or recently in new media.

But that sort of two stage process is becoming redundant as the world we live in becomes more and more saturated with information – data by any other name.

As public demand continues to grow for what was only recently very specialised fields, those attempting to meet that demand, attempting to bring more sophisticated analysis to the public sphere are facing a quandary – the way we used to do things is simply no longer good enough.

Being solid in statistics is now only part of the story – to be an effective analyst we also need to be designers, story tellers, programmers and knowledgeable in the fields we’re looking at. We have to design ways to transmit the often complicated results of our data analysis into something that is easily understood by folks that aren’t statisticians, in ways that can engage them and satisfy their curiosity. We effectively have to become the analyst, the publisher, the journalist and the web developer all rolled into one.

Unfortunately, there isn’t a university course around that accommodates what will become the bread and butter of the pointy end of new media over the next few years. Ideally, such a course would not only have a heavy quantitative empirical component – stats 100,200 and 300 units etc, but computer science components that cover some combination of Python, SQL, flash, javascript etc. Some design elements would also be useful, not to mention units from journalism as well as some major elective area of study on top of that – like economics for instance, or political science or some other area of specialist knowledge that provides the intellectual underpinnings for the topic that one wishes to analyse data from.

Yet, it’s not only in areas of new media where this is important. The general case of data analytics has grown way beyond delivering a 50 page report filled with graphical fluff, hosting a half dozen meetings to explain the results to the client and using some sleep inducing Power Point presentations as a prop.

Companies are demanding more for an X thousand dollar outlay on this stuff – but they are struggling to find people that can deliver it under the one roof. There’d be folks here that read this blog that have commissioned analytical work for their organisations (often from the usual suspects!) – but when was the last time that a piece of analytical work came in and contained a Dashboard, with drill downs, with what if scenarios, with results in a digital format that you could actually use in an ongoing basis along with the standard reports and metrics and analysis– and all of it explained in a way that is meaningful? When was the last time that you got a DVD full of useful spiffy toys designed specifically for your business and business data plus interactive walkthroughs of the results of the analysis rather than just a power point presentation?

There’s a few firms around that can provide that suite – for the cost of an arm and a leg and a first born child. But smaller consultants that can’t claim to have originated in Boston have the technical resources available now to produce such things, but the training is simply lacking. Technology advanced faster than our university education system could accommodate on this front – but tacking a few extras onto the end of a stats degree isn’t really an answer.

We need to approach this with a fresh perspective if Australia is to keep our reputation of producing world class analysts. That might mean we have to increase the range and types of statistics courses we offer – but what we are producing at the moment isnt tailored for the world we actually end up operating in.

The next generation of analysts need to be more than just data pigs – we need expand our understanding of what a data analyst is and what they are now used for, and give them the skills they need – skills which would already be in deployment throughout our economy if they existed.

The lowest hanging productivity fruit for any business and for any economy is understanding what is actually happening. You cannot manage what you do not know.

The one thing the internet does is provide an enormous capacity to know – what is needed to change that capacity into productivity is data management, analysis and interpretation. It’s why we need to develop a new class of data analyst.

On something completely different, via Dogma in comments, comes Hansard Possum – isn’t he a cute smelly thing!

If you want to waste some time, I found myself playing with this bit of spiffiness for about half an hour this morning. I still have no idea why – warning, may be addictive!

7 Comments

  1. 1
    imacca
    Posted June 5, 2009 at 10:16 am | Permalink

    That ball dropping screen is a seriously productivity sapping page. good find!

  2. 2
    EnergyPedant
    Posted June 5, 2009 at 11:24 am | Permalink

    Very true. At my work we spend a some (but not enough) time trying to eloquently communicate complex results and interactions. Clients are often very keen on a few “expected” outcomes rather than understanding the relationships between inputs and the uncertainties in extrapolations.

  3. 3
    thewetmale
    Posted June 5, 2009 at 5:36 pm | Permalink

    Although i am outside your area of specialization i think you’re making quite a few good points. Especially regarding the need for a degree combining stats and computer/graph porn ability.

    Hansard Possum, works on so many levels, especially the bit about pissing and shitting on everyone and everything. And yeah, that ‘bit of spiffiness is seriously addictive. They should put a government warning on it.

  4. 4
    dan1
    Posted June 5, 2009 at 6:42 pm | Permalink

    Love the way you always use “folks” when your trying hard not to be condescending ;)

    I tried to get this point across to Austpost 15 years ago(honestly) but no. “Why do we care if we can analyze every step a courier makes, every parcel, the time of day, delivery rates per klm etc

    AP = “Waste of time, it’s delivered as soon as it can be , we can’t ask them to speed so it gets there faster”…

    I thought at the time it had some value

    Silly me

  5. 5
    watt
    Posted June 6, 2009 at 10:24 pm | Permalink

    I think universities are trying to offer courses that pretty much do what you are asking for.

    An undergraduate Science degree at the University of Melbourne, with a Statistics major can be taken concurrently with a diploma in infomatics and a sequence in an unrelated field (such as economics or politics) seems to be a reasonable start.

    Alternatively one can take a degree in another field with electives in infomatics and diploma in Maths and Stats concurrently.

    (I have no idea if the courses are good though)

  6. 6
    Posted June 8, 2009 at 11:57 pm | Permalink

    I think this can be extended from just data analysis to all sorts of science and engineering fields. One of the things lacking from formal training in these areas is proper teaching in communication or areas outside the primary focus of study.

    This doesn’t make sense to me since communicating is an extremely important part of the work.

    As a result, you’ll see the best analysts/scientists/engineers are those who can communicate, because they stand out.

  7. 7
    OzPol Tragic
    Posted June 10, 2009 at 8:39 am | Permalink

    My experience in data analysis was the other way around: via liberal arts (where I’d chosen logic over stats) then management, to market research & stats, to computers – although my initial brush with computers (mid60s) was converting words to symbols, to ‘truth tables’ to computer programming (IMthenO, THE definition of ‘boring’). Initially, I couldn’t see the sense in MaAn for data crunching and trend analysis in ‘verbal subjects” research; but only initially, since I soon found the connections to logic, words and imagery … and the masses. Just before I retired, I was adding multi-media and photo-shopping. Thanks to a few current & ex students, I knew where all forms of communication were hypothesised to be within the decade (conversion) … where they will be with VHS broadband … as you’ve covered/ alluded to in the above.

    Data analysis has always been a form of communication, at it’s best in a variety of communication “languages” – words, figures, diagrams & other graphics … and art. I could nominate a few universities (at least 2 in your home state) that would go there with not much pushing – as I’m sure you can, Poss!

    Love ya posts! Go create that course!

Post a Comment

You must be logged in to post a comment.