Tuesday, 16 August 2016

Adblocking is asymmetric warfare and publishers can't win

For those of us who like to keep an eye on developments in web advertising blocking (adblocking), there was a flurry of excitement last week as Facebook announced that it had designed and released a way to defeat the blockers. Adverts inserted into Facebook's news feeds have been made harder for an adblocker to tell apart from regular posts by your friends. If the blocker can't spot the ad, it can't remove it.

This has been coming. Various companies have been trialling experiments ranging from asking people nicely not to block ads (The Guardian) to attempting to completely prevent access to a site for those running adblockers (CityAM).

The response to Facebook from Adblock Plus has also been coming. Its Open Source community defeated Facebook's new tech in less than 48 hours.

On Thursday, Facebook announced another change.

And Adblock Plus killed that one the following day.


I've been plugging away on Twitter for a while with the opinion that big publishers can't win an arms race with adblockers. In this post, I'd like to explain why.



Adblocking is asymmetric warfare. On one side, you have large companies - publishers and ad-tech suppliers - who deliver adverts. On the other you have a growing population of people, estimated at 12m (pdf) in the UK, who want to block ads and a series of smaller software projects which allow them to do that.

Effective asymmetric or guerrilla warfare is all about harassment, persistence and the ability to adapt quickly against a large and powerful, but much slower opponent. Guerrilla combatants don't win a definitive victory, but eventually their opponent is forced to give up and make a deal, because they're burning through resources and not achieving very much.

In order to deploy its new solution, Facebook had to develop the code, test it and then deploy it onto a system serving 1.2 billion users, without breaking anything. Adblock Plus took less than 48 hours to nullify that solution - twice - and did so for free.

The blockers haven't even really had to get sophisticated yet to remain effective. All most adblockers do, is check against a list of server addresses that deliver adverts and web page elements which contain them and remove those before the page is shown to a user.

For a newspaper or Facebook to show an ad, a bit of code is inserted in the web page that effectively says:

[start advert]
Call ad-server and request an ad!
[end advert]

Facebook made that code more difficult to spot, but apparently not difficult enough.

Let's say Facebook got really clever with hiding their ads and mixing them up so the code for their containers and servers looked exactly the same as your friends' posts. What could blockers do in response?

1. Compare the posts in your news feed to your Facebook friends list and zap anything that doesn't match. It requires more coding than just killing advertising flags on a web page, but it's not really difficult... There are already plugins that have this functionality.

2. Use the legal requirement that adverts are clearly labelled as being adverts, as a way for blockers to find and remove them. You've got to write "ad" in the corner of an advert to obey the law. This stuff really isn't that hard.

3. Start training machine learning models to use the content of adverts to identify and remove them. A Gmail spam filter for the whole internet. Are you sure you want to start this fight?

None of those even touch on the idea of weaponised adblock, which it's not unusual to see suggested on tech forums. Currently, adblockers are passive; they prevent an advert from loading. But they don't have to be. Adblockers could deliberately load and click on ads in the background, thousands and thousands of times - without showing them to the user - to screw up the web's revenue model. I'm absolutely not saying they should, but they could, and the idea isn't new.

Are you really sure you want to start this fight?

This is why the advertising industry must acknowledge - whatever its position on the morality of adblocking - that it can't win an arms race with the blockers. Adblocking counter-measures are slow and expensive to roll out and industry wide, will need the cooperation of a huge number of participants.



Blockers can rely on a passionate community of skilled programmers who iterate and deploy new solutions quickly. The blockers don't have to win every time or force a dramatic surrender, they just have to make life difficult and expensive for the ad industry until it eventually gives up.

Advertisers, agencies and publishers must acknowledge three key facts:

1. Adblock penetration is growing rapidly and this is a problem

2. Publishers cannot win an arms race with adblockers

3. A large and growing section section of the population is on the side of the adblockers and so is the law.


It is imperative that the advertising industry identifies the reasons for the rapid growth of adblocking, in order to work towards an alternative to the current situation, where over 20% of people in the UK consume all non-paywalled internet content entirely for free.

This alternative is likely to encompass a model where adverts are no longer a vector for malware, are significantly less irritating to users and provide for much better privacy.

(no, this chart doesn't follow my data vis best practice)

As I've written previously, our new world may well be a much nicer place for large publishers delivering quality journalism. Their audiences won't be trackable onto thousands of tiny websites, so large publishers will be in a much better position to charge a premium for advertising space.

For better or worse, this new world is coming. If you target younger, tech savvy audiences, it is already here. It can't be fought so we must work out how to be effective within it.

Wednesday, 22 June 2016

Enthralled by tech we don't understand

The advertising world has discovered this week that an app purporting to crowd-source spotting migrant boats in trouble in the Mediterranean did no such thing.

It rendered the same static image for every user and when they 'spotted' the boat in it, asked for their contact details.



On the plus side, it did update with the weather forecast for Libya.

The app just won a Bronze award at Cannes.

Apple has pulled it from their app store and an investigation has started.

Whatever the morality of producing the app in the first place - which is pretty shocking - what the episode brings home to me, is the need for senior managers making decisions about tech, to actually understand the technology. Not to be programmers themselves necessarily, but to have a good idea of what is possible, what is easy and what is difficult.

A bunch of judges at Cannes decided that the migrant spotting app was worthy of an award.

None of them apparently had the nous to say "hang on, where are they getting their live satellite images?"

Of course the app can't exist. Marketing creative agencies don't have access to live satellite video streams of the Mediterranean. This isn't 24 and Jack Bauer's not an agency staffer.

Google Earth isn't a live stream.

You can get live images, if by 'live' you mean one per day. And it's not cloudy.

At the very least when it comes to technology, if we don't know, then we need to find an expert and ask. There are charlatans out there in the world. Some of them are software vendors, some do digital advertising and some make apps. If we're not to be taken in, the level of tech savvy in our industry needs to increase and quickly.

Thursday, 14 January 2016

Six tricks to make your data visualisations look better

I saw a tweet yesterday from @colinttrainor and couldn't agree more with this idea.



(tweet shown split into two for clarity)

Whatever software you're using, there's simply no excuse for accepting the defaults and not trying to make your charts look more professional.

But where do you start? It's easy to look at great design and agree that it's great, but when you're looking at an Excel default, inspiration is much harder to come by. What do you change first?

These are a few tips, which I've picked up as I tried to make my own output look better and hopefully a few of you might find them useful too.


1. Make yourself a colour palette.

Your company very likely has a corporate colour palette that you're expected to use for PowerPoint, and whether it's good or bad, it doesn't half help to simplify your choices. Rather than selecting from the entire spectrum of colours, you'll have six or eight to work with that (hopefully) are designed to complement each other.

You can make your own colour palette really easily and use it to give your charts, your blog and anything else you build a consistent, professional feel.

Go to Adobe Color and have a play. Find some visualisation examples that you like, which will help you to decide on the feel of your palette. Do you like striking contrasts, or more subdued tones? There isn't a right answer, but what you choose will have a dramatic impact on the impression that your charts create.





2. Simplify, simplify, simplify.

If I had to reduce my Effective Data Visualisation for Marketers presentation down to a single tip, it would be this (paraphrasing Tufte): If it isn't essential to your chart, get rid of it.

Turn off gridlines, reduce colours, turn off legends and axes if you can,  Look at Colin's tweeted example above. Most of the difference between the two, comes from turning off gridlines and axes, leaving only data.


3. Black text looks amateur.

Have a look at virtually any professionally produced visualisation, infographic, or presentation document. Is the text black? Look closely, is the text jet black, rgb(0,0,0)? I bet it isn't.

On a white background, changing all of your text to a very dark charcoal shade of grey, just works. I'm not a designer so I don't know why it works, but it does. Just do it.


4. Are you sure you want a white background?

The answer might well be yes, but this one is worth thinking about. Clean, white space is a very good thing but a shaded background can lift your visualisations and give them extra pop.

You probably don't always want black, but it's worth keeping in mind to make bright colours jump off the page.


If not black or white, then light greys can look good. When you look closely, it's surprising how many visualisations that you might initially assume have a white background, actually don't.

Here's one of my football match single-page dashboards. The background is a very light, warm grey. On the white background of this blog post, it's easy to see, but against Twitter's dark image borders (which is where I use these), it's not nearly so obvious.


Bright colours definitely aren't for backgrounds and one other thing you absolutely must do if you change the background, is to change everything that was white background, into your new colour. Miss a spot and it will look awful.


5. Times New Roman? Yuk.

Like colour palettes, this is something that most companies do, but few individuals take the time to set up. It makes a difference.


Take the time to pick a font. Here are a couple of resources to help.

By the way, there's a reason that the example fonts above are a picture and I didn't just change the font in this blog post to type them. I want to show you exactly the fonts I chose and not every web browser will render all fonts correctly. If your computer doesn't have a particular font installed and a website tries to use it, then you'll get something else.

Have a read about web safe fonts.

Web safe fonts aren't important if you're drawing charts in Excel and publishing pictures of them, but they are if you're building Tableau dashboards to publish online. If a font isn't available, Tableau falls back onto Times New Roman and makes your dashboards look horrible. Some of my dashboards do this and I really need to fix them.

In Tableau, it's a good idea to use a very common font that's close enough to what you really wanted, than to try to use something beautiful, which nobody else will be able to see.


6. Different is good

There's one more overarching theme which I'd like to mention and it comes from the world of advertising: Different is good.

Different is eye catching, and it will set your work apart.

Even if Excel, Tableau, R and other programs' default charts were beautiful (which they aren't), there'd still be huge value in changing them. If your work looks like everybody else's work, then it's not eye catching, will blend in with everybody else's work and not get noticed.

Find your own visualisation tone of voice and your charts will stand out much more. It's not always easy to break away from the defaults and try to make something better, but it really is worthwhile.


Finally, remember...

Good artists copy, but great artists steal.

Picasso (maybe)

Google image search is your friend. Good luck!

Wednesday, 12 August 2015

Adblocking could be the saviour of high quality journalism

Reading the latest news, adblocking is heralded as the harbinger of the online apocalypse.

Many internet users have decided that they don't like to be tracked and profiled.

They don't like to be irritatingly diverted from the article that they're trying to read.

They don't like auto-play videos.

They don't like ads for things they've already bought, following them around the internet.

And they especially don't like their mobile data allowance being chewed up by advertising.

All of this means that the user base of adblocking software is on an upward trajectory and publishers are starting to worry.



According to PageFair, 21% of UK internet users use an adblocker. Amongst younger, tech-savvy audiences, the rate of blocking is much higher.

For publishers, this is a serious problem, because they get paid per thousand eyeballs viewing ads on their webpages.

For example, if I visit The Guardian, I see this


That banner at the top is generating a small amount of money per visitor for The Guardian.

With a blocker? Poof! It's gone. The page resizes into the gap and you'd never even know it had been there. The Guardian gets no money at all for my visit.

As an aside, Ghostery gives me a list of every tracker that's logging my visit to The Guardian and many of these will subsequently try to follow me around the web and see where else I go.

It's quite a big list...



That's FIFTY EIGHT separate trackers, just from a single visit to The Guardian homepage. No wonder some users want to opt out.

So adblocking continues on its upward trajectory, gains increasing penetration into mobile devices, and in the near future hits and then passes 50% of all users. All of our national newspapers go bust. It's inevitable, right?

Well no, actually, I don't think so.

I think adblocking could be very good for high quality journalism, but there'll be some short-term pain before we get there.

First a little bit of technical info...

Adverts are fairly easy for users to block, due to the way that they're bought and sold and embedded onto web pages. When you visit a newspaper's website, it loads the article you want to read from its own server and makes a request to some other server(s) for adverts. If you keep a big list of the servers that are used to deliver advertising, you can block them in your browser. The article loads and the adverts don't. Simple.

Now imagine the adblocking rate is 100% of internet users. What happens?

Any user, visiting any website, sees no third party adverts at all and publishers get no money.

But advertisers still want to talk to the The Guardian's readers. Of course they do, The Guardian's readers spend a long time staring at those pages and they're an educated bunch, with disposable income. Advertisers like people with disposable incomes and will work quite hard to get their brands in front of them.

Advertisers also no longer have an option to place their ads programmatically - splattering them across the web on loads of different sites and trying to follow individual users around - because as soon as they try to do that, the adblockers kick in and their messages disappear.

Suddenly, there are big online advertising budgets available, but no way to spend them programatically across lots of smaller websites.

So newspapers start selling advertising space the old fashioned way. They agree to place adverts on their site - delivered from their own servers, woven into articles and much tougher to block - and they agree a price to do it.

They might even negotiate that deal on the telephone. How quaint.

It's a model that looks a lot more like the traditional way of selling advertising space. Individually struck, higher value deals with large companies, rather than automated small-ads scattered across the web.

Why doesn't that already happen? Well sometimes it does, but currently, large proportions of online advertising budgets are used to buy programmatically. If you've got a choice between negotiations with individual newspapers, or bunging some cash at an ad exchange on the promise that the exchange will find your exact target audience wherever they are on the web and show them your ad, then of course you'll pick the automated route.

Adblocking takes away the automated route. Users don't like the tracking it entails and in increasing numbers, they're opting themselves out.

If you can't track and follow an individual user onto sites where its cheap to advertise, then you're going to have to buy your audiences in bulk, in the places where they congregate.

Adblocking doesn't leave an internet barren of advertising. It leaves large websites, with high quality audiences, in an incredibly strong position to negotiate their own deals.

For newspapers, it removes a huge number smaller websites from their competitive set and puts them back in the much stronger position that they used to enjoy.

Adblocking at scale would redirect online budgets towards those sites that can guarantee large, desirable audiences for advertisers. It could be the saviour of high quality journalism.

Thursday, 6 August 2015

Is Microsoft's Power BI a Tableau killer?

Have you heard of Betteridge's law of headlines? Then you already know the answer to this post's headline. No it's not.

Why not? Good question. And there is one feature where Microsoft's Power BI  manages to lay a glove on Tableau, but we'll save that for later. First, why are we here?

My last post on Wallpapering Fog was part reminiscence, part rant and part lament about where it's all going wrong with Microsoft Excel. You see, I used to like Excel a lot and mostly, I really don't any more. It's still very useful, but in terms of new features, the bad stuff is starting to outweigh the good.

I had a comment on that post from the design leader on Power BI - which is very flattering - asking if I'd tried the general availability version of their new software and I hadn't, yet. I have now. My IT department is also pushing Power BI so I really needed to take a proper look.

A bit of background if you're unfamiliar with Wallpapering Fog - I've been using Tableau (full-fat and Public) for four years and love it, but wouldn't say I'm wedded to it. I'll be very comfortable jumping ship if something better comes along, but for now Tableau is the best BI software on the market by quite a distance. I think using the best and looking for better is a healthy attitude to take.

Before Tableau, I used Excel Services. It was rubbish. Hopefully Power BI is better.

Onto the test. Is Power BI any good?

I went to powerbi.microsoft.com and signed up.


I got a button that said "Get Started" and clicked it.

The website paused and said "working on it".

Hmmm. I haven't asked you to do anything yet. What could you possibly be working on? As a (reluctant) SharePoint user, I know that "working on it" message all too well. This is not the most promising start.

Still, it's gone in a couple of seconds and I've promised to be as objective as possible with this review. We're in, let's move on.

I'm deliberately going to pile into Power BI without reading any instructions at all, because that's exactly the approach I took with two of its competitors - Qlik View and Tableau - and with both, I was able to make good things happen pretty quickly. I only needed help later, as more advanced features came into play. That's the benchmark.

This will also be an early review and I haven't tested Power BI extensively. I may have missed things, but seeing what you can achieve with a piece of software in 24 hours is a useful exercise. In my experience, great software - like Photoshop or Tableau - will blow you away immediately and then keep on giving as you discover more depth. Before I'd invested any time in Tableau and long before I'd got involved with its community, you can read that it did that.

What to do first? I need to connect to some data so we can put Power BI through its paces. Let's have a look at the options.

I can load a csv file or spreadsheet, but decided to have a crack at an API connector and was presented with a slightly strange array of options.


Acumatica (who?) and Circuit ID (also who?) but not the usual array of social, stock price and economic data connectors. Odd.

Anyway, Google Analytics is there. Everything connects to Google Analytics. Let's try that.



Oops. Five minutes of Power BI thinking about things and then I gave up and refreshed my browser.

Maybe I'm being too ambitious (I'm not, I managed to do this in the competition's software). I loaded the example retail dataset.

A dashboard appeared!



If I'm being picky, some of the formatting is a bit scrappy - especially considering it's the only bundled example - but it's a dashboard. Let's not be too picky yet.

I can click around, highlight and filter and go into edit mode, but after a couple of minutes, I found a button for "Power BI for desktop" and downloaded it. I had hopes of more connectors for local databases and did find those. After I eventually installed it.








Yeah, so like I said. Eventually installed. Including rebooting a work PC that doesn't hurry itself to reboot, that little lot took the best part of twenty minutes.

Why don't I already have the latest version of IE? Because it's rubbish. I use Chrome and Firefox.

On the plus side, from the desktop, the Google Analytics connector works.

From this point on, the niggles stopped and Power BI was... Fine. Not good, not great. Fine.

I loaded some traffic data from Google Analytics and drew a couple of charts. I noticed that dates don't automatically drill from years, through months, to days like they do in Tableau and so manually created a month field.

Power BI's got a weird distinction between measures and columns, so creating that date column involved a brief false start. Apparently dates can only be a column, not a measure. What's the difference? Tableau switches effortlessly between dimensions and metrics, which is a very useful feature. You can split data by a numeric field (using it as a dimension), or you can add that numeric field up. At the point you create the field, the distinction doesn't matter - it's just a column of data and you choose what to do with it afterwards.



Creating charts was straightforward enough, but quite limited. Microsoft have ditched the rows and columns interface from Pivot Tables (that Tableau adopted and supercharged) and gone with a vertical interface. It works, but it's nowhere near as intuitive.


That list of options for summing, averaging and counting is again... fine. Just the basics. No frills.

Enough Google Analytics. I restarted Power BI and connected to one of our advertising datasets on SQL Server. 350k rows that describe what different companies have been spending on advertising over the past few years, to see how Power BI would handle a little bit more data.

It coped fine. Connecting was slightly counterintuitive because you have to tick a little box next to the table name, that's not obviously a tick box. Just clicking on the table name brings up a preview, but leaves the "Load" button greyed out and leaves you scratching your head for a bit.

Data loaded, I quickly put together this summary view of UK advertising spend.




You can filter across the whole page, or just on one chart. If you click on something - a date, or a category - then every element on the dashboard immediately filters to what you've selected.

I like that sort of universal filtering behaviour, when I can control it, but it doesn't look like you can here. Everything filters when you click something, whether you like it or not. That has the potential to confuse the hell out of non-technical end users of your dashboard as they idly highlight a date and most of the data on the view suddenly vanishes.

In keeping with the imposed filtering, you can drop a filter element onto the view (top right on my little dashboard) to make it more obvious what's going on, but you can't attach that filter to specific dashboard elements. You filter everything, or nothing.

If you look closely at my dashboard, you'll see that the dates (which are slanted, yuk) are in the wrong order. That's because in our SQL Server, they're stored as strings.

I tried to make a new column that would be a correctly formatted date and this happened.


What's wrong with that? I don't know, it looks fine to me. Apparently Power BI doesn't know either, because the error message is blank.

I tried calling it Formatted_Date, removing the space. Nope.

I tried a few other ideas. Nope.

Maybe DATEVALUE doesn't work like in Excel. That would be daft.

I gave up.

Overall, Power BI feels like it's doing the bare minimum. You can drop charts, text and maps onto the screen. You can sum and average data. You can format axes and change colours. That's about it.

There are features to create variables (when they work) and to connect data sources together, but BI software will stand or fall on the front end and how you are able to present data back to a user.

Drawing on a couple of my own recent projects, this is Tableau. So's this. Both built in the free, Public version. As far as I can see, you've got no chance of producing visualisations like these in Power BI. It's not outright bad, it works, it's just miles behind the competition.

Power BI hasn't come out swinging. It's a cagey, cautious entry into data visualisation that seems competent, but nothing more than that.

Before we go on, I should mention price, because it's important. Full fat Tableau Desktop is over £1000 a copy. It bloody well ought to be good.

Full fat Power BI is $9.99 a month. These two pieces of software aren't targeting the same market.

It's fairer to compare Power BI with Tableau's free offering - Tableau Public.

Tableau Public is full-fat Tableau Desktop, with database connections stripped out and you can only save your visualisations to Tableau's cloud. We've established that in terms of the sophistication of visualisations you can build, this means Tableau will blow Power BI out of the water. What about when you publish your dashboard to the cloud, for others to see?






Data access and refresh is where Power BI wins over Tableau. It can connect to database sources and APIs and auto-refresh from those sources so that your online dashboard stays updated without you needing to do anything. This functionality is free and for $9.99 a month you can do those refreshes at high frequency, with a bigger data storage allowance, though one that only takes it up to the allowance that Tableau offers for free.

OK Microsoft, you just got my attention. It might be just a line chart and a map, but an auto-refreshing line chart and a map is interesting.

What's the catch?

Well if I had an auto-refreshing dashboard, I'd want to embed it into this blog, or into hilltop-analytics.com, but you can't do that without paying $5 a month for SharePoint Online. I use SharePoint at work and it might just be the worst designed piece of software I've ever come across. I'm not even sure it was designed. No, I'm not paying my own money for it.

For now, if you want to build a sophisticated, good looking visualisation, you need Tableau.

If you want to build a more basic visualisation that refreshes itself, then Power BI is an option, but you'll only be able to share it on app.powerbi.com, not embed it.

There are hopes within the Tableau community that Public will get Tableau's new web data connector, which is coming in the next mini-release. If that happens, then Power BI is dead in the water, because its auto-refreshing (at a low price) USP will be gone.

In a business context, Power BI is in a potentially strong position. If your company has already bought into Microsoft's Office 365 and SharePoint stack - as mine has - then Power BI will integrate with that fairly cheaply and allow users to publish visualisations to each other. I wouldn't be at all surprised to see it gain a fair amount of traction.

Unfortunately, as analysts, that will mean investing our time making the argument that Power BI's competitors, while significantly more expensive, are significantly better. Tableau and Qlik View are changes to how analysts work and massive boosts to their capability and efficiency. Power BI is not that. If you just want to automatically publish a table and chart of financial results to the exec team though, it will certainly be useful.

Overall, Power BI is a 6/10 product and in places it doesn't feel finished. It's useable, but limited. Give it a try, but don't expect too much.


Footnote:

I haven't mentioned Power BI's other innovation of natural language search. The idea is that users can type in "sales in France" (or similar) and the dashboard will show them that.

I haven't mentioned it because it's a gimmick. Cortana (I assume it's got Cortana's engine) isn't the Starship Enterprise computer and it's not going to be intelligent enough to be useful. You'll see this message a lot.


Unfortunately, I already have experience of this feature playing really well in a controlled demo, where the salesman knows it will work. It's going to make it harder to explain to an excited senior exec that Power BI isn't really very good, when "you can talk to it and it just understands!"

Footnote 2:

In this review, I may have said Power BI can't do something, which it actually can. If that functionality comes from bolting on Power-something-else, then I'm not interested. Power BI, PowerView, PowerPivot... the Microsoft data ecosystem is a bit of a mess at the moment and it needs a more coherent offering. Power BI should work out of the box as a single download, as its competitors do and that is how I've tested it.

Tuesday, 2 June 2015

We need to talk about Excel

I'm not sure when it happened. I've got a feeling that the writing's been on the wall since the introduction of the 'Ribbon' menu.

I was the guy who could make Excel dance... Shortcuts flying, interactive dashboards, external data connections and VBA. I loved Excel.

A colleague, observing me building a spreadsheet a few years ago, said, "F*ck me, it's like watching Minority Report".

Proud moment.

Not any more.

Modern Excel is a mess.


It's worth taking a moment to consider how we got here. Excel was first released as Windows software with version 2.0 in 1987. It's nearly thirty years old.

Back when I started out as an analyst in 2000 - who was genuinely excited that a company had seen fit to employ him and to allocate him a desk and a PC - we were using Excel 97. This was the first version to contain proper VBA and also came with Clippy, the universally reviled Office assistant.

Clippy aside, Excel 97 was pretty good. It had most of the useful functions and features that you'd find in modern Excel and it worked.

Crucially, Excel was what you got. It was restricted to 65k rows and its charts looked bloody awful, but there wasn't really an alternative.

Having VBA baked-in made Excel tremendously flexible (and the bane of IT departments everywhere). With a bit of creativity, you could use it for statistical modelling, interactive dashboards, as a calendar, a project planner, a to-do list... And we did. Excel got (ab)used as a solution to every business problem going.


Back in 2000, Excel was the centre of an analyst's world. What happened?


Specialist software has chipped away at Excel's 'jack of all trades' USP.

If a computer can do it, you can probably make Excel do it. That's no exaggeration. VBA is behind Excel and so if some functionality doesn't exist out of the box, then you can add it. You want games in Excel? Here are fifty. Be warned: I make no guarantee those games won't royally screw up your PC. VBA can do that too.

When you break down the uses for Excel, you find new competitors are encroaching on all sides. Competitors that are designed to do a specialist job, to do it really well and that integrate with each other to provide a complete solution. For statistical modelling, you've got R, SciPy, Matlab... For visualisation, you've got R (again), Tableau, Qlik View... For data storage you've got a vast array of options and for data processing (ETL), you've got Alteryx, Pentaho and again, the list goes on.

That's just the things that Excel is actually for. Under the list of things that Excel has been abused to make it do, there are hundreds of better options. Many of them free. If you want a to-do list, for goodness sake pick something that's designed to do that job.


Excel is like a Leatherman multi-tool. You can get most DIY jobs done with it if you try hard enough.


But a Leatherman is rarely the best way to do any specific job. You want a proper screwdriver, or a full-size hacksaw, or to have a corkscrew for your dinner party that's not also attached to a pair of pliers.

A specialist's toolkit looks like this. One tool - the right tool - for each job.


This is Excel's problem in 2015. It's trying to do everything - often by bolting on more plugin tools - and so it's doing almost everything badly.

Excel's a great way to make some very average looking data visualisations, or to store your data in a way that makes it really difficult to manipulate quickly and to refresh. Excel can deliver a crap interactive dashboard to a (SharePoint) web page and it can do statistical modelling that's really hard to repeat, and leaves no audit trail.

Yes, you can sort of fix those issues, with plugins and macros and hacking and creative thinking, but that's back to fixing your motorbike with a Leatherman, when you could have had the full range of Snap-On tools.


Modern Excel has one more problem. And it's a biggie...


You can't be a beginner's introduction and a specialist's cutting-edge tool at the same time

Yes, I'm going to start with a rant about the Ribbon menu. It was a stupid idea when it was introduced and it's still a stupid idea now. When you watch an experienced user manipulate a familiar piece of software, you'll rarely see them touch the mouse, because it's a slow way to do what you want.

Microsoft introduced the ribbon to make features more prominent for selection with the mouse (and presumably with a view to the arrival of touch-screens). With subsequent releases, more and more features have moved into areas where they are difficult or impossible to access with the keyboard; try formatting a chart, or even saving a file in Excel 2013.

This might sound like a petty complaint, but it's a symptom of a very serious issue. The Ribbon and mouse / touch control were a big two-fingers to experienced Excel users.

Excel has been progressively dumbed-down to make it easier to access for inexperienced users.

Which is absolutely fine.

Except that simultaneously, Microsoft has introduced PowerBI, with features that aim squarely at advanced data manipulation and visualisation. I've tried them and to be frank, they're not up to scratch. They're awkward to install, difficult to use and when you do get them to work, they produce very average looking output.

Excel has ended up in a place where it's too advanced and has too many features for novice users and it's not as good as a dedicated toolkit for specialists. That's not a comfortable place to be.


Where now?

Excel has a strong defensive position, in that big IT departments like it because it's part of a suite of Microsoft software that they're already buying. As a business analyst, you also need Excel plus other tools - if only because everyone else still uses it - so it's not going anywhere in a hurry.

That defensive position is being eroded on all sides though. Particularly because you can get a lot of the competitors that I've been discussing for free. If your corporate IT environment isn't completely locked down, then you can make a lot of headway with open source, start to get your best work out into the world and then argue about commercial software licences later...

There is also one thing that Excel is truly brilliant at and it's not to be dismissed lightly. Sometimes you want a multi-tool. Just for a quick job, because it's easier than delving into the big toolbox. Excel is a fabulous tool for this. For quickly reformatting one-off data, for banging out a functional chart, or for correlating a couple of variables, you can't beat Excel.

Microsoft should recognise this use for Excel and take it right back to basics. Turn it into a Leatherman; a lightweight, portable, do-anything, data scratch-pad, that's not trying to be more.

They'll still need a full featured BI solution of course, and possibly something else that targets less experienced users, but stop trying to make Excel the scaffold that holds the whole data analysis structure together. It's not working and if my experience is anything to go by, it's leading experienced users to actively dislike the product.

If Microsoft don't produce that lightweight scratch-pad for data, I firmly believe that somebody else will and that could spell the end of Excel as a tool for serious analysts. Excel will have been replaced for the one task at which it is still the best option.

Monday, 23 March 2015

In praise of modesty

One of these people invented the World Wide Web.

The other talks about it.



One of these people is the best footballer in the world.

He's not the one in the gold shoes.



More of this please.