Analytics Roundup: Wise guy edition

Dunning-Kruger effect - Wikipedia, the free encyclopedia

Mistakes in Experimental Design and Interpretation
Taxonomy of ways to fail your experimental design.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






Beauty in the Details

The design process is about whittling away distractions, making the obscure feel obvious, making the obvious feel implicit, and doing it without anyone noticing. To the untrained eye, your best work looks like you’ve done no work at all. If you’ve done a stellar job, then your design will feel utterly obvious. —Neil Mix from Paradox of Elegance blog post.

Neil goes on to say that "it’s easier to see the flaws than it is to see the elegance." That may be true, but a careful look at the best interfaces reveals the little and beautiful elements that make all the difference. These small features might not determine whether someone uses a piece of software, but they will determine if the user enjoys their stay:

Designer Bret Victor, who we first wrote about here, has developed a desktop widget for the SF-area train schedules. He allows users to change their query right in the description of an object—notice the red text.

Magic Ink widget

While we are fawning over Bret's handiwork, here's another cool feature he built into his Click-Shirt site for customized design of t-shirts. This bar at the bottom of the screen tracks the history of changes as the user designs a shirt. Each time I make an edit, I get a visual breadcrumb trail to easily see my history and backtrack.

Click Shirt

Google Finance stock charts have a nifty little device that lets you change the time range you are looking at. You can change both the size and the start of the time window using one adjustable object. Not to mention the embedded alert markers.

Google Finance stock chart

Some elegant touches are more subtle. Check out the search toolbar in Firefox. When I start typing, it fills out search terms both from my history (above the line) and from common searches (below the line).

SearchToolbar

The Safari browser for Windows offers a new approach for finding words in a web page. The browser greys the screen and highlights the target word. And you can to tab through the various instances of the word with the orange highlighting.

Safari Finder

And while we are on the subject of Apple: sometimes the difference between clunky and good is simply about the quality of the images. A while back I wrote a break-up letter to PowerPoint—one reason was that the Mac-alternative called Keynote does a much job with the look of default charts. The chart on the right feels more professional, in part due to the anti-aliasing of the image. (Joel on Software has an interesting post about anti-aliased fonts here.)

PPT chart Keynotechart

Finally an infographic from the New York Times called the Sector Snapshot. The beauty of this presentation of information is in the careful use of contrast and skill at keeping the focus on the numbers.

NYT sector

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

2 comments


June 19, 2007
derek said:

You're dead right about anti-aliasing, although with Excel for Windows my beef is not with the fonts themselves, which are fine enough to suit me, but with the graph symbols, which are embarrassingly childish-looking at any serious resolution. I have resorted to using data labels in Wingdings fonts, centred over a data series formatted so that its symbols are invisible, and this is sometimes acceptable. Though sadly not always, due to the tendency of text and drawing objects to drift away from their appointed positions when printed, exported, or resized.

If they could fix that problem, I would strongly suggest a modern graphics program should make use of font technology for its symbol set, with specialist fonts making hundreds of different symbols available to the chart maker instead of the WinExcel traditional circles, squares and diamonds etc.


May 31, 2009
Kneu said:

Love the NYtimes infographic "sector snapshot: energy".
Do you know of anyone who challenged themselves to reproduce it in excel?
I'm totally in love with it but not smart enough to come up with a solution.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Excel 2007 and the Lie Factor

“The representation of numbers, as physically measured on the surface of the graphic itself, should be directly proportional to the quantities represented.”

Edward Tufte calls violation of this principle the “Lie Factor”. The implementation of in-cell data bars in Microsoft Excel 2007 is a big offender.

Almost a year ago, I was surprised to discover that the Microsoft Excel 2007 development team didn’t understand what zero means. Their implementation of in-cell data bars showed a bar in a cell, even if the cell had a zero or very low value.

Data bars in the Excel 2007 prototype

That was in the Excel 2007 Beta. Things haven’t improved in the current version of Excel 2007. The default setting for data bars in Excel 2007 is to scale to bars so that the smallest bar is based on the smallest value in the selected range and the largest bar is based on the largest value. It still appears that the smallest bar will be no smaller than five or ten percent of the width of the cell. Here’s a sample:

Sample data bars in Excel 2007

So, if you select a range that has values between 600 and 700, the 600 would have a little bitty bar and the 700 would have a full-width bar. Based on the bars, it would look like the 700 is ten to twenty times larger than 600. Outside of Redmond, this is generally regarded as untrue.

What’s more, if you create two sets of data bars side by side, each group of data bars scales itself independently even though they look the same. Take a look at this screenshot:

Sample data bars from two different conditional formats in Excel 2007

Notice the top seven cells have data bars that have one set of scaling and the bottom data bars have a different scaling. However, they look identical, and users should generally expect these bars to have the same scale.

Here are the rules:

  1. Defaults matter! It doesn’t matter that you can do data bars correctly in Excel. The default should be to do it right and it should be hard to do it wrong.
  2. The “right way” to make data bars is to make the length of the data bar directly proportional to the value in the cell. If one cell has a value twice another it should have a bar that is twice as long.
  3. Remove the default gradient shading. The gradient makes it hard to tell where the bar ends, obscuring what you’re trying to show.
  4. Continuous cells with data bars should all use the same scale. Use different colors to indicate ranges that have different scales.

Excel 2007 supports at least twenty-five different combinations of ways of specifying the length of the data bar.

Five different ways of setting data bars

Exactly one of those ways is correct. Base the shortest bar on the number 0. Base the longest bar on the highest value. Turn off the gradient. If you want to see bars based off percentile or some custom formula, then be explicit. Create a new column, create your formula, create bars on that column.

Please, guys, this isn’t rocket science. This is plain common sense. You would not ship Microsoft Word with a glaring bug in the way text renders. You would not ship Excel with a broken statistical function that people use everyday. Delivering deceitful-by-design infographics betrays your central role in democratizing the analysis of data. Until you fix this, in-cell ASCII art still remains the best way to explore data visually.

A disclosure: We do not currently use Excel 2007 at Juice Analytics. This is not due to a high-minded sense of moral outrage but is merely a reflection of our clients' environments.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

9 comments | Show all comments only the last 5 are shown


July 10, 2007
R Varley said:

Hi, I'm trying to write an evaluation document on Excel 2007; everyone seems to think it's rubbish for statistics, but no-one says what's wrong. I've been trawling the internet for days, and turned up nothing beyond "Everyone knows it's broken". Can you give me any pointers?

Thanks.


October 11, 2007
Patrick O'Beirne said:

1st Oct 2007:
Data Bars – Feedback Please
Today’s author: Scott Ruble, the program manager who leads the charting and visualization efforts in Excel. Scott is looking for some feedback on potential changes to data bar behaviour.

http://blogs.msdn.com/excel/archive/2007/10/01/data-bars-feedback-please.aspx


October 12, 2007
Chris Gemignani said:

Patrick, I commented on the Excel databars post. I'm astonished that these questions keep coming up. The solution is simple: "You need to start with the absolute principle that the bars you show _must_ be proportional to the numbers they represent."


December 27, 2008
Greg said:

Is there a way to turn off the gradient fill in the data bars?


April 9, 2009
Matt Craig said:

I have used another way to represent data bars "in cells", but it is "clunky". I create a bar chart, turn off every interface element except for the bars, make the bars slightly transparent (if I still want to see the numbers), and then put it on top of the cells.

It works, but it's "clunky" - the scaling and placement is finicky. It is pretty nice when it's done though, as you can apply appropriate rules to the display of the graph. (e.g. 0 = 0 length), and it gets rid of the annoying gradient (the reason I did it in the first place)

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Persuasive Presentations

Back in my consulting days at Diamond Technology Partners, I was known for my ability to bend PowerPoint to my will and fashion epic presentation-stories from lovingly-crafted slides. There was a term used when a client wanted a good looking presentation; they would ask if it could be "Zachified." Ah, the false glory.

Now I realize I was merely an amateur in designing presentations that could entrance and persuade an audience. I was going on instincts without much thought to the types of evidence, structure, and flow that would convince my audience.

Last week I had lunch with a man who has made a living from teaching others how to create effective presentations. His name is Andrew Abela and his blog is Extreme Presentations. Andrew has developed a thorough framework and training approach. He has a Doctorate and is a professor at Catholic University, so you know he brings an academic seriousness to the messed-up world of flufferpoint:

def (Withrop Hayes): A presentation that attempts to distract from the lack of substantive content or evidence with use of screenbeans, clip art, and other stock pictures or illustrations. A.k.a. clipterfuge (Todd Moy), clusterpoint (Cathy), The Macy’s Data Day Fluff Parade (Jamel).

Andrew gave me a quick backstage pass to his training methodology. Here are a few highlights:

1. Like a fool, I asked whether he preferred the sparse Lessig method or the more traditional, content-rich method. False choice. It all depends on the situation, just don't use the wrong approach at the wrong time. Andrew makes the distinction between "ballroom style" and "conference room style."

"Ballroom style presentations, like most typical PowerPoint presentations, are colorful, vibrant, attention-grabbing, and (sometimes) noisy. They typically take place in a large, dark room, such as a hotel ballroom. Conference room presentations are more understated: they have less color and more details on each page. They are more likely to be on printed handouts than projected slides, and they are more suited to your average corporate conference room. The single biggest mistake that presenters make is to confuse the two idioms, and particularly to use ballroom style where conference room style is more appropriate. I would estimate that upwards of 90 percent of all PowerPoint presentations use ballroom style, yet most of the time our presentation conditions call for conference room style."

That's from an article he shared with us called Achieve Impact through Persuasive Presentation Design (PDF)

2. It is important to mix data-based evidence with anecdotes. People need both of these types of information to persuade both the mind and heart (my interpretation).

3. Anticipate your audience's objections and build them into your storyline. What is better than having exactly the right slide next when someone raises a concern?

4. Good presentations require a lot of thought about their design. Andrew has defined five dimensions of an "Extreme Presentation": logic, rhetoric, graphics, politics, and metrics.

His blog offers a couple useful tools:

  • A framework for choosing the right chart
  • Slides that pass the squint test : "A good way to test whether your page is laid out properly is to apply what designers call the "squint test." Squint at the page, so that all the text is blurred and illegible. Do you get anything about the page without having to read the text? If you can see that the page is showing a process or two or three alternatives or a bunch of things converging, then your page passes the squint test."
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

4 comments


June 5, 2007
Jose Hernandez said:

Hello Zach,

I could not follow the link to the article "Achieve Impact through Persuasive Presentation Design (PDF)", can you please check.

Thanks

Jose


June 5, 2007
Zach said:

That PDF is now available.


June 15, 2007
Atilla said:

Hi Zach,

I noticed that the links at the end of your posts do not point to the correct URLs. For example, the NEXT link points to your current post (Persuasive Presentations) instead of "Excel 2007 and the Lie Factor" post, and the Previous link points to your current post (Persuasive Presentations) instead of "How to manipulate..." post. Is there a way to correct this? I have been trying hard to browse your archives but due to this problem, it is not possible.

Cheers


June 20, 2007
Tom said:

I like #3 a lot... As a college student, my professors are always telling me to do this in my essays, and I don't see any reason why it wouldn't be an effective strategy in a presentation. I'll have to give it a shot sometime soon.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Analytics Roundup: Databases and spreadsheets

Sharing what matters | Economist.com
This requires a new level of flexibility in the database. When building most databases today, programmers decide in advance what sort of questions users might wish to ask of the data, by defining what are known as the “schema”

Allowing Business Users To Program Your System Is A Recipe For Disaster
Parts of the solution make sense, but the problem is scare-mongering.

Spreadsheet errors, news stories about spreadsheets with costly mistakes
Ok, now who's going to measure the benefits?

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






How to manipulate scale to not make your point

Today I got this chart in an e-mail from the well-meaning alumni fundraisers at my school:

Alumni Challenge Chart

Aside from the crazy looking gridlines, monsterous size, oddly phrased title, and overkill for a single data point, its greatest sin is not giving the Young Alumni enough credit. They blew away their goal of 300 gifts, you just wouldn't know it from this chart.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

6 comments | Show all comments only the last 5 are shown


June 1, 2007
darrell said:

In addition, the chart scale is cropped at about 400, and the data clearly indicates 530 is the actual figure. Do I detect a hint of "well we can't tell them we're overfunded and then ask for more contributions"?

Send them a cheque with the signature cropped off. :)


June 1, 2007
Jorge Camoes said:

If you want to see a full range on malicious, misleading or plain stupid charts you just have to browse through some corporate annual reports. You can learn a lot on how to lie with charts.

(And then take a look at this one again.)


June 4, 2007
TonyV said:

To me, it just looks amateurish - I'm guessing that it wasn't intended as an example of all the mistakes that could be made in a chart? You could be cynical and interpret it as "now we've got the cash in, we've got better things to spend our time on than taking the effort to communicate properly". But I'm not cynical ...


June 4, 2007
derek said:

I don't see the size as monstrous (what is "size" in the world of computer display?). Instead I see the supporting text as teeny, a common mistake of "default chart" users in Excel.

I used to have a theory that this was caused by us older people who were taught to make scatter charts in science class at school by being given a sheet of graph paper to fill in. Since the accuracy of hand drawing is better the larger the grid, we used as much of the grid as we could, and squeezed the labels and title into the thin margins.

This theory, while neat and simple, doesn't account for a generation that by now ought to be far beyond drawing on paper, and should have grown up using computer charting tools.


June 4, 2007
James Taylor said:

The power of a college education! Great example.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment