Analytics Roundup: Wise guy edition
By Chris Gemignani
June 22, 2007
Find more about:
analytics
humor
management
statistics
- Dunning-Kruger effect - Wikipedia, the free encyclopedia
- Mistakes in Experimental Design and Interpretation
- Taxonomy of ways to fail your experimental design.
Beauty in the Details
By Zach Gemignani
June 18, 2007
Find more about:
infovis
interface
The design process is about whittling away distractions, making the obscure feel obvious, making the obvious feel implicit, and doing it without anyone noticing. To the untrained eye, your best work looks like you’ve done no work at all. If you’ve done a stellar job, then your design will feel utterly obvious. —Neil Mix from Paradox of Elegance blog post.
Neil goes on to say that "it’s easier to see the flaws than it is to see the elegance." That may be true, but a careful look at the best interfaces reveals the little and beautiful elements that make all the difference. These small features might not determine whether someone uses a piece of software, but they will determine if the user enjoys their stay:
Designer Bret Victor, who we first wrote about here, has developed a desktop widget for the SF-area train schedules. He allows users to change their query right in the description of an object—notice the red text.

While we are fawning over Bret's handiwork, here's another cool feature he built into his Click-Shirt site for customized design of t-shirts. This bar at the bottom of the screen tracks the history of changes as the user designs a shirt. Each time I make an edit, I get a visual breadcrumb trail to easily see my history and backtrack.

Google Finance stock charts have a nifty little device that lets you change the time range you are looking at. You can change both the size and the start of the time window using one adjustable object. Not to mention the embedded alert markers.

Some elegant touches are more subtle. Check out the search toolbar in Firefox. When I start typing, it fills out search terms both from my history (above the line) and from common searches (below the line).

The Safari browser for Windows offers a new approach for finding words in a web page. The browser greys the screen and highlights the target word. And you can to tab through the various instances of the word with the orange highlighting.

And while we are on the subject of Apple: sometimes the difference between clunky and good is simply about the quality of the images. A while back I wrote a break-up letter to PowerPoint—one reason was that the Mac-alternative called Keynote does a much job with the look of default charts. The chart on the right feels more professional, in part due to the anti-aliasing of the image. (Joel on Software has an interesting post about anti-aliased fonts here.)

Finally an infographic from the New York Times called the Sector Snapshot. The beauty of this presentation of information is in the careful use of contrast and skill at keeping the focus on the numbers.

2 comments
derek said:
You're dead right about anti-aliasing, although with Excel for Windows my beef is not with the fonts themselves, which are fine enough to suit me, but with the graph symbols, which are embarrassingly childish-looking at any serious resolution. I have resorted to using data labels in Wingdings fonts, centred over a data series formatted so that its symbols are invisible, and this is sometimes acceptable. Though sadly not always, due to the tendency of text and drawing objects to drift away from their appointed positions when printed, exported, or resized.
If they could fix that problem, I would strongly suggest a modern graphics program should make use of font technology for its symbol set, with specialist fonts making hundreds of different symbols available to the chart maker instead of the WinExcel traditional circles, squares and diamonds etc.
Kneu said:
Love the NYtimes infographic "sector snapshot: energy".
Do you know of anyone who challenged themselves to reproduce it in excel?
I'm totally in love with it but not smart enough to come up with a solution.
Add a comment
Excel 2007 and the Lie Factor
By Chris Gemignani
June 7, 2007
Find more about:
analytics
excel
sparklines
tufte
visualization
“The representation of numbers, as physically measured on the surface of the graphic itself, should be directly proportional to the quantities represented.”
Edward Tufte calls violation of this principle the “Lie Factor”. The implementation of in-cell data bars in Microsoft Excel 2007 is a big offender.
Almost a year ago, I was surprised to discover that the Microsoft Excel 2007 development team didn’t understand what zero means. Their implementation of in-cell data bars showed a bar in a cell, even if the cell had a zero or very low value.

That was in the Excel 2007 Beta. Things haven’t improved in the current version of Excel 2007. The default setting for data bars in Excel 2007 is to scale to bars so that the smallest bar is based on the smallest value in the selected range and the largest bar is based on the largest value. It still appears that the smallest bar will be no smaller than five or ten percent of the width of the cell. Here’s a sample:

So, if you select a range that has values between 600 and 700, the 600 would have a little bitty bar and the 700 would have a full-width bar. Based on the bars, it would look like the 700 is ten to twenty times larger than 600. Outside of Redmond, this is generally regarded as untrue.
What’s more, if you create two sets of data bars side by side, each group of data bars scales itself independently even though they look the same. Take a look at this screenshot:

Notice the top seven cells have data bars that have one set of scaling and the bottom data bars have a different scaling. However, they look identical, and users should generally expect these bars to have the same scale.
Here are the rules:
- Defaults matter! It doesn’t matter that you can do data bars correctly in Excel. The default should be to do it right and it should be hard to do it wrong.
- The “right way” to make data bars is to make the length of the data bar directly proportional to the value in the cell. If one cell has a value twice another it should have a bar that is twice as long.
- Remove the default gradient shading. The gradient makes it hard to tell where the bar ends, obscuring what you’re trying to show.
- Continuous cells with data bars should all use the same scale. Use different colors to indicate ranges that have different scales.
Excel 2007 supports at least twenty-five different combinations of ways of specifying the length of the data bar.

Exactly one of those ways is correct. Base the shortest bar on the number 0. Base the longest bar on the highest value. Turn off the gradient. If you want to see bars based off percentile or some custom formula, then be explicit. Create a new column, create your formula, create bars on that column.
Please, guys, this isn’t rocket science. This is plain common sense. You would not ship Microsoft Word with a glaring bug in the way text renders. You would not ship Excel with a broken statistical function that people use everyday. Delivering deceitful-by-design infographics betrays your central role in democratizing the analysis of data. Until you fix this, in-cell ASCII art still remains the best way to explore data visually.
A disclosure: We do not currently use Excel 2007 at Juice Analytics. This is not due to a high-minded sense of moral outrage but is merely a reflection of our clients' environments.
9 comments | Show all comments only the last 5 are shown
James L. said:
“The representation of numbers, as physically measured on the surface of the graphic itself, should be directly proportional to the quantities represented.”
Although unrelated to the article, I was wondering what your take would be on the use of logarithmic scaling? This violates "direct proportionality", but is quite common in scientific/engineering fields (I myself used it the other day).
Chris Gemignani said:
My initial thought is that logarithmic scaling doesn't work in the context of in-cell graphing. Log scaling would be really hard to use without an axis and is probably best when you're comparing time-series trends in a line chart. Thanks.
Will Oswald said:
"You would not ship Excel with a broken statistical function"... erm, unless you include the LINEST function that up until Excel 2003 did not adjust for collinearity in multiple regressions, a fundamental problem
Chris Gemignani said:
You noticed I qualified my statement with "that people use everyday". I have heard about this problem and others in with Excel's statistical functions. These problems should have been fixed as soon as they were reported.
R Varley said:
Hi, I'm trying to write an evaluation document on Excel 2007; everyone seems to think it's rubbish for statistics, but no-one says what's wrong. I've been trawling the internet for days, and turned up nothing beyond "Everyone knows it's broken". Can you give me any pointers?
Thanks.
Patrick O'Beirne said:
1st Oct 2007:
Data Bars – Feedback Please
Today’s author: Scott Ruble, the program manager who leads the charting and visualization efforts in Excel. Scott is looking for some feedback on potential changes to data bar behaviour.
http://blogs.msdn.com/excel/archive/2007/10/01/data-bars-feedback-please.aspx
Chris Gemignani said:
Patrick, I commented on the Excel databars post. I'm astonished that these questions keep coming up. The solution is simple: "You need to start with the absolute principle that the bars you show _must_ be proportional to the numbers they represent."
Greg said:
Is there a way to turn off the gradient fill in the data bars?
Matt Craig said:
I have used another way to represent data bars "in cells", but it is "clunky". I create a bar chart, turn off every interface element except for the bars, make the bars slightly transparent (if I still want to see the numbers), and then put it on top of the cells.
It works, but it's "clunky" - the scaling and placement is finicky. It is pretty nice when it's done though, as you can apply appropriate rules to the display of the graph. (e.g. 0 = 0 length), and it gets rid of the annoying gradient (the reason I did it in the first place)
Add a comment
Persuasive Presentations
By Zach Gemignani
June 5, 2007
Find more about:
powerpoint
presentation
presentations
Back in my consulting days at Diamond Technology Partners, I was known for my ability to bend PowerPoint to my will and fashion epic presentation-stories from lovingly-crafted slides. There was a term used when a client wanted a good looking presentation; they would ask if it could be "Zachified." Ah, the false glory.
Now I realize I was merely an amateur in designing presentations that could entrance and persuade an audience. I was going on instincts without much thought to the types of evidence, structure, and flow that would convince my audience.
Last week I had lunch with a man who has made a living from teaching others how to create effective presentations. His name is Andrew Abela and his blog is Extreme Presentations. Andrew has developed a thorough framework and training approach. He has a Doctorate and is a professor at Catholic University, so you know he brings an academic seriousness to the messed-up world of flufferpoint:
def (Withrop Hayes): A presentation that attempts to distract from the lack of substantive content or evidence with use of screenbeans, clip art, and other stock pictures or illustrations. A.k.a. clipterfuge (Todd Moy), clusterpoint (Cathy), The Macy’s Data Day Fluff Parade (Jamel).
Andrew gave me a quick backstage pass to his training methodology. Here are a few highlights:
1. Like a fool, I asked whether he preferred the sparse Lessig method or the more traditional, content-rich method. False choice. It all depends on the situation, just don't use the wrong approach at the wrong time. Andrew makes the distinction between "ballroom style" and "conference room style."
"Ballroom style presentations, like most typical PowerPoint presentations, are colorful, vibrant, attention-grabbing, and (sometimes) noisy. They typically take place in a large, dark room, such as a hotel ballroom. Conference room presentations are more understated: they have less color and more details on each page. They are more likely to be on printed handouts than projected slides, and they are more suited to your average corporate conference room. The single biggest mistake that presenters make is to confuse the two idioms, and particularly to use ballroom style where conference room style is more appropriate. I would estimate that upwards of 90 percent of all PowerPoint presentations use ballroom style, yet most of the time our presentation conditions call for conference room style."
That's from an article he shared with us called Achieve Impact through Persuasive Presentation Design (PDF)
2. It is important to mix data-based evidence with anecdotes. People need both of these types of information to persuade both the mind and heart (my interpretation).
3. Anticipate your audience's objections and build them into your storyline. What is better than having exactly the right slide next when someone raises a concern?
4. Good presentations require a lot of thought about their design. Andrew has defined five dimensions of an "Extreme Presentation": logic, rhetoric, graphics, politics, and metrics.
His blog offers a couple useful tools:
- A framework for choosing the right chart
- Slides that pass the squint test : "A good way to test whether your page is laid out properly is to apply what designers call the "squint test." Squint at the page, so that all the text is blurred and illegible. Do you get anything about the page without having to read the text? If you can see that the page is showing a process or two or three alternatives or a bunch of things converging, then your page passes the squint test."
4 comments
Jose Hernandez said:
Hello Zach,
I could not follow the link to the article "Achieve Impact through Persuasive Presentation Design (PDF)", can you please check.
Thanks
Jose
Zach said:
That PDF is now available.
Atilla said:
Hi Zach,
I noticed that the links at the end of your posts do not point to the correct URLs. For example, the NEXT link points to your current post (Persuasive Presentations) instead of "Excel 2007 and the Lie Factor" post, and the Previous link points to your current post (Persuasive Presentations) instead of "How to manipulate..." post. Is there a way to correct this? I have been trying hard to browse your archives but due to this problem, it is not possible.
Cheers
Tom said:
I like #3 a lot... As a college student, my professors are always telling me to do this in my essays, and I don't see any reason why it wouldn't be an effective strategy in a presentation. I'll have to give it a shot sometime soon.
Add a comment
Analytics Roundup: Databases and spreadsheets
By Chris Gemignani
June 4, 2007
Find more about:
agile
blog
development
excel
spreadsheets
- Sharing what matters | Economist.com
- This requires a new level of flexibility in the database. When building most databases today, programmers decide in advance what sort of questions users might wish to ask of the data, by defining what are known as the “schema”
- Allowing Business Users To Program Your System Is A Recipe For Disaster
- Parts of the solution make sense, but the problem is scare-mongering.
- Spreadsheet errors, news stories about spreadsheets with costly mistakes
- Ok, now who's going to measure the benefits?
How to manipulate scale to not make your point
By Zach Gemignani
June 1, 2007
Find more about:
charts
infovis
Today I got this chart in an e-mail from the well-meaning alumni fundraisers at my school:

Aside from the crazy looking gridlines, monsterous size, oddly phrased title, and overkill for a single data point, its greatest sin is not giving the Young Alumni enough credit. They blew away their goal of 300 gifts, you just wouldn't know it from this chart.
6 comments | Show all comments only the last 5 are shown
40ozKing said:
That is a monster. But I've seen similar from management consultants. Actually sat thru a training session with some consultants who spoke about charts and client communications, and then demonstrated with the dreaded 3-D Excel charts!
darrell said:
In addition, the chart scale is cropped at about 400, and the data clearly indicates 530 is the actual figure. Do I detect a hint of "well we can't tell them we're overfunded and then ask for more contributions"?
Send them a cheque with the signature cropped off. :)
Jorge Camoes said:
If you want to see a full range on malicious, misleading or plain stupid charts you just have to browse through some corporate annual reports. You can learn a lot on how to lie with charts.
(And then take a look at this one again.)
TonyV said:
To me, it just looks amateurish - I'm guessing that it wasn't intended as an example of all the mistakes that could be made in a chart? You could be cynical and interpret it as "now we've got the cash in, we've got better things to spend our time on than taking the effort to communicate properly". But I'm not cynical ...
derek said:
I don't see the size as monstrous (what is "size" in the world of computer display?). Instead I see the supporting text as teeny, a common mistake of "default chart" users in Excel.
I used to have a theory that this was caused by us older people who were taught to make scatter charts in science class at school by being given a sheet of graph paper to fill in. Since the accuracy of hand drawing is better the larger the grid, we used as much of the grid as we could, and squeezed the labels and title into the thin margins.
This theory, while neat and simple, doesn't account for a generation that by now ought to be far beyond drawing on paper, and should have grown up using computer charting tools.
James Taylor said:
The power of a college education! Great example.





0 comments | Add a comment
said: