Knowing Less Means More
By Juice Alumni
March 31, 2006
Find more about:
interface
One of our current projects it to design or re-design the user interface for the next generation of one of our client's products. While talking with the designers and engineers, you wonder why the people with the deepest understanding of the product aren't always the most qualified to design the interface.
The biggest challenge of designing the UI of a complex system is that as you become comfortable and knowledgeable of a design, the harder it is for you to accurately assess how simple and intuitive it is. How do you account for this? By constantly bring in fresh eyes at all aspects of the design process. The longer you've been toiling over a feature, the less likely that you yourself notice a glaring design flaw.
Harvesting Collective Genius
By Chris Gemignani
March 28, 2006
Find more about:
Lovely article in the New York Times business section on Sunday. Rite-Solutions, a software company, has built an internal marketplace to allow everyone in the company to invest in good ideas.
At Rite-Solutions, the architecture of participation is both businesslike and playful. Fifty-five stocks are listed on the company's internal market, which is called Mutual Fun. Each stock comes with a detailed description — called an expect-us, as opposed to a prospectus — and begins trading at a price of $10. Every employee gets $10,000 in "opinion money" to allocate among the offerings, and employees signal their enthusiasm by investing in a stock and, better yet, volunteering to work on the project. Volunteers share in the proceeds, in the form of real money, if the stock becomes a product or delivers savings.
This intrigues me both for its democracy and for the discipline of requiring people with ideas to write a clear, easy-to-read description of their project.
"We're the founders, but we're far from the smartest people here," Mr. Lavoie, the chief executive, said during an interview at Rite-Solutions' headquarters outside Newport, R.I. "At most companies, especially technology companies, the most brilliant insights tend to come from people other than senior management. So we created a marketplace to harvest collective genius."
Great ideas are equally possible coming from the bottom up as the top down. In particular, those "bottom" people have a lot of hands on experience with the way things actually work.
"There's nothing wrong with experience," said Mr. Marino, the company's president. "The problem is when experience gets in the way of innovation. As founders, the one thing we know is that we don't know all the answers."
I've worked at companies where a senior executive is vigorously pursuing an idea that most people know is doomed. No one person is willing to share the bad news, but what if he or she had to confront a slumping share price for their "expect-us"?
My not so Jiffy experience
By Zach Gemignani
March 26, 2006
Find more about:
customeranalytics
analytics
Funny thing happened to me last week on my way to an oil change - my car's engine was destroyed.
It all happened in a blink: I stopped by Jiffy Lube on my way to the bakery and swung up to the garage entrance, first in line. If you've ever had an oil change, you've probably experienced the old "preventative maintenance" up-sell: a technician pulls you out of the waiting room, gives you a grim, disappointed look, then explains the various parts of your car that are in severe need of service. In the past, I've been good at standing up to these automotive authority figures. I'd mumble "no thanks, maybe next time," not daring to look into the eyes which so clearly said: Don't you care about your own safety? This time, however, I broke down and gave the go ahead for an engine flush. I was assured that any sane car owner would have this procedure done every 15k miles; here I am at 80k without my first flush.
I knew something was wrong when I saw them pushing my car out of the garage half an hour later. I was assured it was no problem; they just needed to dry off my spark plugs. Two hours later I was calling for a ride.
All of which would have been a small inconvenience if I hadn't gotten a call the next morning letting me know they would need to replace my engine. Clearly something had gone terribly wrong with that engine flush.
I should say: I have little reason to gripe about Jiffy Lube. They are covering the engine replacement and a rental car. That said, there are a few lessons Jiffy Lube management might take from this situation:
- The edge cases matter. A while back we wrote (here and here) about analysis of anomalies and the opportunity for learning. One point that applies in this case: Collectively, outlier customers provide a service: they stress test the product and highlight unrealized strengths or weaknesses. In its desire to relentlessly upsell, Jiffy Lube has extended its service outside its comfort zone to a point of weakness.
- Data can make you smarter. I had an interesting conversation with the outside mechanic that is installing the replacement engine. He said I was lucky. My engine has a known problem with high levels of sludge build-up. He has seen other instances where an engine can be so full of sludge that an engine flush is incapable of breaking through the muck (like clogged arteries, I imagined) and the result is ruin. I get a refurbished engine with 80% new parts in place of an engine that was like the heart of an overweight cholesteral-holic. Maybe Jiffy Lube shouldn't be indiscriminantly upselling every customer. It wouldn't be difficult to build some filters into their system for high risk maintenance.
- Communicate with unhappy customers. Most companies would benefit from a simple alarm system for catching and responding to customers with particularly bad experiences. Something to appease them before they tell all their friends, family, and co-workers about their crappy experience (heck, they might even blog about it). All I ask as a customer is: a) recognize that you have created an inconvenience for me; b) convey that this isn't a status quo situation; and c) assure me that you will make me whole. Jiffy Lube wasn't effective in communicating any of these. They had an odd nonchalance that suggested this happens all the time, no single point of contact to speak to, and no apology for the inconvenience.
1 comment
Cujo said:
Sadly, the nonchalance you speak of indicates one of two things. One, the business is grossly mismanaged; I suspect this is not the case of Jiffy Lube, but what do I know? The other option is that they make so much money from upselling crap people don't need that it isn't even worth filtering or worrying about the expense when they hose someone's engine, as it's just part of the cost of doing business that way.
Add a comment
Good and bad charting examples
By Chris Gemignani
March 22, 2006
Find more about:
analytics
excel
visualization
Just found this (thanks instantcognition), which offers examples of poor charting design along with a discussion of how to make the charts better. Both the good and bad examples are repeatable in Excel.
Extreme Excel Charts
By Chris Gemignani
March 22, 2006
Find more about:
excel
screencast
This is a video created entirely using Excel bubble charts. It illustrates flows over the course of the year between two starting states and four ending states.
Click picture to view video
I want to stimulate discussion on creative charting methods using common tools, Excel or otherwise. If you have an example of a creative use of charting, let me know and we can all get a little better at illustrating information.
7 comments | Show all comments only the last 5 are shown
ACI said:
Very interesting animation. A couple of questions occur:
1. Did you create the animation in Excel using VBA (or other method) or was it just a massive series of charts that you screen-shot and pumped into Flash?
2. How hard was it to create the animation in Flash?
Chris said:
Its iterative VBA on the charts using a method sort of like this (http://www.juiceanalytics.com/weblog/?m=200506). A neat trick is you can use VBA to dump a GIF image of any chart to a file. That gave me a directory with 365 GIFs. Then I used Irfanview ( http://www.irfanview.com/) which is fast and free to browse through the GIF images. That's all there is to the "movie", holding down the right arrow key to browse the images successively very quickly. I like this technique because you can stop at any time and reverse if you see a pattern of interest.
ACI said:
So I'm giving some thought to trying to recreate a cloud chart (ex: http://lunametrics.blogspot.com/2006/03/visitor-clouds-and-web-analytics-and.html) in Excel but am having a tough time framing the required VBA, being a dillettante when if comes to VBA...
Have you tried this?
I'm thinking that it's just a bunch of text boxes with appropriately-sized text but am not sure how to even start thinking about how to make it all work together dynamically.
Babak said:
I think this would be of big help:
http://www.juiceanalytics.com/weblog/?p=38
ACI said:
very cool indeed (RTFM, or in this case RTFB). But WAY beyond my ability to hack for a much simpler implementation :P
Kelly O'Day said:
I liked your idea so much that I tried it on a trend chart.
It's was very easy to set up a VBA routine to create GIFs.
I dwnloaded Irfanview and created a humble animation. A good start that has given me some ideas.
One question, your demo showed a nice movie type display. My demo was ugly in comparison. black backgground, no control options and a display of file path and name across top.
How do you actually capture your movie. I recorded slide show. I am missing somehting that you did.
Once I have adecent movie, I'll submit it. Can you give me some additional tips on using Irfanview? Thanks
Kelly O'Day
Chris said:
Kelly,
I wonder if it's a transparency problem? I believe my images were set to a white background in Excel prior to exporting them to GIF.
The right and left arrows will cycle through images in a directory. Holding the right arrow down gives a convincing 30 fps. That is what I am using to "animate" the movie.
Also, I _did not_ put Irfanview into full screen mode (which I think is what's giving you the black background, etc.). Instead, I captured the movie using Camtasia Studio on a portion of the Irfanview window. You can do the same thing using Windows Media Encoder, too.
Hope this helps,
Chris
Add a comment
Presentation checklist: always simplify, never screenbean
By Zach Gemignani
March 20, 2006
Find more about:
powerpoint
presentations
The business world is awash in piles of words mis-classified as PowerPoint presentations. Here are three slides from a webside development vendor's pitch:

Even at a distance, it's clear you wouldn't want to sit through this. Creating a compelling presentation is tough. There are, however, a few things that can make a big difference for your audience. Before you save your PowerPoint file as "final" and send to print, here's a checklist of things that you should always do, do more of, do less of, and never do.
Always...
- Tell a story. Great presentations tell a compelling and cohesive story. Stories have themes, characters, plotlines, and a message.
- Build a flow. Try reading your slide headlines. Do the slides connect? Do they tell a story? Don't be afraid to shuffle your slides to find a cleaner way for the concepts to fit together.
- Provide a roadmap. Its easy to lose an audience if you have a complex argument or storyline. Give them sign-posts that let them know where they are and where they are headed.
- Banish slides to an appendix. When I build a presentation, I will inevitably end up with slides that don't fit neatly into my story. Find a home for them in the appendix; it's the "green room" of your presentation where potentially useful material can hang out until it's needed.
- Simplify. Your story can be simpler. Your slides can have more whitespace. Be ruthless in cutting.
More...
- Pictures, including full page pictures. Cliff Atkinson of the beyond bullets blog is a big proponent of pictures in place of words and offers a good, though extreme, perspective.
- Changes of pace. Long, data intensive presentations easily become monotonous. Break these up with different kinds of slides, multimedia content, or audience interaction.
- Animation in slides. While animation can be a distraction, it does have its place. In particular, if you have a complex slide, consider building the layers of information. It forces your audience to focus on just one part at a time.
- Quick punchlines. It is always tempting to develop a presentation that gradually builds to an exciting conclusion. Save it for your great American mystery novel. Your audience is impatient to get to the point. Provide it to them up front.
Less...
- Words. People use extra words to cover up when they don't know precisely what they want to say. Enough said.
- Bullets. A list with more than five bullets is beyond most people's ability to process or remember. Long lists can be broken into subgroups.
- Bullet-points. For short, simple lists, remove the bullet points.
- Stock clipart: Microsoft has been generous enough to provide a world of clipart fun. Avoid that world as much as possible. Clipart is mostly useless filler.
- Animation between slides. Unlike in-slide animation which can focus your audience, animation between slides is simply a distraction.
- Capitalization. There are rules for when to capitalize. Capitalizing Almost Every Word in a Sentence is not One of Them.
Never...
- Use ScreenBeans. Check this out: I found the ScreenBean store. For just $59.95, you can get a whole package of digital ant people running around in business settings. Trust me, there are cheaper ways to ruin a presentation. On that note, I found this slide in a presentation entitled: "Effective Presentation: Using Microsoft PowerPoint." Ironic?

3 comments
» Communication that Sticks - Juice Analytics said:
[...] We can construct engaging stories from the data and put together winning PowerPoint decks, but the window of opportunity to communicate our results always seems frighteningly narrow. Weeks of analysis and synthesis can get crammed into a single 30-minute agenda-item. The cramming part isn’t so much the problem (it demands focus on the key results)–I am more concerned about the fleeting attention of an audience that has a dozen other priorities, is awash in information, and may be data-phobic. Not to mention the risk of getting derailed by conversations about the data sources and statistical significance. [...]
» Lingua Analytica, or How to Impress your Boss with Sniglets - Juice Analytics said:
[...] 7. __________: A presentation that attempts to distract from the lack of substantive content or evidence with use of screenbeans, clip art, and other stock pictures or illustrations. [...]
Chongololo said:
I actually make and give quite a lot of presentations and I LIKE screenbeans!
Add a comment
Public Data for the Public
By Juice Alumni
March 20, 2006
Find more about:
googleearth
google
We've blogged recently about the rich data available from the census bureau and the ability to visualize it with low cost GIS tools like Google Earth. Our friend Ducky recently released her mash-up of Google maps and census data that shows the potential for dynamic and informative maps that are easy and free to use.

This data is available to everyone, but compare the data you find at the census bureau to some of the maps at Webfoot. The census bureau presents a bunch of numbers and Webfoot made them come alive. At Juice, we call this "democratizing data" and it shows the true potential of the next generation of web tools to empower your organization and put the data at the hands of the people that need it. Check out Absolutely Google Earth for a source of tools and resources to do the same.
100 PowerPoint Ideas
By Zach Gemignani
March 16, 2006
Find more about:
powerpoint
Sometimes when you're starting a presentation, you need a little help getting over the hump in deciding how to present data and concepts. The following PowerPoint deck contains a collection of ideas and visuals that can help when you're brainstorming.
I hope this is a valuable resource for presentation writers looking for inspiration, widgets, nice-looking charts and graphics, and ideas on structuring slides.
It's organized into four sections:
Widgets: Little graphics that help call out an idea or show where you are. Examples include: Harvey balls, bullets, brackets, arrows, and outline styles.
Charts: Different ways to use PowerPoint's built-in charting to create nice looking graphics. Examples of column charts, pie charts, commented pie chart, multi-column charts, waterfall chart, radar charts, and representing data with chart areas.
Graphics: This includes tables, value chains, linked chains, representing three dimensions of value, showing market forces, project structure, project timelines, segmentation schemes, ranges, positives and negatives, product roadmaps, flows, processes, Venn diagrams, filtering opportunities, and showing market space.
Slide Structures: Ways of framing the content of a slide, including personas, multi-charts, section breaks, and calling out important concepts.
7 comments | Show all comments only the last 5 are shown
Mary said:
My question is: can one just lift these bits and use them? How do I do that?
Chris said:
One can just take these bits and use them. Just copy any graphics you want from the 100 PowerPoint ideas deck and paste into the new project you're working on.
You are a very helpful and kind sharing person, thanks a million!!!!!! said:
Thanks 1000 millions times over ,not enough, as most of these software pakages are so easy to use for those who have used them intensively, or have looked at them sytematically.
So when those people pass their knoweldge on, like your self, well thats normally once in a blue moon thing.
Anyway i must fi my jumping cursor problem on my vaio notebook, i really have found life easy now with presentations and dont have to stare at those beatiful women in their skirted suits making me think about other things i am so transfixed with these prentations that i love my diagrams and flow charts etc.
Kind Warm regards
Jean Piere
Chris said:
From IM this morning:
<strong>Chris</strong>: best comment ever
http://www.juiceanalytics.com/weblog/?p=138#comments
never to be topped
<strong>Zach</strong>: haha
that kills
<strong>Chris</strong>: you've got to write a response
<strong>Zach</strong>: I don't know where to begin
Thanks so much Jean Pierre for the kind comment! I’m glad we could help you get your mind back on your work and off those walking distractions.
Greg said:
great stuff.thanks
Lucia said:
Thanks for sharing. This is a valuable collection of ideas!
op said:
Thanks for sharing. I am already in Office 2010 and these power point ideas are still useful
Add a comment
Microsoft Office: Fighting the Bull
By Chris Gemignani
March 15, 2006
Find more about:
excel
powerpoint
We're cutting through it this week with an extra helping of Microsoft Office help. Coming soon: PowerPoint brain food, Excel charting de-gunkulator, and Excel charting extreme edition.
But first, check out fightthebull.com, an add-in for Word and PowerPoint that takes syntax checking in a more useful direction. This adds a toolbar to your apps that determines how readable your writing is and how prone you are to biz-talk. Because, wihle you cn sltil raed tihs, you probably can't decipher this:
One of Unicorp's corporate objectives is to develop strategic relationships with key customers and be recognized for our ability to deliver services of superior value. This competitive advantage will be achieved through continued focus on our core competencies, management attention to the development of operations and process management excellence in all parts of our business, the identification and application of best processes, and continued attention to direct and indirect cost management. The focus on core competencies will promote the concentration of knowledge in select areas consistent with the tenets of Unicorp's strategic plan, Unicorp 2000. Management's attention to operations and process management excellence in all business areas will be achieved through the continued expansion of our management and technical staff, as well as through consistent application of corporate quality programs such as benchmarking and continuous improvement, leading to the establishment of Unicorp's superior business processes in each core competency. Finally, continued attention to direct and indirect cost management will enable Unicorp to offer customers a superior, value-added package of high-quality service at a competitive price.
1 comment
Mary said:
I can decipher that--it is much like edu-lingo. Still your new tool sounds interesting. How can I attach it to my computer?
Add a comment
Excel Keyboarding Game
By Chris Gemignani
March 15, 2006
Find more about:
excel
screencast
You can get a lot faster with Excel if you learn to use keyboard shortcuts instead of the mouse. Here's a fun Excel-based game that we use at JuiceCo to teach one basic keyboarding skill. I guarantee this will make you better and faster with Excel.
Click picture to view video. The audio seems to be infested by aliens and drops out midway through. Many apologies.
If you're viewing this in a feedreader, click though to see a screencast of the game.
5 comments
phil pfeifer said:
Instead of cntrl and arrow, I use end with arrows. It is surprising how few people know how these tricks. I call it the "roller coaster" phenomenon when, instead, people over and undershoot the end of a block when trying to paint with the mouse.
Chris said:
My best times are:
Race 1: 0.59 seconds
Race 2: 1.13 seconds
Race 3: 1.44 seconds
Race 4: 1.67 seconds
Race 5: 4.77 seconds
Race 6: 8.80 seconds
Race 7: 13.20 seconds
Race 8: 4.84 seconds
Anyone doing better?
-Chris
Josh said:
Oh, please make it so your webcasts don't auto-start!!!
Also, the flash app seems to be capturing every time I hit space bar. So, as I type this comment, it starts and stops the recording each time I hit space between words. AAAAHHH!! (Safari 2.0.3)
Chris said:
Josh,
You must be really sick of hearing my voice by now. I just checked Windows Firefox and Windows IE and the clips _do not_ autostart (which is good). So, I've localized my abuse to Mac users. Very sorry.
Yaniv said:
This game doesn't seem to be showing up on this post, and I don't see a working hyperlink anywhere. I tried to copy paste into my browser (firefox) http://www.juiceanalytics.com/flash/excel_keyboard_race,444,324
and I got a page not found error.
Thanks for reposting the link if it's still up.
Add a comment
The Heart of the "Competing on Analytics" Matter
By Chris Gemignani
March 14, 2006
Find more about:
analytics
I confess to a lack of ambition. When Tom Davenport's article on "Competing on Analytics" came out in the Harvard Business Review in January, Zach and I critiqued Tom's list of 10 things that are "what it takes to be an analytics competitor" because it offered a good example of condensed misperceptions about what analytics can and should do.
But we didn't address the heart of the matter; the biggest way analytics will change over the next decade and the reason we're in business.
Niel Raden does. He offers a critique of "Competing on Analytics" that is focused and deep. I invite you to read it as well as his other publications. This is good thinking by a very experienced analytics consultant.
There are two schools of thought when it comes to the value of BI in general. One is that it is best used by “quantitative” types and other analytical business people, who can spot trends and analyze patterns to assist in the big decisions and set and direct strategy. The other position is that BI is at its best when helping a broad range of people and processes at an operational level, marginally improving performance, repeatedly and often. The former is the commonly held view of management consultants and, previously, BI practitioners a decade ago. The latter position gained currency in the last few years and is now widely seen as borne out in practice. Using BI to form a new strategy for a global financial services firm makes for good marketing collateral, but when it comes to ROI, lots of small improvements are the way to go.
Why is centralizing analytics bad and decentralizing analytics good? Why shouldn't your organization have a single centralized "brain" that directs the far-flung body to intelligent, purposeful action?
Centralized control of data and analytical expertise may not seem very controversial, but what Davenport is implying is not only centralized control, but also centralized design. This is another naïve assumption, because many organizations are not only decentralized—they’re dysfunctional. Separate units within organizations often need autonomy because they are just so different from the rest of the organization. In addition, as an organization becomes more “agile,” which is a definite trend, decision-making, even for the big decisions, will become more decentralized. Imagine how difficult it will be to buy or sell pieces of a company if the “brain,” the centralized analytical capability, stays with the parent and there is no local expertise?
In our experience, there are hundreds of decisions that need to be made each day, even in a medium-sized organization. In most cases, those decisions are being made in a vacuum—on faith, trust, gut-feel, and partial information. Statistical significance tests aren't needed to improve these decisions, just basic, easy-to-use visibility into business processes.
Statistical analytics can be helpful, but perspective and experience are even more useful to find insights in data. Democratizing data in your organization, making it easier to put more eyes, more experience, more brains against your data is the challenge of the next ten years in analytics. The Internet abounds with examples of what people can do when they can get their hands on data.
Incidentally, if you do want to hear Tom Davenport's side of the argument, you can catch a webcast on Thursday, March 16th at 1:00pm EST. Here's a link.
3 comments
val said:
I must concur with your reasoning and hesitation to affirm Tom's vision (better say his vantage point). Tom's perspective smacks of conceptually dominated reasoning. It is devoid of a theoretical basis that would suggest that centralization has real opportunity costs (e.g., like all centralized IT). Second it is devoid of the organizational realities asscoaited with the division of labor within firms. Third it fails to appreciate the nature of tools available to us today.
I am not impressed with Tom's piece because it reflects no basis, in theory, experience or widom. I will go a step further to suggest that his perspective is based upon his big IT/Systems Integration propensity.
Mary said:
I couldn't agree with you more Chris. Working together, brainstorming, cooperation is always the best way to go in problem solving of any type. Two heads are better than one... etc.
Jay said:
Hi,
I agree with a great deal of what has been said here however I want to play the other side for a moment on a few of the issues.
To begin with why not do both? Have centralized teams of “quants” (which can exist in a single department or at a corporate level) and give the reporting tools to everyone else.
Just as there are downsides to keeping everything to one high level team there are also downsides to releasing data to everyone. To flip the tables in my work experience I have seen many occasions where these quant teams have out performed “business” people consistently by 100% to 300% in ROI on things such as marketing campaigns or business process re-engineering. Conversely I have seen occasions where business people come up with ideas that create such ROI increases as well.
Just as you are subject to the great ideas that come from having “many eyes” on the information you are also subject to the bad ideas that come form the same process. The same can be said for the small analytical groups. The upside should ways be weighed against the downside in order to define the benefit.
Sometimes data (manipulated properly) can become self-fulfilling evidence. There is a lot of thought and care that is required in order to relate historical outcomes accurately to future outcomes. It is not that only quants can do this as the issues are more conceptual and are issues of experimental design; which we naturally figure out in many parts of our lives. There are lots of constraints during the real-time operation of a business like time, people, cost, quality, etc…but you should always strive for the absolute best answers possible within your constraints.
Any decision about something that has not yet happened is a forecast of a future outcome. By not testing something in a reliable fashion the decision is exposed to an unknown level of potential risk. Very loosely speaking the reliability (accuracy to the future outcome) of the testing (decision making) method is inversely related to that amount of risk. Of course there will always be uncertainty in decision making that is a reality.
Perhaps a more balanced overall perspective is best. Take both approaches and see what happens. You will not know which is best or even if either are providing benefit until you try. Just know that both can be beneficial and it is really hard to make judgments absolutely in either direction.
Add a comment
The Future Problem with Enterprise Data Warehouses
By Chris Gemignani
March 8, 2006
Find more about:
analytics
A friend of mine with years of analytics and management experience at large companies wrote recently. He put his finger squarely on a real issue with enterprise data warehouses--they're built to solve today's problems (not tomorrow's) and they're designed for the needs of analysts that are more statisticians than experts about the business.
I wanted to provide some comments on the enterprise wide data warehouse and the challenges it presents at large corporations. Jim Novo certainly seems to support the roll up approach (I'm on his mailing list) but I agree with Juice that it is too slow, too costly, and results in restricted analytics the way most large companies build them. Most of the large data warehouses I've worked with only include data variables that are key to managing a business TODAY as the warehouses are too big and costly to store data variables with a low usage frequency. They also attempt to cleanse the data by classifying. This makes life easier for an analyst with statistical experience but a limited knowledge of the business. However you're losing information. Problem: You do not know what will be important in the future. Distributed databases at a line of business or product level tend to store more raw data to manage and analyze the business. Sure, the amount of space used would be the same if you simply put the data into the warehouse but that is not the way decisions are made. Decision makers look at the frequency of use of the data variables (TODAY) and the cost to include them then decide which get deleted or excluded. Also, the analysts who are disconnected to the business lines do not understand the raw data.
Let me give you a real world example. Our data warehouse classifies claims into a limited number of claim reason categories. This might be considered a poor design but please see above about analysts that are disconnected from the business lines. When a new type of claim is developing, the person classifying the claims (claims rep) does not have a category to select so they just select what works best to fit into the pre-defined categories. Information is lost due to the restrictions of the allowed categories within the data warehouse. If the notations from the claims system would have been stored (an unforeseen variable) in the warehouse and text mining analytics were being done, a specific word would have been found associated with claims at an alarming rate. Identification of this word would have allowed for early recognition of the issue. It cost us a lot of money in claims due to the missing data but no one thought to include the notes due to the size and costs. Well, we have them now.
Little Pictures Tell Big Stories
By Chris Gemignani
March 7, 2006
Find more about:
analytics
visualization
Zach mentioned our penchant for telling a story about customers using little data-dense pictures. Happily, this style of data visualization won "Best in Show" in DM Review's 2005 data visualization competition. Follow this jump for an excellent example of a data dense visualization created using Tableau.
Judge customers by behavior, not fur color
By Zach Gemignani
March 5, 2006
Find more about:
customeranalytics
segmentation
To a stranger, my two dogs look alike. To me, they couldn't be more different. They came from different dog shelters and are more than two years apart (that's 14+ human years). Here they are: Maddie has her chin resting on Ally.

Ally is twitchy, a mama's girl, frightened of loud noises, and getting creaky. Maddie is confident, independent, curious about the loud noise, and energetic. Ally loves other dogs and distrusts new people. Maddie adores all people and is suspicious around certain dogs. Their features and personalities couldn't be more different. I've had some time to get to know them.
When we meet a stranger on a walk (particularly one who isn't a dog owner), we often get: "They must be related." Our denials don't seem to phase these people as they point to the obvious evidence: "...but they are exactly the same size and color."
Superficial judgements are natural - a first level of defense to categorizing and manage a complex world. However, it's unhealthy to not try to dig deeper. For some businesses, superficial characteristics are as far as the analysis goes when segmenting or profiling customers. A better approach is to look at customer behaviors which provides a much more accurate reflection of interests and needs. Jim Novo, marketing consultant, agrees:
Customer behavior is a much stronger predictor of your future relationship with a customer than demographic information ever will be
Simple customer characteristics can be easy to come by; age, income, zip code are probably part of your basic customer database. In contrast, behavioral segmentation is a more initimating analytical challenge. Here's the approach we've used successfully at Juice:
- Create individual pictures of customers that visually show their behaviors over time. The trick is to create a "visual language" that represents actions and is intuitive
- With a dash of Excel, SAS, and python code, we generate thousands of these pictures of individual customers
- We visually scan for common patterns of behaviors and the associated success/failure points (e.g. repurchase, upsell, churn, etc.)
- Finally, we work backwards from our new understanding of behaviors to segment customers based on statistical measures of behaviors.
This approach differs from traditional data mining-based approaches that drill down from the top looking for patterns. We start at a very granular level and looks for patterns (using the power of the human visual system). It may sound a little crazy, but we've found that it can be both insightful and highly predictive.
Census Data in Google Earth
By Chris Gemignani
March 1, 2006
Find more about:
analytics
census
googleearth
We love Google Earth because it puts the power to explore data in the hands of average folks. We've been exploring uses for census data and wanted to share some of this data with the world.
What you're seeing here is a map of the counties in the United States colorized by median age. Lighter colors are older.

Census data is also available at the block group level which is much, much more detailed.
Without further ado, what follows are three sets of links for each state which allow you to explore population density, median age, and male/female ratio in each state at two levels of detail. Google Earth is required. We did have some ftp issues when uploading these files, so if you have any problems, let me know and I'll re-upload the file.
Population Density
Lighter is higher population density (white is 800+ people per square mile), Dark is lower population density (black is 2 or fewer people per square mile)
Median Age
Lighter is older median age (white is 46.0 years median age), Dark is younger median age (black is 29.0 years median age)
Male/Female Ratio
Lighter means more men than women (white is 55% men), Dark means more women than men (black is 45% men)
If you want to know more about Google Earth, check out our Absolutely Google Earth a collection of tools and resources to get you started.
We're working on a project to make this and other simple mapping applications more widely available. If you're a python guru who is interested in building great mapping applications like Chicago Crime give me a jingle.
35 comments | Show all comments only the last 5 are shown
Shane said:
I found this really interesting, thanks guys. It would be nice if there were some translucency or something, so that you can see the satellite picture also, is this possible?
Also one other suggestion maybe you could use the population data you already have to improve the other maps. For example, Central Park in New York has misleading median age and m/f ratio information, maybe sections like this could be transparent or some to colour off your scale, eg green.
Chris said:
Thanks Shane. We're on top of that translucency thing. Look for a screencast very soon. All the census block groups are supposed to have roughly the same population base, so there _shouldn't_ be population-based problems. I'll take a look at it, though, because there are definitely some outliers.
Josh said:
Very cool. I'm just getting into ArcView at work... I'm definitely going to have to figure out how to export some data to Google Earth. It just makes it so much more fun! I work with political data, and it seems like I should be able to do the same thing with precincts and electoral data that you did with block groups and census data.
BTW, it seems like you might need to set a mime-type on your server for .kmz files.
http://www.keyhole.com/kml/kml_tut.html#kml_server
Safari downloaded your files as text.
Babak said:
Shane,
You should be able to change layer transparency without waiting for Chris' screencast. In google Earth, select the census overlay layer, and there should be a horizontal scroll bar (in between 'Places' and 'Layers'). Drag it to the left and you'll get what you want.
Let me know if you need any more help.
Rob Mosley said:
Is there a quick way to download all the files in one go? Or do I need to click on every link?
Chris said:
Rob,
There's no quick way to grab all the files. We'll look into creating an archive containing all the files. Thanks.
Chris
Brian said:
Thanks for the great data!
How about a more detailed legend? For example, you state that in the pop. density maps, white is 800+ / square mile and black is such and such, but what about the other colors?
adaptive_tech said:
Hi guys -- great work, thanks for sharing this.
One quick question: Do you plan on releasing a version that has the actual census block group numbers in there? It would be great to relate data back to block groups as described here:
http://ftp2.census.gov/plmap/pl_blk/st06_California/
Chris said:
Thanks, err, adaptive.
We are considering some more releases, one of which is to be just the labeled outlines. Anything else you want to see?
- Chris
Brad said:
Is there a way to look at geocoded census information by county.
I need to look at census tracts, street addresses, road networks for the city of Tulsa, OK county is Tulsa.
I have low income census tract numbers and need to know where these neighborhoods are located.
Dave Robinder said:
Every time I try to open on of these files, Google Earth crashes. The KML looks fine as near as I can tell. I'm using the latest stable version of Google Earth. Any ideas on what I might be doing wrong? It doesn't seem to matter which file I use.
Chris said:
Sorry, Dave. I don't have any idea. I just opened a few on the latest stable Google Earth and had no problems. It's a cop-out, but maybe a reinstall is in order?
Chris
Joe said:
Awesome work here! The marketing data that can be generated from census extrapolation could be very valuable... thanks for sharing your KMZ's.
I second Brian's comment. Can we get more granularity in the population color legend? Or can you simply state what the colors mean?
Mathieu said:
Awesome, congrats !!
Do you know how I could get this kind of information from the Canada ?
Thanks !
Chris said:
Mathieu,
I don't know where to get this info for Canada. If anyone does, please post.
Thanks,
Chris
Mathieu said:
Chris,
Nevermind!
I mean, do you have this kind of information (pop. density) of the Canada ?!
Chris said:
Mathieu,
Sorry, I don't have it.
Chris
ytJohn said:
Chris,
A few other people have mentioned this, but I would just like to add my vote/request/begging for a legend of sorts. Even if you provided it as a static web page that I could refer to, having a legend of color to population density would be great.
Also I like you idea of outlining the block level regions. I noticed expanding out the state name gives you a list "block group 1", "block group 2", and so on, and the names repeat themselves, but give a detailed number of people per square mile. Clicking on the name lets me see where it's referencing, and then I can zoom in and make out the area, but it's a guessing game to figure out where a block is and how far it extends. Labelling it with the region numbers that the census bureau provides would definately make things more efficient.
Those two things aside, this is a incredible piece of work and is just about what I was looking for to help me with a grant I am applying for.
Aidan said:
This is a great post - thanks for sharing the ideas and data files. I'm helping someone out map Census data and found your examples very helpful. I've put together a <a href="http://censuskml.blogspot.com/" rel="nofollow">blog</a> to track my experiments. I've been playing with the 3rd dimension to show the data.
One of the things I've found is that Google Earth [at least the Mac version] seems to have a bug when it comes to loading 3D polygons from KML files in that it shows very dark colors, regardless of what the actual color is in the KML file. If you edit the object, the color auto-magically brightens, have you guys noticed this?
Thanks again!
Robert Kosara said:
Wow, this is great stuff! Information wants to be free and such, wonderful!
BTW, are you aware of Imran Haque's <a href="http://gecensus.stanford.edu/gcensus/" rel="nofollow">gCensus</a> project at Stanford? Saying that it is very similar to your project would be an understatement, perhaps you guys could work together?
Zach said:
We did just find out about the gCensus project, and I agree that it covers similar ground. We got pretty far down this path then were distracted by other shiny objects. Even so, it would be good to get some of our capability out there for others to use.
jennifer said:
This is great. I'd love to also see median income, if that information is available.
Internet Briefing Blog / GISle said:
[...] GIS-Daten grafisch darzustellen war fürher kostspielig. Google Earth bietet hier eine billige Alternative. So geht das. Und es sieht auch noch spannend aus. Noch spannender wäre es, wenn die grafische Darstellung von Vernetzung mit solchen Landkarten-Darstellungen in einem mash-up zu sehen wären. Nur so ein Gedanke, wahrscheinlich ein saublöder. Ist ja schon gut, man wird ja noch… yigg this! — save to del.icio.us [...]
Aidan said:
Jennifer - I have put together maps of Median HH income at <a href="http://censuskml.blogspot.com" rel="nofollow">CensusKML</a> in this <a href="http://censuskml.blogspot.com/2007/03/getting-better-color-schemes.html" rel="nofollow">post</a>. Feel free to drop me a line and I'll happily share the KMZ files.
Nick said:
The data you have put together is excellent. Thanks so much.
I have a question. I assume that the data is from the 2000 Census. Do you have data from previous years (i.e. 1990, 1980, etc.)? With multiple years of data it would be possible map the growth of a city over time.
Thanks again.
Patrick said:
Related to Nick's question - can you confirm that these layers are based on the 2000 Census?
Thanks
Chris Gemignani said:
Nick/Patrick: This data is based on the 2000 census.
Eric Bobrow said:
Is there any way (using Google Earth and your data or additional data available online) to draw a polygon over an area of the map, and get a population count?
We are trying to work out area definitions for sale/representation of our software products, and it would be very helpful to know how many people are in a particular area on the map.
A munir said:
Hi Chris. I am interested in using Google Earth to find Clutter info. Basically I need to know if a zipcode is in Dense Urban/Urban/Sub-urban or Rural clutter. Do you have any suggestions?
Any idea will be appreciated
Chris M. said:
I am ecstatic about the info you have provided!
How about ethnicity data? Is there a way to access that in this same format?
Much appreciated and thanks again!!
Chris M.
Andrew C said:
I have been looking for good detailed county overlays for a long time... I hate that they disappear when you zoom in. Finally found your website - perfect! Many thanks.
Wai Lee said:
Where did you guys get your information on geographic boundaries? You seem to have stored the county-boundaries as lists of lat-lng pairs (in the KML document). I can't imagine you got these lat-lng pairs from a map. Are they published somewhere?
Thanks!
Chris Gemignani said:
Hi Wai Lee: I'm sorry to say, I don't recall exactly where we got the boundary data. There are published government sources that typically include geographies in shapefiles. I believe we extracted those out using Python and generated the KML also using Python. We did a very slight amount of shape simplification (coalescing nearby points) to make things easier on Google Earth.
Aaron Coleman said:
Thank you for posting these .kmz. I just found .kmz on the EPA website showing air pollution throughout the country. Any idea why Google hasn't gotten in kahutz with the census bureau and published this info along with their existing data. Guess it's a lot of info in one software.
Dan said:
Hey Chris,
This is awesome work. I was wondering if you knew where the data is from? Is this the 2000 Census data?







0 comments | Add a comment
said: