Knowing Less Means More

One of our current projects it to design or re-design the user interface for the next generation of one of our client's products. While talking with the designers and engineers, you wonder why the people with the deepest understanding of the product aren't always the most qualified to design the interface.

The biggest challenge of designing the UI of a complex system is that as you become comfortable and knowledgeable of a design, the harder it is for you to accurately assess how simple and intuitive it is. How do you account for this? By constantly bring in fresh eyes at all aspects of the design process. The longer you've been toiling over a feature, the less likely that you yourself notice a glaring design flaw.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






Harvesting Collective Genius

Lovely article in the New York Times business section on Sunday. Rite-Solutions, a software company, has built an internal marketplace to allow everyone in the company to invest in good ideas.

At Rite-Solutions, the architecture of participation is both businesslike and playful. Fifty-five stocks are listed on the company's internal market, which is called Mutual Fun. Each stock comes with a detailed description — called an expect-us, as opposed to a prospectus — and begins trading at a price of $10. Every employee gets $10,000 in "opinion money" to allocate among the offerings, and employees signal their enthusiasm by investing in a stock and, better yet, volunteering to work on the project. Volunteers share in the proceeds, in the form of real money, if the stock becomes a product or delivers savings.

This intrigues me both for its democracy and for the discipline of requiring people with ideas to write a clear, easy-to-read description of their project.

"We're the founders, but we're far from the smartest people here," Mr. Lavoie, the chief executive, said during an interview at Rite-Solutions' headquarters outside Newport, R.I. "At most companies, especially technology companies, the most brilliant insights tend to come from people other than senior management. So we created a marketplace to harvest collective genius."

Great ideas are equally possible coming from the bottom up as the top down. In particular, those "bottom" people have a lot of hands on experience with the way things actually work.

"There's nothing wrong with experience," said Mr. Marino, the company's president. "The problem is when experience gets in the way of innovation. As founders, the one thing we know is that we don't know all the answers."

I've worked at companies where a senior executive is vigorously pursuing an idea that most people know is doomed. No one person is willing to share the bad news, but what if he or she had to confront a slumping share price for their "expect-us"?

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






My not so Jiffy experience

Funny thing happened to me last week on my way to an oil change - my car's engine was destroyed.

It all happened in a blink: I stopped by Jiffy Lube on my way to the bakery and swung up to the garage entrance, first in line. If you've ever had an oil change, you've probably experienced the old "preventative maintenance" up-sell: a technician pulls you out of the waiting room, gives you a grim, disappointed look, then explains the various parts of your car that are in severe need of service. In the past, I've been good at standing up to these automotive authority figures. I'd mumble "no thanks, maybe next time," not daring to look into the eyes which so clearly said: Don't you care about your own safety? This time, however, I broke down and gave the go ahead for an engine flush. I was assured that any sane car owner would have this procedure done every 15k miles; here I am at 80k without my first flush.

I knew something was wrong when I saw them pushing my car out of the garage half an hour later. I was assured it was no problem; they just needed to dry off my spark plugs. Two hours later I was calling for a ride.

All of which would have been a small inconvenience if I hadn't gotten a call the next morning letting me know they would need to replace my engine. Clearly something had gone terribly wrong with that engine flush.

I should say: I have little reason to gripe about Jiffy Lube. They are covering the engine replacement and a rental car. That said, there are a few lessons Jiffy Lube management might take from this situation:

  • The edge cases matter. A while back we wrote (here and here) about analysis of anomalies and the opportunity for learning. One point that applies in this case: Collectively, outlier customers provide a service: they stress test the product and highlight unrealized strengths or weaknesses. In its desire to relentlessly upsell, Jiffy Lube has extended its service outside its comfort zone to a point of weakness.
  • Data can make you smarter. I had an interesting conversation with the outside mechanic that is installing the replacement engine. He said I was lucky. My engine has a known problem with high levels of sludge build-up. He has seen other instances where an engine can be so full of sludge that an engine flush is incapable of breaking through the muck (like clogged arteries, I imagined) and the result is ruin. I get a refurbished engine with 80% new parts in place of an engine that was like the heart of an overweight cholesteral-holic. Maybe Jiffy Lube shouldn't be indiscriminantly upselling every customer. It wouldn't be difficult to build some filters into their system for high risk maintenance.
  • Communicate with unhappy customers. Most companies would benefit from a simple alarm system for catching and responding to customers with particularly bad experiences. Something to appease them before they tell all their friends, family, and co-workers about their crappy experience (heck, they might even blog about it). All I ask as a customer is: a) recognize that you have created an inconvenience for me; b) convey that this isn't a status quo situation; and c) assure me that you will make me whole. Jiffy Lube wasn't effective in communicating any of these. They had an odd nonchalance that suggested this happens all the time, no single point of contact to speak to, and no apology for the inconvenience.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

1 comment


January 15, 2007
Cujo said:

Sadly, the nonchalance you speak of indicates one of two things. One, the business is grossly mismanaged; I suspect this is not the case of Jiffy Lube, but what do I know? The other option is that they make so much money from upselling crap people don't need that it isn't even worth filtering or worrying about the expense when they hose someone's engine, as it's just part of the cost of doing business that way.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Good and bad charting examples

Just found this (thanks instantcognition), which offers examples of poor charting design along with a discussion of how to make the charts better. Both the good and bad examples are repeatable in Excel.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

1 comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Extreme Excel Charts

This is a video created entirely using Excel bubble charts. It illustrates flows over the course of the year between two starting states and four ending states.

Click picture to view video

I want to stimulate discussion on creative charting methods using common tools, Excel or otherwise. If you have an example of a creative use of charting, let me know and we can all get a little better at illustrating information.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

7 comments | Show all comments only the last 5 are shown


March 23, 2006
ACI said:

So I'm giving some thought to trying to recreate a cloud chart (ex: http://lunametrics.blogspot.com/2006/03/visitor-clouds-and-web-analytics-and.html) in Excel but am having a tough time framing the required VBA, being a dillettante when if comes to VBA...
Have you tried this?
I'm thinking that it's just a bunch of text boxes with appropriately-sized text but am not sure how to even start thinking about how to make it all work together dynamically.


March 23, 2006
Babak said:

I think this would be of big help:

http://www.juiceanalytics.com/weblog/?p=38


March 23, 2006
ACI said:

very cool indeed (RTFM, or in this case RTFB). But WAY beyond my ability to hack for a much simpler implementation :P


April 10, 2006
Kelly O'Day said:

I liked your idea so much that I tried it on a trend chart.

It's was very easy to set up a VBA routine to create GIFs.

I dwnloaded Irfanview and created a humble animation. A good start that has given me some ideas.

One question, your demo showed a nice movie type display. My demo was ugly in comparison. black backgground, no control options and a display of file path and name across top.

How do you actually capture your movie. I recorded slide show. I am missing somehting that you did.

Once I have adecent movie, I'll submit it. Can you give me some additional tips on using Irfanview? Thanks

Kelly O'Day


April 10, 2006
Chris said:

Kelly,

I wonder if it's a transparency problem? I believe my images were set to a white background in Excel prior to exporting them to GIF.

The right and left arrows will cycle through images in a directory. Holding the right arrow down gives a convincing 30 fps. That is what I am using to "animate" the movie.

Also, I _did not_ put Irfanview into full screen mode (which I think is what's giving you the black background, etc.). Instead, I captured the movie using Camtasia Studio on a portion of the Irfanview window. You can do the same thing using Windows Media Encoder, too.

Hope this helps,
Chris

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Presentation checklist: always simplify, never screenbean

The business world is awash in piles of words mis-classified as PowerPoint presentations. Here are three slides from a webside development vendor's pitch:

Bad presentation example

Even at a distance, it's clear you wouldn't want to sit through this. Creating a compelling presentation is tough. There are, however, a few things that can make a big difference for your audience. Before you save your PowerPoint file as "final" and send to print, here's a checklist of things that you should always do, do more of, do less of, and never do.

Always...

  • Tell a story. Great presentations tell a compelling and cohesive story. Stories have themes, characters, plotlines, and a message.
  • Build a flow. Try reading your slide headlines. Do the slides connect? Do they tell a story? Don't be afraid to shuffle your slides to find a cleaner way for the concepts to fit together.
  • Provide a roadmap. Its easy to lose an audience if you have a complex argument or storyline. Give them sign-posts that let them know where they are and where they are headed.
  • Banish slides to an appendix. When I build a presentation, I will inevitably end up with slides that don't fit neatly into my story. Find a home for them in the appendix; it's the "green room" of your presentation where potentially useful material can hang out until it's needed.
  • Simplify. Your story can be simpler. Your slides can have more whitespace. Be ruthless in cutting.

More...

  • Pictures, including full page pictures. Cliff Atkinson of the beyond bullets blog is a big proponent of pictures in place of words and offers a good, though extreme, perspective.
  • Changes of pace. Long, data intensive presentations easily become monotonous. Break these up with different kinds of slides, multimedia content, or audience interaction.
  • Animation in slides. While animation can be a distraction, it does have its place. In particular, if you have a complex slide, consider building the layers of information. It forces your audience to focus on just one part at a time.
  • Quick punchlines. It is always tempting to develop a presentation that gradually builds to an exciting conclusion. Save it for your great American mystery novel. Your audience is impatient to get to the point. Provide it to them up front.

Less...

  • Words. People use extra words to cover up when they don't know precisely what they want to say. Enough said.
  • Bullets. A list with more than five bullets is beyond most people's ability to process or remember. Long lists can be broken into subgroups.
  • Bullet-points. For short, simple lists, remove the bullet points.
  • Stock clipart: Microsoft has been generous enough to provide a world of clipart fun. Avoid that world as much as possible. Clipart is mostly useless filler.
  • Animation between slides. Unlike in-slide animation which can focus your audience, animation between slides is simply a distraction.
  • Capitalization. There are rules for when to capitalize. Capitalizing Almost Every Word in a Sentence is not One of Them.

Never...

  • Use ScreenBeans. Check this out: I found the ScreenBean store. For just $59.95, you can get a whole package of digital ant people running around in business settings. Trust me, there are cheaper ways to ruin a presentation. On that note, I found this slide in a presentation entitled: "Effective Presentation: Using Microsoft PowerPoint." Ironic?

Screen Beans

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

3 comments


January 15, 2007
» Communication that Sticks - Juice Analytics said:

[...] We can construct engaging stories from the data and put together winning PowerPoint decks, but the window of opportunity to communicate our results always seems frighteningly narrow. Weeks of analysis and synthesis can get crammed into a single 30-minute agenda-item. The cramming part isn’t so much the problem (it demands focus on the key results)–I am more concerned about the fleeting attention of an audience that has a dozen other priorities, is awash in information, and may be data-phobic. Not to mention the risk of getting derailed by conversations about the data sources and statistical significance. [...]


April 4, 2007
» Lingua Analytica, or How to Impress your Boss with Sniglets - Juice Analytics said:

[...] 7. __________: A presentation that attempts to distract from the lack of substantive content or evidence with use of screenbeans, clip art, and other stock pictures or illustrations. [...]


December 22, 2007
Chongololo said:

I actually make and give quite a lot of presentations and I LIKE screenbeans!

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Public Data for the Public

We've blogged recently about the rich data available from the census bureau and the ability to visualize it with low cost GIS tools like Google Earth. Our friend Ducky recently released her mash-up of Google maps and census data that shows the potential for dynamic and informative maps that are easy and free to use.

Webfoot map

This data is available to everyone, but compare the data you find at the census bureau to some of the maps at Webfoot. The census bureau presents a bunch of numbers and Webfoot made them come alive. At Juice, we call this "democratizing data" and it shows the true potential of the next generation of web tools to empower your organization and put the data at the hands of the people that need it. Check out Absolutely Google Earth for a source of tools and resources to do the same.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






100 PowerPoint Ideas

Sometimes when you're starting a presentation, you need a little help getting over the hump in deciding how to present data and concepts. The following PowerPoint deck contains a collection of ideas and visuals that can help when you're brainstorming.

100 PowerPoint Ideas.ppt

I hope this is a valuable resource for presentation writers looking for inspiration, widgets, nice-looking charts and graphics, and ideas on structuring slides.

It's organized into four sections:

Widgets: Little graphics that help call out an idea or show where you are. Examples include: Harvey balls, bullets, brackets, arrows, and outline styles.

Charts: Different ways to use PowerPoint's built-in charting to create nice looking graphics. Examples of column charts, pie charts, commented pie chart, multi-column charts, waterfall chart, radar charts, and representing data with chart areas.

Graphics: This includes tables, value chains, linked chains, representing three dimensions of value, showing market forces, project structure, project timelines, segmentation schemes, ranges, positives and negatives, product roadmaps, flows, processes, Venn diagrams, filtering opportunities, and showing market space.

Slide Structures: Ways of framing the content of a slide, including personas, multi-charts, section breaks, and calling out important concepts.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

6 comments | Show all comments only the last 5 are shown


March 17, 2006
Chris said:

One can just take these bits and use them. Just copy any graphics you want from the 100 PowerPoint ideas deck and paste into the new project you're working on.


March 28, 2006
You are a very helpful and kind sharing person, thanks a million!!!!!! said:

Thanks 1000 millions times over ,not enough, as most of these software pakages are so easy to use for those who have used them intensively, or have looked at them sytematically.

So when those people pass their knoweldge on, like your self, well thats normally once in a blue moon thing.

Anyway i must fi my jumping cursor problem on my vaio notebook, i really have found life easy now with presentations and dont have to stare at those beatiful women in their skirted suits making me think about other things i am so transfixed with these prentations that i love my diagrams and flow charts etc.

Kind Warm regards

Jean Piere


March 28, 2006
Chris said:

From IM this morning:

<strong>Chris</strong>: best comment ever
http://www.juiceanalytics.com/weblog/?p=138#comments
never to be topped
<strong>Zach</strong>: haha
that kills
<strong>Chris</strong>: you've got to write a response
<strong>Zach</strong>: I don't know where to begin

Thanks so much Jean Pierre for the kind comment!  I’m glad we could help you get your mind back on your work and off those walking distractions.


November 28, 2007
Greg said:

great stuff.thanks


January 13, 2009
Lucia said:

Thanks for sharing. This is a valuable collection of ideas!

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Microsoft Office: Fighting the Bull

We're cutting through it this week with an extra helping of Microsoft Office help. Coming soon: PowerPoint brain food, Excel charting de-gunkulator, and Excel charting extreme edition.

But first, check out fightthebull.com, an add-in for Word and PowerPoint that takes syntax checking in a more useful direction. This adds a toolbar to your apps that determines how readable your writing is and how prone you are to biz-talk.  Because, wihle you cn sltil raed tihs, you probably can't decipher this:

One of Unicorp's corporate objectives is to develop strategic relationships with key customers and be recognized for our ability to deliver services of superior value. This competitive advantage will be achieved through continued focus on our core competencies, management attention to the development of operations and process management excellence in all parts of our business, the identification and application of best processes, and continued attention to direct and indirect cost management. The focus on core competencies will promote the concentration of knowledge in select areas consistent with the tenets of Unicorp's strategic plan, Unicorp 2000. Management's attention to operations and process management excellence in all business areas will be achieved through the continued expansion of our management and technical staff, as well as through consistent application of corporate quality programs such as benchmarking and continuous improvement, leading to the establishment of Unicorp's superior business processes in each core competency. Finally, continued attention to direct and indirect cost management will enable Unicorp to offer customers a superior, value-added package of high-quality service at a competitive price.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

1 comment


March 17, 2006
Mary said:

I can decipher that--it is much like edu-lingo. Still your new tool sounds interesting. How can I attach it to my computer?

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Excel Keyboarding Game

You can get a lot faster with Excel if you learn to use keyboard shortcuts instead of the mouse. Here's a fun Excel-based game that we use at JuiceCo to teach one basic keyboarding skill. I guarantee this will make you better and faster with Excel.

Click picture to view video. The audio seems to be infested by aliens and drops out midway through. Many apologies.

Excel Keyboarding Game.xls

If you're viewing this in a feedreader, click though to see a screencast of the game.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

5 comments


March 15, 2006
phil pfeifer said:

Instead of cntrl and arrow, I use end with arrows. It is surprising how few people know how these tricks. I call it the "roller coaster" phenomenon when, instead, people over and undershoot the end of a block when trying to paint with the mouse.


March 16, 2006
Chris said:

My best times are:

Race 1: 0.59 seconds
Race 2: 1.13 seconds
Race 3: 1.44 seconds
Race 4: 1.67 seconds
Race 5: 4.77 seconds
Race 6: 8.80 seconds
Race 7: 13.20 seconds
Race 8: 4.84 seconds

Anyone doing better?

-Chris


March 21, 2006
Josh said:

Oh, please make it so your webcasts don't auto-start!!!

Also, the flash app seems to be capturing every time I hit space bar. So, as I type this comment, it starts and stops the recording each time I hit space between words. AAAAHHH!! (Safari 2.0.3)


March 22, 2006
Chris said:

Josh,

You must be really sick of hearing my voice by now. I just checked Windows Firefox and Windows IE and the clips _do not_ autostart (which is good). So, I've localized my abuse to Mac users. Very sorry.


June 5, 2007
Yaniv said:

This game doesn't seem to be showing up on this post, and I don't see a working hyperlink anywhere. I tried to copy paste into my browser (firefox) http://www.juiceanalytics.com/flash/excel_keyboard_race,444,324
and I got a page not found error.

Thanks for reposting the link if it's still up.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





The Heart of the "Competing on Analytics" Matter

I confess to a lack of ambition. When Tom Davenport's article on "Competing on Analytics" came out in the Harvard Business Review in January, Zach and I critiqued Tom's list of 10 things that are "what it takes to be an analytics competitor" because it offered a good example of condensed misperceptions about what analytics can and should do.

But we didn't address the heart of the matter; the biggest way analytics will change over the next decade and the reason we're in business.

Niel Raden does. He offers a critique of "Competing on Analytics" that is focused and deep. I invite you to read it as well as his other publications. This is good thinking by a very experienced analytics consultant.

There are two schools of thought when it comes to the value of BI in general. One is that it is best used by “quantitative” types and other analytical business people, who can spot trends and analyze patterns to assist in the big decisions and set and direct strategy. The other position is that BI is at its best when helping a broad range of people and processes at an operational level, marginally improving performance, repeatedly and often. The former is the commonly held view of management consultants and, previously, BI practitioners a decade ago. The latter position gained currency in the last few years and is now widely seen as borne out in practice. Using BI to form a new strategy for a global financial services firm makes for good marketing collateral, but when it comes to ROI, lots of small improvements are the way to go.

Why is centralizing analytics bad and decentralizing analytics good? Why shouldn't your organization have a single centralized "brain" that directs the far-flung body to intelligent, purposeful action?

Centralized control of data and analytical expertise may not seem very controversial, but what Davenport is implying is not only centralized control, but also centralized design. This is another naïve assumption, because many organizations are not only decentralized—they’re dysfunctional. Separate units within organizations often need autonomy because they are just so different from the rest of the organization. In addition, as an organization becomes more “agile,” which is a definite trend, decision-making, even for the big decisions, will become more decentralized. Imagine how difficult it will be to buy or sell pieces of a company if the “brain,” the centralized analytical capability, stays with the parent and there is no local expertise?

In our experience, there are hundreds of decisions that need to be made each day, even in a medium-sized organization. In most cases, those decisions are being made in a vacuum—on faith, trust, gut-feel, and partial information. Statistical significance tests aren't needed to improve these decisions, just basic, easy-to-use visibility into business processes.

Statistical analytics can be helpful, but perspective and experience are even more useful to find insights in data. Democratizing data in your organization, making it easier to put more eyes, more experience, more brains against your data is the challenge of the next ten years in analytics. The Internet abounds with examples of what people can do when they can get their hands on data.

Incidentally, if you do want to hear Tom Davenport's side of the argument, you can catch a webcast on Thursday, March 16th at 1:00pm EST. Here's a link.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

3 comments


March 15, 2006
val said:

I must concur with your reasoning and hesitation to affirm Tom's vision (better say his vantage point). Tom's perspective smacks of conceptually dominated reasoning. It is devoid of a theoretical basis that would suggest that centralization has real opportunity costs (e.g., like all centralized IT). Second it is devoid of the organizational realities asscoaited with the division of labor within firms. Third it fails to appreciate the nature of tools available to us today.
I am not impressed with Tom's piece because it reflects no basis, in theory, experience or widom. I will go a step further to suggest that his perspective is based upon his big IT/Systems Integration propensity.


March 17, 2006
Mary said:

I couldn't agree with you more Chris. Working together, brainstorming, cooperation is always the best way to go in problem solving of any type. Two heads are better than one... etc.


November 13, 2006
Jay said:

Hi,

I agree with a great deal of what has been said here however I want to play the other side for a moment on a few of the issues.

To begin with why not do both? Have centralized teams of “quants” (which can exist in a single department or at a corporate level) and give the reporting tools to everyone else.

Just as there are downsides to keeping everything to one high level team there are also downsides to releasing data to everyone. To flip the tables in my work experience I have seen many occasions where these quant teams have out performed “business” people consistently by 100% to 300% in ROI on things such as marketing campaigns or business process re-engineering. Conversely I have seen occasions where business people come up with ideas that create such ROI increases as well.

Just as you are subject to the great ideas that come from having “many eyes” on the information you are also subject to the bad ideas that come form the same process. The same can be said for the small analytical groups. The upside should ways be weighed against the downside in order to define the benefit.

Sometimes data (manipulated properly) can become self-fulfilling evidence. There is a lot of thought and care that is required in order to relate historical outcomes accurately to future outcomes. It is not that only quants can do this as the issues are more conceptual and are issues of experimental design; which we naturally figure out in many parts of our lives. There are lots of constraints during the real-time operation of a business like time, people, cost, quality, etc…but you should always strive for the absolute best answers possible within your constraints.

Any decision about something that has not yet happened is a forecast of a future outcome. By not testing something in a reliable fashion the decision is exposed to an unknown level of potential risk. Very loosely speaking the reliability (accuracy to the future outcome) of the testing (decision making) method is inversely related to that amount of risk. Of course there will always be uncertainty in decision making that is a reality.

Perhaps a more balanced overall perspective is best. Take both approaches and see what happens. You will not know which is best or even if either are providing benefit until you try. Just know that both can be beneficial and it is really hard to make judgments absolutely in either direction.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





The Future Problem with Enterprise Data Warehouses

A friend of mine with years of analytics and management experience at large companies wrote recently. He put his finger squarely on a real issue with enterprise data warehouses--they're built to solve today's problems (not tomorrow's) and they're designed for the needs of analysts that are more statisticians than experts about the business.

I wanted to provide some comments on the enterprise wide data warehouse and the challenges it presents at large corporations. Jim Novo certainly seems to support the roll up approach (I'm on his mailing list) but I agree with Juice that it is too slow, too costly, and results in restricted analytics the way most large companies build them. Most of the large data warehouses I've worked with only include data variables that are key to managing a business TODAY as the warehouses are too big and costly to store data variables with a low usage frequency. They also attempt to cleanse the data by classifying. This makes life easier for an analyst with statistical experience but a limited knowledge of the business. However you're losing information. Problem: You do not know what will be important in the future. Distributed databases at a line of business or product level tend to store more raw data to manage and analyze the business. Sure, the amount of space used would be the same if you simply put the data into the warehouse but that is not the way decisions are made. Decision makers look at the frequency of use of the data variables (TODAY) and the cost to include them then decide which get deleted or excluded. Also, the analysts who are disconnected to the business lines do not understand the raw data.

Let me give you a real world example. Our data warehouse classifies claims into a limited number of claim reason categories. This might be considered a poor design but please see above about analysts that are disconnected from the business lines. When a new type of claim is developing, the person classifying the claims (claims rep) does not have a category to select so they just select what works best to fit into the pre-defined categories. Information is lost due to the restrictions of the allowed categories within the data warehouse. If the notations from the claims system would have been stored (an unforeseen variable) in the warehouse and text mining analytics were being done, a specific word would have been found associated with claims at an alarming rate. Identification of this word would have allowed for early recognition of the issue. It cost us a lot of money in claims due to the missing data but no one thought to include the notes due to the size and costs. Well, we have them now.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






Little Pictures Tell Big Stories

Zach mentioned our penchant for telling a story about customers using little data-dense pictures. Happily, this style of data visualization won "Best in Show" in DM Review's 2005 data visualization competition. Follow this jump for an excellent example of a data dense visualization created using Tableau.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

1 comment


May 10, 2007
theadore said:

i do not like it

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment





Judge customers by behavior, not fur color

To a stranger, my two dogs look alike. To me, they couldn't be more different. They came from different dog shelters and are more than two years apart (that's 14+ human years). Here they are: Maddie has her chin resting on Ally.

Dogs

Ally is twitchy, a mama's girl, frightened of loud noises, and getting creaky. Maddie is confident, independent, curious about the loud noise, and energetic. Ally loves other dogs and distrusts new people. Maddie adores all people and is suspicious around certain dogs. Their features and personalities couldn't be more different. I've had some time to get to know them.

When we meet a stranger on a walk (particularly one who isn't a dog owner), we often get: "They must be related." Our denials don't seem to phase these people as they point to the obvious evidence: "...but they are exactly the same size and color."

Superficial judgements are natural - a first level of defense to categorizing and manage a complex world. However, it's unhealthy to not try to dig deeper. For some businesses, superficial characteristics are as far as the analysis goes when segmenting or profiling customers. A better approach is to look at customer behaviors which provides a much more accurate reflection of interests and needs. Jim Novo, marketing consultant, agrees:

Customer behavior is a much stronger predictor of your future relationship with a customer than demographic information ever will be

Simple customer characteristics can be easy to come by; age, income, zip code are probably part of your basic customer database. In contrast, behavioral segmentation is a more initimating analytical challenge. Here's the approach we've used successfully at Juice:

  1. Create individual pictures of customers that visually show their behaviors over time. The trick is to create a "visual language" that represents actions and is intuitive
  2. With a dash of Excel, SAS, and python code, we generate thousands of these pictures of individual customers
  3. We visually scan for common patterns of behaviors and the associated success/failure points (e.g. repurchase, upsell, churn, etc.)
  4. Finally, we work backwards from our new understanding of behaviors to segment customers based on statistical measures of behaviors.

This approach differs from traditional data mining-based approaches that drill down from the top looking for patterns. We start at a very granular level and looks for patterns (using the power of the human visual system). It may sound a little crazy, but we've found that it can be both insightful and highly predictive.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

0 comments | Add a comment

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment






Census Data in Google Earth

We love Google Earth because it puts the power to explore data in the hands of average folks. We've been exploring uses for census data and wanted to share some of this data with the world.

What you're seeing here is a map of the counties in the United States colorized by median age. Lighter colors are older.

Median age in United States by county (lighter is older)

Census data is also available at the block group level which is much, much more detailed.Median age around Detroit, Michigan by census block group (lighter is older)

Without further ado, what follows are three sets of links for each state which allow you to explore population density, median age, and male/female ratio in each state at two levels of detail. Google Earth is required. We did have some ftp issues when uploading these files, so if you have any problems, let me know and I'll re-upload the file.

Population Density

Lighter is higher population density (white is 800+ people per square mile), Dark is lower population density (black is 2 or fewer people per square mile)

by County (overview) by Census Block Group (fine detail)

Alabama Alaska Arizona Arkansas California Colorado Connecticut Delaware District of Columbia Florida Georgia Hawaii Idaho Illinois Indiana Iowa Kansas Kentucky Louisiana Maine Maryland Massachusetts Michigan Minnesota Mississippi Missouri Montana Nebraska Nevada New Hampshire New Jersey New Mexico New York North CarolinaNorth Dakota Ohio Oklahoma Oregon Pennsylvania Rhode Island South CarolinaSouth Dakota Tennessee Texas Utah Vermont Virginia Washington West Virginia Wisconsin Wyoming

Alabama Alaska Arizona Arkansas California Colorado Connecticut Delaware District of Columbia Florida Georgia Hawaii Idaho Illinois Indiana Iowa Kansas Kentucky Louisiana Maine Maryland Massachusetts Michigan Minnesota Mississippi Missouri Montana Nebraska Nevada New Hampshire New Jersey New Mexico New York North CarolinaNorth Dakota Ohio Oklahoma Oregon Pennsylvania Rhode Island South Carolina South Dakota Tennessee Texas Utah Vermont Virginia Washington West Virginia Wisconsin Wyoming

Median Age

Lighter is older median age (white is 46.0 years median age), Dark is younger median age (black is 29.0 years median age)

by County (overview) by Census Block Group (fine detail)

Alabama Alaska Arizona Arkansas California Colorado Connecticut Delaware District of Columbia Florida Georgia Hawaii Idaho Illinois Indiana Iowa Kansas Kentucky Louisiana Maine Maryland Massachusetts Michigan Minnesota Mississippi Missouri Montana Nebraska Nevada New Hampshire New Jersey New Mexico New York North CarolinaNorth Dakota Ohio Oklahoma Oregon Pennsylvania Rhode Island South CarolinaSouth Dakota Tennessee Texas Utah Vermont Virginia Washington West Virginia Wisconsin Wyoming

Alabama Alaska Arizona Arkansas California Colorado Connecticut Delaware District of Columbia Florida Georgia Hawaii Idaho Illinois Indiana Iowa Kansas Kentucky Louisiana Maine Maryland Massachusetts Michigan Minnesota Mississippi Missouri Montana Nebraska Nevada New Hampshire New Jersey New Mexico New York North CarolinaNorth Dakota Ohio Oklahoma Oregon Pennsylvania Rhode Island South Carolina South Dakota Tennessee Texas Utah Vermont Virginia Washington West Virginia Wisconsin Wyoming

Male/Female Ratio

Lighter means more men than women (white is 55% men), Dark means more women than men (black is 45% men)

by County (overview) by Census Block Group (fine detail)

Alabama Alaska Arizona Arkansas California Colorado Connecticut Delaware District of Columbia Florida Georgia Hawaii Idaho Illinois Indiana Iowa Kansas Kentucky Louisiana Maine Maryland Massachusetts Michigan Minnesota Mississippi Missouri Montana Nebraska Nevada New Hampshire New Jersey New Mexico New York North CarolinaNorth Dakota Ohio Oklahoma Oregon Pennsylvania Rhode Island South CarolinaSouth Dakota Tennessee Texas Utah Vermont Virginia Washington West Virginia Wisconsin Wyoming

Alabama Alaska Arizona Arkansas California Colorado Connecticut Delaware District of Columbia Florida Georgia Hawaii Idaho Illinois Indiana Iowa Kansas Kentucky Louisiana Maine Maryland Massachusetts Michigan Minnesota Mississippi Missouri Montana Nebraska Nevada New Hampshire New Jersey New Mexico New York North CarolinaNorth Dakota Ohio Oklahoma Oregon Pennsylvania Rhode Island South Carolina South Dakota Tennessee Texas Utah Vermont Virginia Washington West Virginia Wisconsin Wyoming

If you want to know more about Google Earth, check out our Absolutely Google Earth a collection of tools and resources to get you started.

We're working on a project to make this and other simple mapping applications more widely available. If you're a python guru who is interested in building great mapping applications like Chicago Crime give me a jingle.

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. All source code is released under a BSD License unless otherwise specified.

34 comments | Show all comments only the last 5 are shown


October 2, 2007
Chris M. said:

I am ecstatic about the info you have provided!
How about ethnicity data? Is there a way to access that in this same format?
Much appreciated and thanks again!!

Chris M.


November 28, 2007
Andrew C said:

I have been looking for good detailed county overlays for a long time... I hate that they disappear when you zoom in. Finally found your website - perfect! Many thanks.


June 23, 2009
Wai Lee said:

Where did you guys get your information on geographic boundaries? You seem to have stored the county-boundaries as lists of lat-lng pairs (in the KML document). I can't imagine you got these lat-lng pairs from a map. Are they published somewhere?

Thanks!


June 25, 2009
Chris Gemignani said:

Hi Wai Lee: I'm sorry to say, I don't recall exactly where we got the boundary data. There are published government sources that typically include geographies in shapefiles. I believe we extracted those out using Python and generated the KML also using Python. We did a very slight amount of shape simplification (coalescing nearby points) to make things easier on Google Earth.


January 25, 2010
Aaron Coleman said:

Thank you for posting these .kmz. I just found .kmz on the EPA website showing air pollution throughout the country. Any idea why Google hasn't gotten in kahutz with the census bureau and published this info along with their existing data. Guess it's a lot of info in one software.

Your name

Email (optional, will not be shared)

Type the word "juice" (required to confuse the spammers)

Your comment


Add a comment