Billions and Billions of Reports (a la Carl Sagan)
By Ken Hilburn
November 7, 2008
Find more about:
bi
I recently came across a white paper on the "five styles of BI" and thought that would be an interesting read. As it turns out, more interesting than I expected. In this paper, the vendor (in order to protect the innocent, we'll just call them MacroTactics) made a statement regarding the performance capacity of this particular vendor's solution: 72,000 reports per hour. Let's see, 72,000 reports per hour... that would be 576,000 reports in an 8 hour day... and 149,760,000 reports per year. Wow. Who's reading that stuff?
Now, I fully buy in to the fact that applications that deal with lots and lots of data need to be hugely scalable, but what I don't buy is how this is in any fashion a measure that anyone can use to figure out if a particular BI solution is right for them. I can just imagine the requirements spec for that solution: "15.1.182.f - Solution must be capable of creating 70,000 reports per hour. Alternately, solution will be able to generate 140,000,000 reports per year." 140 million reports! Incredible. (Now, what did I do with my mini-me?)
Seriously, here's the thing. More reports is rarely the answer. We already have plenty of data and plenty of reports. What buyers and users really want is fewer reports and more information that helps them get their jobs done better and faster.
We'd encourage business intelligence vendors to think of themselves more as data storytellers than data factories churning out generic report widgets…even if they can do it at incredibly high speeds. From this perspective, you wouldn't want to hear Steven Spielberg bragging about his ability to pump out a dozen movies a year or J.K. Rowling trumpeting her ability to write 1000 pages a year (hmm, wait a sec).
How to Feel Better About Your Data Warehouse Fiasco
By Zach Gemignani
February 27, 2008
Find more about:
Business_Intelligence
bi
reporting
Here’s a little predictive analytics:
About a year ago, I took a swipe at the “$80 million supercomputer to analyze NYC student achievement.” It smelled more like a super sales job than a super useful analytical tool.
At the time I had said:
Teachers are underpaid, hardly appreciated, and overworked. I can only wonder what the half-life is of a system that asks teachers to log on to get information delivered by the “chief accountability officer.”
Well, it appears that things haven’t gone that smoothly with the supercomputer. Today, I received a link from Leonie Haimson, a NYC education advocate, to a story entitled SCHOOLS COMPUTER AN $80M ‘DISASTER’.
Not only has the supercomputer struggled to gain much traction with users (“The school system’s new $80 million computer super system to track student performance has been a super debacle, teachers and principals say”), it has coincided with severe budget cuts.
We see these data warehousing problems all the time with our clients, and the NYC supercomputer displays all the hallmarks:
- Delivery delays: Nearly six months after the Department of Education unveiled the “first of its kind” data-management system, the city’s 80,000 teachers have yet to log on because of glitches and delays.
- Bad user experience: Many principals have complained that it runs slowly, lacks vital information, and is often too frustrating to use.
- Complicated training and set-up: School officials were hoping to have everyone hooked up and trained within months… delays in creating IDs and passwords for teachers
- Trying to do too much, delivering too little: The principal added that she preferred to get student information from a combination of old data systems “rather than wait for ARIS to churn and churn and churn and maybe give me half the report I need.”
- Massive cost: Complaints about the expensive system—on which nearly $35 million has been spent so far—have gotten louder since the city unceremoniously chopped $100 million from individual school budgets last month.
- And yet, few success anecdotes to justify the investment: ARIS had already enabled her data team to analyze the performance trends of the school’s many English-language learners.
It does offer one thing that I haven’t seen before: a Chief Accountability Officer.
6 comments | Show all comments only the last 5 are shown
Miguel Marcos said:
Chief Accountability Officer.
Go ahead and pronounce the anagram:
CAO.
Ross said:
Does these mean that the "CAO" will be the one who has to pay back the $80 mil, he's Accountable right?
Lynn Marentette said:
I'm a school psychologist who works with people and data. I've been frustrated with user-unfriendly technology throughout my career. Student data management systems are one of my sore spots.
The quote from the article tells it all: "Bad user experience: Many principals have complained that it runs slowly, lacks vital information, and is often too frustrating to use."
NYC isn't the only place experiencing usability problems with education data management systems.
I work in North Carolina, a state that adopted the NC WISE data management system about 10 years ago, at quite an expense. My school district will be moving to the system next year, so I do not have personal experience with the system. To prepare myself tof the change, I viewed the multitude of questions discussed on the NC WISE Questions/FAQ website. One glance and you will quickly see that ease-of-use was probably not a priority during the system's development: http://www.ncwise.org/ncwiseQuesFAQ.html
By the way, the NC WISE system doesn't play well with data-management systems designed for N.C. special education student information. Many districts use CECAS for handling this data, but as you can see from a recent "bug" list, there are numerous problems: http://www.nccecas.org/systemupdates/openbugs.html
Problems with the NC WISE system have been documented for years. Here are a few examples:
eSchool News article, 2004: http://www.eschoolnews.com/news/top-news/index.cfm?i=35547&CFID=5146403&CFTOKEN=38831309
Byte and Switch article, 2006: http://www.byteandswitch.com/document.asp?doc_id=105150
Note: I returned to school a few years ago to study computers/technology part-time, including HCI, so I know that there is much room for improvement.
Annoyed Analyst said:
So...how do I feel better about my data warehouse fiasco? :-)
Mary G. said:
Same problem, smaller scale. You’ve opened the door for me to rant a bit about a program that was recently imposed on the special educators in our five town school district at a cost that I have been unable, to date, to track down—but I intend to.
Our small rural district here in Vermont recently bought into a special education management program. As the coordinator of special education services and assistant principal at my school, I have found the program difficult to use and inefficient.
The program, Case-e, was created with little or no understanding of the special education process. The creators didn’t even bother to use the special education terminology we use in Vermont on the countless, seemingly arbitrarily placed, tabs. Each section of an evaluation or individualized education plan is isolated so that the user can’t see the report until it is finished. And, there is no spell check! Really, I’m not kidding. Just as annoying, test scores cannot be shown in columns. Oh, you think I’m making this up? No, the program does not allow it. The support folks from the company said they can’t fix it because it was “created by engineers in California.” What is that suppose to mean? They are willing to sell a program but to be responsible for how it works?
According to their website, the company claims their goal is to “help special educators reduce their time spent on paperwork and administrative duties so they can spend more of their time and energy helping students.” Wrong. The final product is unattractive and difficult to read, often with big blank spaces and sections of the report divided where they shouldn’t be. Not only does the original document take significantly longer to prepare, but also I am ashamed to give it to parents. I always apologize for it and often give them a second copy of a report that I have redone in my old parent-friendly format.
Finally, the program claims to provide “total support… you have all the assistance you need, when you need it.” Not true. If the creators knew anything about teachers, they would know that we often write these reports late at night and on the weekends. Teaching is not a 9 to 5 job. Their technical assistant folks need to keep the same hours that teachers do.
Beyond the cost of this program, this is costing our district many extra hous of professional time. I could go on, but I won’t. Grrr...
Mr. C said:
I'm a principal in NYC and can verify for you that ARIS is the fiasco you describe. For a system with very few active users, it's shockingly slow. (What will happen when everyone is actually using it?) Currently, ARIS is a pretty user interface on top of an outdated (and equally atrocious) computer system in which we still do all of our data entry. No new information can be put into ARIS. You can only view the information and produce more attractive looking reports which mostly concern test scores.
The trainings, largely done by IBM staffers with no sense of what we would need/want from a data system, have been a total waste of time. I have spent more time in training about how to use ARIS than I have actually using it for anything to help my students or my school. And yes, I just lost tens of thousands of dollars from my budget.
Add a comment
Microsoft says: "BI is Really Hard to Use"
By Zach Gemignani
January 3, 2008
Find more about:
agile
bi
business
management
We don’t tend to agree with Microsoft when it comes to data analysis and presentations. In fact, we’ve even been critical of them for misrepresenting data, excessive visual “flair”, missed opportunities to improve Excel, forgetting their power users, subpar presentation tools, and wasteful slide masters.
With all these past differences, I was a little surprised to find that we do share some common ground. Check out the comments (from an article in Internet News) by Peter Klein, CFO for Microsoft’s Business Division in describing the world of business intelligence:
“I’ve talked to a lot of customers about business intelligence and the one thing that they tell me is it’s really hard to use,” said Peter Klein, during at the Credit Suisse conference.
“‘I’m not getting the value out of the investment that I made,’” Klein said customers had complained. “‘I have invested a lot in my back-end systems, and today 10 percent or less of my employees actually touch it, or get access to the data. I’ve got six different BI solutions across multiple different departments, none of which talk to each other. And they’re hard to use, so I’ve got to send people to training for two weeks to learn how to use it.
Finally, we are speaking the same language. Now, I’m curious to see what they are going to do about it.
2 comments
Jock Mackinlay said:
You have to focus on people rather than technology. After all, the business intelligence is for people to do their work and make decisions.
Wouter Brandsma said:
And that statement comes from a firm who delivers Reporting Services in Visual Studio as a serious BI tool. That application was never designed for business users, but for developers. Why not integrate the query solution of Reporting Services in Excel as a replacement for MS Query and use Excel. It is a fact that the most used function in any BI tool is "export to Excel" (Microsoft acknowledge that in their BI seminars).
Add a comment
Do you have Insurgent Data?
By Ken Hilburn
December 7, 2007
Find more about:
bi
business
There are two kinds of people in this world: those who put things into two categories and those who don’t. Maybe this isn’t the best representation of the complexities of the human race, but it does give me a cheap lead-in to compare two types of problem solutions: “high tech,” focused on tools, and “high touch,” focused on interpersonal communications.
I was reminded of these two approaches by a recent interesting article in Wired that expresses an opinion about why America’s performance in Iraq has been disappointing. The basic premise of this article is that America has entered into this engagement in a “technology networked” fashion, drowning it in technology; the more, the better.
The article suggests that the US forces would make more progress if they were to spend more time on a “socially networked” approach. For instance, instead of remote controlling a drone from 100 miles away, spend more time drinking chai with local leaders. Not the absence of technology, but the incorporation of technology into a socially based environment.
“If I know where the enemy is, I can kill it. My problem is I can’t connect with the local population.” This was a quote from one division commander. Change a couple of words and you end up with a statement that many of us would find all too familiar:
“If I know where the inefficiency is, I can fix it. My problem is I can’t connect with my data.”
Aren’t we witnessing this in spades right now in the BI space? There’s no lack of number of tools and number of features in these tools. The challenge is figuring out who the real insurgents are and how you deal with them. If you’ve been reading the Juice blog for very long, you have a pretty good feeling for how we approach what we believe is a social problem (high touch) and not a technical one (high tech).
The good news is that the US forces are changing their approach to socialize more with the Iraqi people—hopefully leading to a better Iraq. Is there good news for the BI space? We’d like to hear from you on how you’re making sure you focus enough on the social “high touch” aspects of our space. What’s your insurgent data? How can you get to know it better?
2 comments
Patrick said:
One thing I do to find the insurgents is decidely low tech. Its my Ainsley filter. I have a very bright, honest 12-year-old handy (she's also a fair to middlin' mechanic). If my reports don't pass the Ainsley filter, I'm not done.
"What does 'Prj AVL' mean, dad?"
Nothing- it means nothing, so I just type out the complete words 'Projected Available'.
"If this is supposed to tell me about new products, what are these year-to-date sales things?"
They are insurgent data points. I remove them.
I realize that not everyone has access an Ainsley filter- so I encourage you to build one. She has helped me color-code things: "Why is that yellow? I thought yellow meant caution."
She's right- if I am using colors to show meaning, red/yellow/green is OK, otherwise, use different colors to separate ideas visually.
My Ainsley filter has made be a better communicator. You need someone honest, bright, and interested. The part where she spoils dad is just icing.
Simon said:
I always thought that there were three types of people in the world. Those that understand binary and those that don't ;0)
I think this problem stems from us trying too hard to please too many masters. One of the pillars of good information is relevance, the others being accuarcy and timeliness. The problem with relevance is unlike accuracy and timeliness it can vary a great deal by audience.
If you were only ever trying to communicate with a narrow audience it would be easy to cut out noise from our output.
Add a comment
The Last Mile of Business Intelligence
By Zach Gemignani
November 1, 2007
Find more about:
bi
reporting
“The last mile” is a term that often is applied in the telecom industry in reference to “the final leg of delivering connectivity from a communications provider to a customer.” It is an expensive and complex step due to the challenge of pushing information from centralized, high capacity channels to many diverse end-points where information is ultimately used.
We think there is a “last mile” problem in business intelligence too. This critical bridge between data warehouses and communication of insights to decision-makers is often weak or missing. Your investments and meticulous efforts to create a central infrastructure can become worthless without effective delivery to end-users. “But how about my reporting interface?” you wonder. That’s a creaky and narrow bridge to rely on for the last mile of business intelligence.

Listening to our clients, we are confident the last mile is a real problem. The ultimate source of this failure is less clear. Here are a few of theories:
1. The engineers who built the data warehouse build the interface. No offense to the talented individuals who can push around, clean, normalize, and integrate data—but they may not be ideally suited to designing a user interface for non-technical users. A designer wouldn’t create charts that look like this (our favorite example of chart-based encryption):

In the worst case, developers are dismissive of user experience. I’ve met with IT folks who felt confident that providing a massive data table would provide a suitable solution for delivering information to users. “Hey, they’re getting their data. Is there a problem?”
2. Reporting is considered the fundamental mechanism for working with data. Here’s a framework we’ve started to consider in thinking through the multiple approaches for getting value from data:

- Reporting lets you monitor things that are well-understood and relatively predictable.
- Exporation or analysis helps you understand new processes and erratic and shifting behaviors.
- Presentation is about communicating insights and understanding, often building on both reporting and analysis.
Many people assume that a reporting tool is sufficient to do in-depth analysis and communicate results. That’s like trying to build a deck with a screwdriver.
3. Poor fundamentals in information display. Despite the efforts of folks like Edward Tufte and Stephen Few, general literacy in this area is still low. Shiny, 3D pie charts are still acceptable, even desirable in some places. Particularly disturbing is the persistence and pervasiveness of this problem in Excel where there still remains some confusion as to why this is bad information display:

You don’t have to go any further than the Dashboard Spy to find examples of the visual muck that is commonplace.
16 comments | Show all comments only the last 5 are shown
Jorge Camoes said:
Zach, with low literacy levels you can't say "this is good, this is bad", because people will not understand why. A reference model is not enough, you must guide them step by step, make them compare outputs and explain why a 3D pie chart is not good for them. This is not easy and takes a long time, but there is no other way, I think.
Rob said:
Jorge, check the link to Stephen Few's website. He's got a whole page of "bad graph" examples along with analysis and explanation of a proper solution. Pretty good reading so far.
Dave Katz said:
I like the framework, but I think it's missing the cyclical relationship between these three activities:
1) Start with exploration/analysis, until you get to the point where the thing you're analyzing is "well-understood."
2) Develop some reports that help you identify patterns, trends, and notable exceptions.
3) Present the "insights and understanding" that you found along the way to your colleagues - this is bound to generate new questions, which brings us back to step 1.
This is admittedly oversimplified, but I think its a workable model.
I envision the diagram of this process as a circle with arrows showing the process flow, rather than as a triangle.
Zach said:
Dave, I agree that there is a flow between these modes. The problem is when people don't recognize that they are using reporting for presentation or presentation for analysis, and so on. Also, I thought about placing "dashboards" between reporting and presentation as it is a kind of hybrid.
Jorge Camoes said:
Rob, I know the page and it is a good starting point. This is one of the things I also try to do in my blog. But you need some sort of interaction, to explain, overcome objections, etc.
I would say that we are selling something that no one wants to buy. Imagine an ad were everyone is happily drinking and socializing. Then we must sell the message "don't drink and drive". People know you're right, but they'll keep drinking (and using 3D pie charts because the boss likes them).
Mohit Mahendra said:
Spot on identifying the problem, and I like the model you presented. The last mile challenge is probably more a people issue than any other. Exploration, understanding and presentation of insight require unique and uncommon skills, and talent. There is only so much that can be embedded and automated in a system. After that it comes down to the people that deliver the last mile.
Dave Katz said:
Friends don't let friends build 3D pie charts.
Tony said:
Hahaha, nice one Dave!
What's worse than a pie chart? A 3D pie chart!
Sanjay Tamta said:
I'm surprised that you think of the "last mile" as a pervasive issue. Yes, it is true that several interfaces leave a lot to be desired in terms of content and usefulness, but I contend that leading corporations are very familiar with and extremely good at mitigating the risk of "lost in translation" between technical and management teams. With 15+ years of robust global business growth and the emergence of IT as key enabler, corporate success is in large part attributed to the availability and relevance of key business insight through the use of well designed EIS and BI systems. So I agree with your hypothesis in part but would say that it is really a matter of the experience level of the people involved. When you look at leading BI implementations, you will see tech and management working in concert to create value for the organization. It’s really no different from building any other kind of system.
michel g said:
I keeping thinking of Mr Covey when I deal with clients facing this issue. I believe it is either his 1st or 2nd principle that says, "begin with the end in mind." I can't help but think that the last mile is so challenging because we start with a list of requirements or features and not of desired outcomes.
Jennifer E said:
We are in the "Last Mile" in my project where I am embarrassed to say that I manage the User Interface. I struggle with everything you have mentioned above. Unfortunately, the company I work for does not feel it important to invest in UI and my team is sparse and has been over run with the Data Modelers with no design sense. If just 2% of our time was spent on usability we would have much easier to understand product and the training to use it would be greatly reduced.. I will keep reading your blogs.
Zach said:
Jennifer: I'm sorry to hear about our situation. It raises the question: Does a lone, screaming user interface person in the Data Modeler woods make a sound? We'd be happy to lend a sympathetic ear if you want to reach out.
Sanjay: Our experience has been very different from what you describe. We've found it rare that management and tech work so smoothly "in concert." I have yet to see a BI implementation that consistently brings joy and insight to end-users. Furthermore, it seems a stretch to attribute 15+ years of global business growth to the success of these EIS/BI systems. On the other hand, it seems like you have a Harvard Business Review article in you.
Sanjay Tamta said:
Zach:
That was funny and your point is well taken. Actually, I think an HBR article may just hit the spot.
Good blog btw.
Sanjay
James Taylor said:
Nice post and inspired me to post on something similar over at http://www.edmblog.com/weblog/2007/11/mistakes-in-the.html
JT
Mark said:
Good post and you can see similar examples at <a hfre="http://www.dashboardzone.com">Dashboard Zone</a>
Sanjay Tamta said:
Zach:
Has your firm done any BI work in the area of Social Network Analysis?
Thx
Add a comment
Earlier writing





2 comments
michael said:
I work on the supplier side and this has become an initiative. The issue here though is what do we really consider a report. For some of my clients I've almost fully automated the "reports" but, really these are just data dumps. The just want the all the data points summarized in an excel workbook. Useful eh?
But for other clients we provide written reports. The problem we're facing now is that many of our research teams are 'trained' to write their reports like a data dump, Q1 here is the distribution here is the mean. Next, Q2 here is ... This is what I'm currently dealing with. Researchers with over 10 years experience and no desire to make their product better.
Alan said:
I think you are extrapolating the figures the wrong way. A system that can turn out 72,000 reports in an hour can do 1,200 in a minute. This may still seem high, but depends on your definition of a report. For instance a monthly sales report can be distributed to 1000 people, each with different access to the data, and therefore different data content. If you consider each of these a spearate report, then you only have 12 core reports for each user to consume. When a warehouse completes a load, sending out updated report pack of 12 reports to all users is not beyond the realms of possibility.
So while I think the claims of the MacroTactics are silly, I can see a case where high performance is an issue
said:
Add a comment