Ajilitee

Archive for the ‘Agile Analytics’ Category

Big data technologies in healthcare insurance (payers): nosql and MDM–part 1

Friday, April 20th, 2012

We were working with a client around Fraud, Waste and Abuse (FWA) recently and we needed to clean up the client’s Provider data to help us track longitudinal changes in fraud behavior. Some of the published reports suggest that FWA accounts for, minimally, $21B in Medicare payments that never should have been made. That’s a lot of money. I’ll blog more on FWA, analytics and how Ajilitee could help you under our Managed Analytics service offerings, but in this blog I wanted to focus on just one small aspect of managing Provider data at Payer companies.

Many healthcare insurance companies are not known for rapid innovation. But times have changed. The need to manage members using Customer Relationship Management (CRM) technologies has greatly increased and many Payers have started their CRM efforts. But, these CRM technologies have been in use other industries for 20 years.  The technologies needed to support CRM analytics and member point of care/point of service interactions are different then what many Payer organizations have in place today.

Where do bigdata technologies play? We have been asked that question here at Ajilitee and as part of working on our FWA products and internal R&D innovation, we have been using and looking at these technologies for awhile.

Bigdata technologies span many areas and have interesting names: hadoop, hbase, hive, pig, nosql, mahout and more. Nosql and hadoop in particular are core technologies that other layers build on. For example, hive and pig build on hadoop. Hadoop itself can build on nosql technologies. I will not repeat all of these concepts in this blog because a lot of content has already been written on these technologies. From here on, I’ll assume you are familiar with some of these terms or look them up easily.

Guiding Thoughts

Here’s a couple of principles that can help guide the conversation:

  • Deep insights: The more you understand your domain specific problem, the easier it is to adapt new technologies to solve them in novel ways or to recognize trade-offs you are implicitly making with current technologies. You can find novel uses that solve a problem in different ways. Think Geoffrey Moore and disruption.
  • Delamination: By rethinking the layers of technologies you use today, its possible to pick and choose the layers and how they combine together to create new solutions. Many nosql packages do not include compute engines within their data access and storage solution. So solutions such as hadoop must be layered on top of the nosql databases in order to obtain compute processing. There are a lot of variations to this thought but just think “delamination” can be helpful around innovation.
  • Do something different: Unless you are really, really smart, a great way to learn about how to use the new technologies is try something and fail multiple times to solve a problem.  This learning by doing is key to innovation. Of course, you have to learn from your mistakes each time—failing by itself is pointless.

Scanning the bigdata Technical Landscape

There are many new technologies in the bigdata world. Let’s look at nosql. Many people question whether the technologies are mature enough for production use and whether they help them solve business problems faster, cheaper or better in some way.

Bigdata and nosql conversations usually start with explaining issues found in the world of managing and serving massive amounts of data needed for websites. But the technologies seem confusing at times and begin to wonder if they apply to our problems especially those in the healthcare world.

Nosql technologies are often quoted as having the following properties:

  • No sql: This means there is no sql!
  • Schema-less: There is no schema!
  • Eventually consistent: Forget ACID. Forget transactions. Eventually your data is consistent.
  • Fault tolerant, scalable, distributed: A whole number of really great architectural characteristics that sound good but you are not always sure apply to you.

Once you start looking at the bigdata technologies you are immediately struck by the fact that:

  • With some nosql technologies you actually do define a schema and indicate what a column’s type and name is. Some nosql technologies also need to know how to sort columns so they need knowledge of how to compare keys or values. That sounds like a schema!
  • With nearly all bigdata technologies, someone has already written a [insert technology name here] Query Language to you can run queries. This sounds like everyone still wants *SQL.
  • With some nosql technologies, you have to specify properties such as coherency, which indicates that you can get the value you just wrote back out of the database. While not the full definition of ACID, it starts sounding close!
  • There is a lot of parallelism everywhere. The file system is parallelized! Don’t forget that the job flow is parallelized as well! (assuming it fits a specific data parallel processing model). Everything is about scale-out—add more nodes and everything keeps running, the system stays up, and everyone has fast, guaranteed access!
  • There appears to be several interfaces into the database some of which require programming in a language you may not be familiar with e.g. not SQL! How do you even load data?! You almost feel like you are programming at the lowest level of database programming possible. Wait a second, which layer are you programming? The filesystem level? The map-reduce level? Or both?

This seems very confusing. So let’s think through the issues. We want to avoid having everything look like a nail because we have a bigdata hammer.

Healthcare Example

In the Payer space, Master Data Management (MDM) is finally becoming a component of the business and architectural landscape.  In the Payer world, MDM means managing Providers, Members, Contracts and Products and other business entities that you can often touch and feel or that are really considered non-claims data.  Other types of Payer data include “event” data. This is data generated by interactions with Members from sales, service and care oriented interactions. Of course, there is also claims data. Claims data is the largest source of data, followed by “event” data then MDM. Are some of the bigdata technologies relevant for even the small datasets such as MDM datasets? Small in this case means a few million rows and typically much less. Of course, this links back to our FWA problem statement at the beginning of the blog and the need to clean up our Provider list to perform Fraud analysis. We’ll illustrate some of our points with industry specifics.

  • Schema-less:
    • The NPEES Provider list from CMS changes multiple times a year. The list contains all of the Medicare Providers and some, but not all, of their demographics.  The columns of data change although not frequently. New providers are added or removed as appropriate. Provider data inside a Payer typically originates in several systems–sometimes up to 20.  So it would seem that a technology that claims to be “schema-less” would be useful. But schema-less does not mean that you do not need to specify the data types of the data.  You have to specify the format somewhere so external tools can use the data. The NPEES file has several sub-entities in it like addresses and other codes indicating the Provider’s specialty or whether the Provider is an individual or an organization. Shouldn’t we pull these sub-entities out and make them their own table? Shouldn’t we also try to specify where and how the data should be loaded to be efficiently accessed, perhaps by using table partitions, or striped volumes or other typical database designs? These are normal database design concepts.
      • Part of the value of being schemaless is that you tend to concentrate data together into denormalized structures and use it to answer a smaller set of business questions such as “what data changed between NPPES files each month?” And the data you load may be very dirty, so lets load it all as strings, then convert the data in the database itself. We don’t have to work to hard to specify types, but we must specify some. We can also ignore doing detailed table design because most nosql database are designed to scale out. Bigdata can help us push aside this operational complexity.
      • MDM data changes over time. For example, you may choose to append one external data vendor’s Provider demographic data one year, then switch the vendor the next. That’s a whole new set of data structures in the traditional world. Being schemaless allows us to manage data changing over time without having to reload or re-baseline to achieve acceptable performance. Hence, you can evolve the schema more easily and that’s a great reduction in operational complexity.
      • No-sql: We clearly need to write a query to determine what changed between different NPPES file releases. We have to write a query. The entire claim of nosql must be false! The answer is more subtle than that. The claim of nosql is really one of not having many characteristics of traditional RDBMS databases built into the database layer. For example, you will not see nosql databases implementing referential integrity through sql statement such as foreign keys, etc. You do have to specify primary keys for some nosql database just to help with managing the data.  In fact, many systems today, whether a data warehouse or a transactional system, actually implement integrity in the processing layer above the database these days. This is neither wrong nor right, but just where it is often happening. Hence by saying its nosql, you are really saying that the data architecture is one where the data is more concentrated, where integrity is implemented in a processing layer and not the database, and where the data access interface makes as few assumptions about the data as possible in terms of its structure.
        • There is often another implication of schemaless that is less often recognized. Because the nosql database essentially delaminate the database stack to some degree. Mathematical processing occurs outside the layer.. While nosql creates uniform access performance by keeping the interface simple and scaling out, it also does not allow computations to be automatically pushed down to an individual node for parallel processing. That’s where hadoop steps in. By teasing apart the computing part from the data access layer, you have to now choose where computing occurs. In the case of hadoop, that processing can occur on a node where the data lives (there is a Cassandra+Hadoop integration layer) or you can process the data controlling using uniform access performance to avoid overloading the compute server. This also means there are really not any stored procedures in nosql databases.
        • Eventually Consistent: In our specific case, eventually consistent is fine. Since we are loading the data and deduping and cleansing it initially, that’s not a big deal. But…
          • Let’s also think through the case of Provider MDM. In an enterprise MDM system, all transactional systems should reference, in real time, the MDM system to obtain authoritative data when it needs it. The MDM data should be consistent. However, even in Payers today, the MDM data is not immediately consistent. There is an acceptable lag between one transactional system authoring some master data and another system in being able to access it. Typically, the lag is a few hours or a day or a few days. In fact, if we look at nosql databases like Cassandra, its quite possible to improve the time to consistency at a significant lower cost structure. For example, a social media site can tune its consistency which means it can tune how fast you can see your new “friends” post. You will also want to tune the consistency you want for MDM and scale it up or down. You can do this in nosql technologies without incurring additional development time or complexity all using the same database. That’s huge and compelling. Because enterprise MDM makes the MDM system an operational imperative, you have almost immediately solved some very vexing architecture problems at an incredible inexpensive cost point.
          • Cool architecture: The previous bullets have already pointed out the need for scale and robustness so I will not repeat that here. But an additional thought may be worth pointing out. If you think about the data access patterns, where the architecture is concentrated to have an MDM hub that selves up authoritative data to many, many consuming applications (many reads, few writes) and this all happening in relatively real-time, then scaling and fault-tolerance are actually key. In the MDM vision landscape, cool architecture is actually really important and your MDM hub does look more and more like a website serving up data. You need a transactional OD.  Its also important to realize that you can only take advantage of the cool architecture if the other parts of your architecture are also simplified. Your mileage may vary if all you are doing is plugging in new technology into the same old landscape without any changes anywhere.
            • Because a cool architecture can be delaminated, we have to plan for how computations (queries) will be executed. You cannot automatically have the computations pushed down to a node without using hadoop or something similar. Otherwise all computation and IO gets throttled on the node you issue the query from. That’s one are of review and choice you have to think through and one place where hadoop, hive and pig try to help you think through. Other database engines that distribute the data and computations might make this much easier but you may have to make other architecture choices to use those tools. Think deeply and carefully about cool architecture.

So based on some deeper thinking, it appears that the bigdata and nosql world can offer something of value even for a Provider MDM problem which seems like an ill-fit to begin with.

Summary

It appears that if the problem you are trying to solve is important enough to use these other technologies, there is some benefit to using them in the right mix and in the right proportions to your existing architecture. They are viable and based on our experience at Ajilitee, can be made production ready. In some cases, they can dramatically reduce operating complexity despite their seemingly lack of maturity around tooling. In many Payers, reducing operating complexity is a huge win.

In the next blog we will demonstrate learning by doing using bigdata technologies on larger Provider datasets and common healthcare processing analytical patterns. I’ll also return to the FWA theme.

As a treat, John Bair our CTO is speaking at TDWI’s Cool BI Forum in Chicago on May 8. He’ll be talking about these technologies and how they can help you. His talk is based on direct experiences from building products and solutions for our clients.

ACOs and aggregated data: Payer analytics comes to ACOs

Thursday, September 22nd, 2011

I was reading through the the 42 CFR 425 Medicare Program: Medicare Shared Savings Program: Accountable Care Organizations yesterday in response to a client asking us about our Managed Services. Specifically thinking about II(C)(2) and data sharing.

The rule discusses data sharing between ACO participants and the government.  Generally, the gist of the following few sections suggests allowing the Secretary to share claims information on Medicare beneficiaries both at the point of enrollment as well as on an ongoing basis. Why?

First, having claims data at the point of enrollment helps ACOs satisfy other parts of the rule around creating personalized care plans, helping with care coordination and other changes in care processes that need to be made for that beneficiary–all focused on patient centered principles. The data could be personally identifiable of course.

Finally, on an ongoing basis, the data would be provided in detailed as well as aggregated form to help ACOs understand the total universe of care. Since an ACO may not actually provide all care services for a beneficiary (although the goal is to do as much as possible), obtaining beneficiary level claims data helps ACOs understand the complete picture. The rule goes on to describe how this data would be used in aggregate and in detail (Section 5 and 6):

  • Financial peformance modeling
  • Utilization management
  • Clinical management
  • Quality reporting
  • Care management/Care coordination

For example, if the ACO’s beneficiary population has a high rate of readmission, then a program could be put in place to improve discharge coordination to reduce readmissions.

Looking over the intended uses, none of which are new per se, it struck me that a  lot of Payer claims data analytics is now transitioning to the Provider community. Since ACOs are inherently about Providers, not Payers (which is the primary reason an ACO is not an HMO), the ACOs now need to do Payer analytics.

That’s what Ajilitee is really good at. And we can do this on a Managed Analytics basis so we can help ACOs rapidly get to market on ACO analytics at the beneficiary level. Most importantly, since we build our analytics environments on could computing platforms like Amazon, you only pay for what you need and it can grow to the size you want as the ACO grows.

That’s what struck me about Managed Analytics and the client conversation. With Ajilitee, Managed Analytics has a great promise to deliver capabilities that Payers have today cost efficiently to the Provider community immediately tomorrow.

Are system integrators poised to play a new role as analytics innovator?

Wednesday, May 18th, 2011

We were briefing Gartner analysts John Hagerty and Kurt Schlegel on Ajilitee strategy, plans, and client work. We shared with them our belief that markets, especially mid-markets, are moving towards a world of managed analytics–analytics that are performed by third parties like Ajilitee and which are tuned to a specific business process and contracted through a managed services agreement.

Our sister division Discovery Health Partners (DHP) is a great proof-point on how managed analytics can deliver a fully outsourced business process in a SaaS model, in this case for healthcare payers who want to recover claim overpayments and subrogate third-party liability. The DHP Intelligent Cost Containment platform runs completely on Amazon Web Services. We also shared that Ajilitee will deliver the entire information management stack to one healthcare company and will provide process-specific reference data to support business rule processing in “smart” claims processing to a second—both using Amazon’s cloud.

Near the end of the conversation, Gartner posed some questions for us. Do we think that traditional system integrators, such as the large IT consultancies, are all poised to become a new type of analytics provider to companies?  Well, we think the answer is no, and of course, not surprisingly, it’s also a yes. Here’s why.

NO

On the one hand, systems integrators have been trying to integrate analytics all along as part of building an annuity revenue stream and as an extension of their efforts to integrate applications and business process. In neither case have these efforts changed the game. Let me explain.

First of all, large consultancies have thrived on large packaged-application implementations: SAP, Oracle and the like.  These are huge, complex projects that typically span years, continents, and literally hundreds of millions of dollars. This is revenue they can count on, year after year—the concept of a revenue annuity. Instead of selling a new consulting project each year, which is a lot of work, why not have something locked in for a multi-year contract?

The packaged-application projects have almost always been about improving, standardizing, centralizing or otherwise improving business processes. To answer client demands in these multi-year commitments, the large application vendors have spent millions, perhaps billions, developing footprint extensions in the analytics space to try and help clients understand the business process metrics as well as the end results of the business process, e.g., how much money did you make last quarter?  Some packages have even integrated more advanced data mining components. My old company, Thinking Machines, had a data mining product called Darwin that was sold to Oracle for just that reason.

So the idea of analytics closely coordinated with the business process software has always been an area of interest, an intention. However, while the analytics packages are okay, they are not always best-of-breed or useful in standalone situations. They are often considered footprint extensions of the main ERP application–they are not a focus.

Secondly, system integrators have always tried to build “solution sets” representing preintegrated business process solutions.  After all, system integrators are in a great position to understand the nuances of a client’s business process and inject analytics at the right place, in the right way, using the right tools. For years, going back to the time that I was at PricewaterhouseCoopers (PwC), producing these solution sets was a goal. Since I was in the management analytics (data mining) practice at the time, the idea was to produce analytical products that could be resold. We did not have a lot of tools, infrastructure was always hard to get allocated, and generally there were logistical issues that prevented us from realizing our dream.  The solution sets didn’t get built.

So, in one sense, the answer is no, system integrators are not a new type of analytics provider in the market. They have been trying to do this all along–for the annuity revenue and as an extension of what they had always been working on with applications and the business process. Analytics has been on the agenda for awhile.

YES

Alright, because we are also consultants, we also have to say yes. Here’s why.

System integrators are also a new analytics provider to companies because the tools and technologies, whether open source or commercial, are now becoming available at a scale that allows a system integrator to create, evolve and manage complex applications more easily and cheaper.  In short, creating analytic solutions has become a viable business for them.

Several enablers make this possible. Cloud computing has greatly enhanced the capability to create sandboxes, analysis environments and scalable analytical production systems, like scoring engines. Open source and commercial analytics software have more flexible licenses or architectures–there is probably even a hadoop app for your iPhone coming out soon that taps into your top-10 friends’ smartphones. And of course, the web has opened new analytical output delivery mechanisms that were not envisioned a decade ago. Taken together, these evolutions make it easier to deliver results.

So there is also a “yes” answer to the question because, at long last, new technologies allow a system integrator to engage in this area in a way that makes economic sense to the clients.

Inside client organizations, challenges may be creating more opportunities for system integrators to step in as analytics partner.  Analytical talent is still a scarce commodity and is shifting overseas.  Creating analytics capabilities inside a company has been harder than clients envisioned, given budget, skillsets, and focus. At the same time, time to market, the speed of delivery, required to make an improvement in a company’s top or bottom line has increased.  Data integration and data management, which are core competencies to power industrial-sized analytics—is hard work.

Considering Gartner’s question makes me smile. Yes, system integrators are making major noise in this space, and they are buying some companies here and there and forming internal groups to harvest their clients’ intellectual property and produce products. While they haven’t succeeded on a large scale in the past, things may be different in our evolving environment.

Ajilitee is already in this space and we are innovating.  We do this in the healthcare market today where security and privacy are paramount and we do it on a large scale across multiple clients. We do it with cloud computing technologies and our information management stack.  We do it because we have the business and technical talent to advise clients on how to make analytics work in their business process. We do it because we passionately believe that this approach helps our clients compete and improve their results. That’s what we are about.

Data Warehouse Project Work Effort Estimation…and Why Hope is Not a Strategy

Monday, January 31st, 2011

I talk with many clients around their data warehouse programs. In some cases, projects run longer and cost more than expected. This is true for clients who have mature data warehouse development process as well as those with new capabilities. Why is estimation so hard? Let’s ask that question in the context of healthcare analytics and figure out why counting on “hope” is a bad way to run your data warehouse program.

The Path to Reliable Estimates is not Well-Defined

One reason that estimation is hard is because it is difficult to specify analytical requirements. Yes, I know it’s fairly easy to specify that you want a report that runs a cross-tab between admission types and paid claim amounts. But let’s look a little deeper at what’s necessary to get a valid report–the analytics underlying it. What definition do you use for admission types? Are they the same definitions across all of the groups? How do you calculate paid claim amounts–what do you exclude or include? Do you have all the data? While data governance standards can help you answer these questions, sometimes it’s the next level of detail that makes it difficult to define a path from a report request to the development effort to create that report–that analytical artifact.

The path to reliable estimation is not always well-defined because analytics, using best practices, tries to be predictive. When you use predictive modeling techniques, however, you do not always know what variables are important. For example, when you are forecasting utilization rates, do you want to take last year’s utilization rates and bump them up 10 percent? Why 10 percent? Maybe other factors are influential. Are patient demographics changing? Are providers cost-shifting procedures to achieve higher reimbursement rates in one area versus another? Perhaps governmental factors, such as the increase in Medicaid enrollees, is driving up utilization for basic services.

Because you cannot specify everything, you will need to address basic utilization forecasting before you start. That means you have to play with the data, understand data quality issues, understand proxies for measures, understand the information content of specific datasets. In other words, you do not know what you need and how you will need it until you solve the problem. That’s why it’s a not a tidy, well-defined path to trustworthy analytics underlying your estimates.. At the end of the day, however, predictive modeling that takes into account various factors is still a better way to forecast than guessing or hopeful thinking.

Just because something is not well-defined does not mean you should give up or just hope that the answer you give is the right one. As we know, “hope” is not a great way to run a company or a program. Here I’ll share some of my thoughts on a more studied approach to estimating.

Use Top-Down Estimation

The most basic approach to estimation that most companies use is top-down estimation. It comes in two forms.

The first form is by saying, “hmmm….my management judgment says it will take 4 weeks to gather requirements, 3 weeks for design and architecture, 6 weeks of build and unit test, 3 weeks of systems integration testing, 2 weeks of UAT and 1 week to deploy. Okay…that’s about right.” That’s management judgment at work. This approach is actually fairly good but is highly variable and not easily reproduced consistently across different managers. This method is good at capturing organizational dynamics–for example, the business is busy this month, so requirements will take longer, or the development team is busy on another release, so they need 2x the amount of time they would normally need. So it’s good and proper to use this form of top-down estimation.

The second form of top-down estimation uses look-alike comparsions. Project X was really HARD and it took 6 months. Project Y is of the same order of complexity, so therefore it’s 6 months as well +/- 2 weeks. This approach takes into account high-level, structural complexity. For example, I have to master the members (uniquing). or I have to build an organizational view of providers, or I must consider some other aspect of complexity that can be identified at a high level. This is a good way to capture estimates for this type of information.

But top-down estimation is not enough. While it can be good at times and eerily accurate, it’s not always reliable, scalable across many projects and managers, or consistent. Hoping that top-down answers are right is not a strategy. A strategy is a purposeful creation that management should address with purposeful action. In some cases, these top-down estimates are guesses, and guesses are not why I pay managers to work for me. I do not expect perfect estimates, but I expect estimates that I can learn from organizationally, and continually refine during the life of the project. I need estimates that make me better even if they are wrong today.

Use Bottom-Up Estimation

Bottom-up estimation is the second major approach to creating work estimates. Bottom-up estimates do involve management judgment and analytical models and estimation of complexity. But they do so in a reproducable, structured way:

What goes into a bottom-up estimate?

  • How many tables are being staged from a source to a staging database, then to an operational data store, then to a data warehouse?
  • How many ETL programs must be written assuming the organization’s standard architecture and design for ETL?
  • How many sessions with users are needed to gather requirements?
  • How many reports are to be created? Are they well understood?
  • How many new servers must be procured, installed and configured?
  • How many data model changes, dimensions or facts are needed?
  • …other factors in your estimating model

Work units represent fundamental steps in data warehousing needed to properly move data from one place to the other in a way that satisfies business requirements. There are many different levels of bottom-up estimation, but generally you need these types of details, in some form, to create a large spreadsheet to perform estimation. For each item, find a count or use a parametric model to convert the count into work effort. Those conversion factors represent your productivity factors.

You cannot always get all of the data in the exact form you want. However, you can detail your estimates and submodels in the bottom-up calculation. If you don’t have the number of tables in the sources, you can estimate based on organizational average, say 50 or 20 or 30. You can model the average number of attributes per table. You can state your assumption for the number of requirements and reports. All of this can be stated and most importantly, recorded, in a spreadsheet or estimating tool that captures the assumption and allows you to change it as needed, throughout the duration of the project.

It‘s the ability to be explicit with your assumptions and use an explicit model that makes a good bottom-up estimate.

Are bottom-up estimates sometimes wrong? You bet. But through a few iterations, the models can be become significantly better more quickly. In addition, you can do things you could not do before, such as engage multiple vendors and understand their assumptions, use a disciplined number of development steps, the SDLC, built into the estimate, which again forces standardization across potentially diverse sets of suppliers.

Putting it All Together

So both top-down and bottom-up estimation are needed and useful. BOTH must be performed to arrive at an estimate that can be triangulated from different points of view. Somewhere in the middle is the work estimate you can use to start your project. Hope is not a strategy when it comes to data warehouse projects–use analytics to help you estimate and get better over time–in a way, that’s the whole point of analytics.

Putting it together also means that you need to run the top-down and bottom-up model across all projects at the same time. This allows you to identify duplication or other types of common costs that can be more efficiently solutioned.

How good can your estimates be? For small-to-large and simple-to-complex projects you can achieve good estimates with 5-10% of actuals. I’ve always considered a good estimate to be one that is within 5% of the actuals, especially for analytical projects. Your target tolerance may vary but you’ll be using a methodology that employs the analytics you are building for others.

What strategies do you use to develop reliable estimates for your data warehousing project work efforts? Let me hear from on this interesting topic.

Deep Dive on Analytical Styles: The Future Style

Tuesday, December 21st, 2010

You need this style today. You probably do not have it. Thirty years ago, statisticians starting using regression to improve response rates to direct mail campaigns. Ten years ago, artificial intelligences (AI) academics discovered new (and old) approaches to predicting the future. Five years ago, businesses saw the benefits and started incorporating future-looking analytics into the environment cost effectively. One year ago the recession hit and everyone said they were surprised (well, except a few people).

Predicting the future is hard. At the economic level it is really hard. But predicting the future at a customer level, while hard, is a much more bounded problem.

The Future Style of analytics uses your corporate customer data (touchpoints, surveys, purchase transactions) and combines with it with any other available data such as demographics, firmagraphics, behavioral and attitudinal to create a prediction. Perhaps you need to predict when a customer will churn from their cellular company, when a customer will fall into quiet mode for their DVD rentals, what they may need or want to purchase next or when they will move from the student segment to the affluent segment.
The Future Style gives you predictions at all levels, then helps you anticipate changes and provides a window, sometimes small, to do something about it. It’s up to you to act.

The Future style is not easy. It never is.

You need a large amount of past historical data. You have to assume that past behavior can reasonably predict future behavior. You also need the right math and science working on the data to make the prediction of the future with confidence. You need to take into account all of the variables that you think could influence the prediction. That’s a lot of machinery, a lot of infrastructure. It takes a coordinated effort to make this happen well, happen quickly, and most importantly integrate it into your business process.

Lets face it, if you can predict the future but can’t do anything about influencing the present to make money, there’s no point. Stop wasting time. The biggest, and I mean the biggest, issue with the Future Style is using a prediction to take action today. A lot of clients have spent their money to create a Future Style capability but they could not get their “customer channel” to change their business process and the way they interact with the company’s customers (despite my best efforts to address this with them). I said “company’s customers” because if your SBU or functional group believes they own the customer, you have to work with them (or go to the CEO) as a partner to implement actions to make the predictions valuable. That’s hard. Maybe it should not be hard, but it is.

The key to the Future Style is, yes, to have the capability to predict but also the ability to intervene to your benefit.

Analytically enabled agile companies balance all three analytical styles and use them where needed. Knowing which style to invoke in which circumstance means knowing what you want to measure—back to our strategy once again. The other necessity for analytic enablement is to have the information infrastructure behind you to support flexible, reliable and efficient access to the data and an ability to manipulate it for insight.

In the next series of blogs, I will look at what it takes from the technology perspective to be an analytically enabled agile enterprise.