<<

White Paper AND THE COMPLEXITY MATRIX

www.sisense.com data state. data each for suited better are approaches different program, Analytics a Business considering When Complex. or Big, Diversified, Simple, be may data Your standpoints. these of both from data describes matrix complexity data The analysis. for data the to prepare is needed (cost) that store it. and The query to more needed (cost) data effort more sources the data, (data tables) the bigger the because complexity drive elements These the more effort complexity: complexity: of data key drivers two are there analytics, business of context the In complex? data makes exactly What performance. system hindering and management data in of trouble kinds all causing programs, of many bane the be can data complex organizations, IT For competition. of the ahead key to staying is the potential its unlocking and data complex this user, understanding business the For increasing. is also decisions business to make it using and data that of unlocking value the time, same At the sources. data 6 disparate than more integrate to need 70% of organizations Research, to Ventana According sources. data more are there but data, more year. per 50% is there only Not almost be 2020 will through data in growth annual compound that reports IDC exponentially. growing are environments Data Data? Complex is What The number of disparate data sources sources data of disparate number The data size of the The (or data tables). (or data billion+)? or of millions, 100’s of rows, millions (is it www.sisense.com make it ideal for self-service data visualization tools. tools. visualization data self-service for ideal it make intermediate Data, of Simple an characteristics The store. analytics data than rather database, live a querying optiononeaffordsthe Data of also directly Simple tools. modeling drop and to drag itself lending forward, straight is relationships step data the modeling properly sources, data intermediate the two or one only without With aggregations. or indexes of creating directly queried be can sets data Small analysis. for it to prepare order in massaging significant or optimization model data need not does it because is simple data This of sources. number a limited from come that sets data of smaller consists data Simple Simple Data Data Size Matrix Complexity Data

Small Large SIMPLE BIG Few # of Sources (Tables) Sources # of DIVERSIFIED COMPLEX Many the Business Analytics solution, such as a for analytic queries (e.g., Vertica), or a specialized data warehouse appliance (e.g., Netezza). Other options include running business analytics on Big Data using in memory solutions. However, these solutions have There are some challenges, though, when considering an inherent limitations in both size of data and number a Business Analytics solution for Simple data. Will your of concurrent users. Additionally as size of data grows data always be small? Will it grow over time with usage other challenges arise. Either the system costs increase or will it expand to encompass additional data elements significantly because of the memory required to store or data sources? If the size of your data or the number the complete data set or one has to give up on data of data sources (tables) grow, how will your Business granularity to fit the data set in memory. Analytics solution scale? Creating a proper Business Analytics solution for Big These challenges are magnified in the context of a real Data, presents some additional challenges. Third party time connection to a transactional database. With a data warehouse and/or appliance tools create another large number of users, executing sophisticated queries, step in the Business Analytics process. Introducing performance impact can be significant. Poor performance another moving part has both direct costs for the is still the #1 inhibitor to user adoption for business specific solution and also indirect costs in the overhead analytics. Not only does such a connection put the to configure, maintain, and support both the new tool business analytics solution at risk for poor performance, and its integration with other pieces of the Business but it can hinder performance of the core transactional Analytics puzzle. Even with these additional tools in system as well. Managing these performance issues place, the Business Analytics solution is still likely to ultimately impacts an IT organization, and what was require data preparation and aggregation. With each “simple” becomes less so. Big Data

In the context of our Data Complexity Matrix, Big data consists of larger data sets (100’s of millions of rows +) that come from a limited number of data sources (tables). This data needs special preparation because of its size. You may need DBAs to work their magic, creating indexes, aggregation tables, clusters, etc. in order to query this data with reasonable performance.

For Big data, this manual data preparation step requires investment of time and resources from an IT team before any analytic output can be generated. In addition the size of the data may require some specialized tools as part of

www.sisense.com such aggregation, business users lose data granularity and risk missing insights derived from that granularity (a traditional OLAP problem). In addition, data aggregation limits the agility of the , forcing iterative cycles with IT, each time the analysts wants to change the queries or data sets under review.

Diversified Data

Diversified data consists of smaller data sets that are derived from multiple data sources (tables). Diversified data requires special attention in the ETL step of the Complex Data process, in order to ensure correct relational structure In the context of our Data Complexity Matrix, Complex between the various data tables. As the number of data consists of larger data sets that come from multiple, data sources (and data tables) grows this ETL process disparate data sources. Complex data sets require special becomes more and more complicated, requiring attention in both the ETL process and in managing the the skills of a DBA to remodel data, creating new size of the data. Complex data combines the challenges schemas or many different views of the data. Another of both Big and Diversified data. It takes a long time consideration is the frequency with which new data to prepare because of both the modeling challenges sources will be introduced. In order to keep analysis and the challenges in indexing and aggregation. agile and current, the Business Analytics solution may Specialized skills and resources are needed throughout need to absorb new data sources, forcing iterative the Business Analytics process, turning any project into reviews of the data model and the ETL. For Diversified a cross department, lengthy effort. This manpower cost data, this ETL step requires investment of time and is amplified each time a change in the data prep or ETL resources from an IT team before any analytic output is necessary in order to investigate new analytic paths. can be generated. The ETL process can become so Often additional third party tools are required as well (as cumbersome that it may necessitate the purchase and detailed above). use of a specialized ETL tool for automating this part of the data preparation (e.g. Informatica). As a result Complex data often comes with a very high total cost of ownership. License fees for the Business Creating a proper Business Analytics solution for Analytics tool are typically just the tip of the iceberg. Diversified data, presents some additional challenges. License fees for additional data warehouse and data Third party data preparation tools create another step prep tools are additional hard costs in such cases. There in the Business Analytics process. Introducing another are also additional costs created by the overhead and moving part has both direct costs for the specific specialized skills needed to integrate and maintain solution and also indirect costs in the overhead to multiple tools from different vendors that may or may not configure, maintain, and support both the new tool work effectively together. With all of these challenges, and its integration with other pieces of the Business agility for the business analyst is diminished and time to Analytics puzzle. insight is longer.

www.sisense.com steps in the business analytics process. It eliminates the need for aggregating, compacting, and indexing data and eliminates the patchwork of multiple tools that are typically needed to analyze Complex data.

Choosing a Business Analytics Solution

Each quadrant in the Data Complexity Matrix has its own unique aspects and challenges. When evaluating different business analytics solutions, one must consider the quadrant you reside in today and the one you will reside in tomorrow.

Sisense In-ChipTM technology solves problems for Simplifying Business Big Data with its unparalleled data processing speed. Eliminating the need for massaging data and eliminating Analytics for Complex the need for additional data warehouse tools. Data Sisense Single StackTM technology solves problems for Diversified Data by unifying the business analytics Sisense technology was built specifically to simplify process into a single software solution eliminating much Business Analytics for Complex data, optimizing speed of the ETL process and greatly simplifying what is left. to handle larger data sets and unifying the Business The combination of In-ChipTM and Single StackTM Analytics process, thereby eliminating many steps in provides the 1-2 punch that solves both of the problems data preparation and easing the burden of Business associated with Complex Data. Faster processing of Analytics for IT organizations. In-ChipTM technology larger data sets and unlimited mashing of multiple data is optimized for processing speed by leveraging sources (tables). modern CPU technology. Sisense can process 100’s of millions of rows using low cost, commodity hardware. Sisense simplifies business analytics by reducing reliance Single StackTM technology fuses all the elements of a on scarce, specialized IT skills and empowering individual Business Analytics solution, from data preparation to business users. Business analysts can go from nothing data management, analytics and visualization, into a to a fully functional business analytics program using single, efficient software solution. As a result of these production data in 90 minutes, delivering the fastest two technology innovations, Sisense eliminates multiple time to insight in the market.

www.sisense.com READY FOR A TEST DRIVE? We offer a full Proof-of-Concept on your real data. See how Sisense lets you easily prepare and analyze both big and disparate datasets, from start to finish in 90 minutes.

GET STARTED

http://www.sisense.com/get/test-drive