<<

THE LAKE COMPANY

THE DATA LAKE COMPANY

Product Brief

Self Service Data Preparation Enables More Democratized Access to Data, Maximizes Big Data ROI Benefits of Mica

Enterprises must modernize their big data architecture and shorten data preparation time so that business users and data architects can be more productive. To achieve this, business users need direct access to data. By leveraging Bedrock’s metadata Data visibility management platform while maintaining the governance policies and controls Mica provides an easy-to-use required by IT, Mica provides users with: enterprise-wide data catalog (across multiple Hadoop clusters) 1. An on-ramp for self-service data discovery that is faceted and searchable. 2. The ability for 3. The ability for data provisioning Improved time-to-analytics With Mica, business users have the tools they need for rapidly discovering data Users can interactively transform sets, interacting with them and uncovering insights. data to reduce preparation time and interact with sample data to detect patterns and outliers. Explore: Enterprise-wide data catalog to explore and curate Mica provides an enterprise-wide data catalog that allows easy exploration and searchability of datasets by business users, data architects, data scientists, or Operationalized transformations any subject matter experts. Gone are the days of spending time going back and Users can push transformations forth with various development teams to find what is needed. In addition, data to Bedrock, Zaloni’s data lake stewards can use Mica to add/update labels, descriptions and ratings to entities, management platform, which manages and automates which can help others get better search results. monitoring, scheduling, How does it work? Mica leverages the power of the Zaloni Bedrock metadata notifications and more. This allows users to leverage the Hadoop repository to collect business, technical, and operational metadata from across cluster to run transformations. your enterprise data lake. It provides a single view of all data assets in one dashboard. This data catalog provides a rich marketplace experience for users, where they can browse, and select and group the datasets that they are most interested in. Data consumers can search the catalog two ways: Improved team collaboration Using Workspaces, users can share 1. Free-form text: Users type in keywords and receive matching results their enrichments and entities of based on relevance (e.g., date created, best match, last ingested, etc.). interest with peers, building on each other’s work 2. Multi-faceted search: Using filters based on metadata such as data format, and helping others get better search results. source platform, source schema, subject area and more, users can narrow the search results to a set of entities.

Mica Product Brief 1 THE DATA LAKE COMPANY

THE DATA LAKE COMPANY

Once the user is interested in a dataset, they can further own custom expressions using Google Refine Expression explore within Mica and see: Language (GREL), which has support for variables, built-in functions and controls. • Business description of entities, tags and labels as provided in Bedrock Once the sample data results are satisfactory, the user can: • Technical structure, including column names, data types, • Export results in CSV: User can manipulate the results in a formats and sensitivity flag spreadsheet • results for each column as defined in Bedrock • Publish the enrichment as a Bedrock entity: Bedrock creates a • Data profiling on each column, including frequency distribution, workflow and then executes it ranges and counts of nulls

• A data preview that shows a sample from the entity

For each column, the user can create profiling information like frequency counts, min and max ranges, scatterplot, text length that is seen on the left side of the data grid

Data Catalog View with facets and filters

Prepare: Self-service data preparation to reduce time-to-analytics Mica provides users with a sandbox to interact and work with sample data from the dataset. Users can play with this sample data without affecting the source data. Using the one-click “Enrich” button, the user is provided a tabular view of the data along with a list of the various transformations Perform transformations such as formatting, split, other common text functions, date functions, numeric functions, that can be applied to each column. In addition to out-of-the- transpose, sort and add new columns. These transformations box functionality, Mica allows advanced users to create their are applied in real time on sample data and users can iterate and further apply additional functions

Mica Product Brief 2 THE DATA LAKE COMPANY

THE DATA LAKE COMPANY

Collaborate: Workspaces enable collaboration for improved productivity Mica’s Workspaces enable teams to collaborate to increase productivity. A Workspace includes entities hand-picked by a team member and shows the history of enrichments/ transformations done to those entities. Workspaces also allow teams to do “Smart Searches,” which save search criteria and display entities that match the criteria in real time.

Workspaces enabling teams to collaborate and Operationalize: Seamless workflow share datasets operationalization for increased efficiency Using Mica, data users can easily define a process and then operationalize it in Bedrock. Mica automatically converts the UI steps into Spark code, transfers it to Bedrock, and creates a and executes a workflow. The Bedrock workflow can be scheduled/modified by IT to operationalize the process. With this feature, data-savvy users can perform on any dataset from the data lake and more easily operationalize it using Bedrock.

Mica Product Brief 3 THE DATA LAKE COMPANY

THE DATA LAKE COMPANY

Self Service Data Preparation Enables More Democratized Access to Data, Maximizes Big Data ROI To Learn More:

Call us: +1 919.323.4050 About Zaloni E-mail: [email protected] Zaloni, the data lake company, is a provider of enterprise data lake management Visit: http://www.zaloni.com solutions. Our software platforms, Bedrock and Mica, enable customers to gain competitive advantage through organized, actionable big data lakes. Serving the Find Us on Social Media: Fortune 500, Zaloni has helped its customers build production implementations at Twitter handle @zaloni many of the world’s leading companies. To learn more, visit: www.zaloni.com

Services and Training Zaloni Professional Services offer expert consulting and training services to help you reduce risk, accelerate adoption, and improve business performance. Through the use of these services, delivered globally, you can move quickly from pre- production to post-production to maximize the value of your investment.

Customer Support We provide a comprehensive customer support experience that includes online portal access to software and documentation, and a robust knowledge base.

MICAPBRIEF_09/16 Mica Product Brief 4