The technology industry throws around a lot of similar terms with different meanings as well as entirely different terms with similar meanings. In this post, I don’t want to debate the meanings and origins of different terms; rather, I’d like to highlight a technology weapon that you should have in your data management arsenal. We currently refer to this technology as data virtualization. Other similar terms you may have heard include data fabric, data mesh and [data] federation. I’ll briefly discuss these terms and how I see them being used, but ultimately, I’d like to share with you some research that shows why data virtualization can be valuable, regardless of what you call it.
Alteryx is a data analytics software company that offers data preparation and analytics tools to simplify and automate data wrangling, data cleaning and modeling processes, enabling line-of-business personnel to quickly access, manipulate, analyze and output data. The platform features tools to run a variety of analytic functions such as diagnostic, predictive, prescriptive and geospatial analytics in a unified platform, and can connect to various data warehouses, cloud applications, spreadsheets and other sources.
Data governance is a hot topic these days. In fact, we are conducting benchmark research on the subject here. With increasing regulations such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA), organizations face more external oversight of their data governance practices. The risk of significant fines associated with these and other regulations, coupled with organizations’ internal compliance requirements, has brought more attention to data governance practices. We anticipate through 2023, three-quarters of Chief Data Officers’ primary concerns will be governing the privacy and security of their organization’s data.
Collibra is a data governance software company that offers tools for metadata management and data cataloging. The software enables organizations to find data quickly, identify its source and assure its integrity. Line-of-business workers can use it to create, review and update the organization's policies on different data assets. Collibra’s software uses a microservice architecture and open application programming interfaces to connect to various data ecosystems. Its data intelligence cloud platform can automatically classify data from various sources such as online transaction processing databases, master repositories and Excel files without moving the data, so the information assets stay protected.
Sisu Data is an analytics platform for structured data that uses machine learning and statistical analysis to automatically monitor changes in data sets and surface explanations. It can prioritize facts based on their impact and provide a detailed, interpretable context to refine and support conclusions. The product features fact boards, annotations and the ability to share facts and analysis across teams. Data teams and analysts start by creating common definitions of key performance indicators, which Sisu then utilizes to automatically test thousands of hypotheses to identify differences between groups.
Rapidminer is a visual enterprise data science platform that includes data extraction, data mining, deep learning, artificial intelligence and machine learning (AI/ML) and predictive analytics. It can support AI/ML processes with data preparation, model validation, results visualization and model optimization. Rapidminer Studio is its visual workflow designer for the creation of predictive models. It offers more than 1,500 algorithms and functions in their library, along with templates, for common use cases including customer churn, predictive maintenance and fraud detection. It has a drag and drop visual interface and can connect to databases, enterprise data warehouses, data lakes, cloud storage, business applications and social media. The platform also supports push-down processing for data prep and ETL inside databases to minimize data movement and optimize performance.
Confluent Platform is a streaming platform built by the original creators of Apache Kafka. It enables organizations to organize and manage streaming data from various sources. Confluent launched its IPO in June this year and raised $828 million to further expand its business. Confluent Platform was brought to several public cloud vendor marketplaces last year as Confluent Cloud. The offering is currently available in Azure, AWS, and GCP marketplaces. Furthermore, the company strengthened its partnership with Microsoft at the beginning of this year, establishing Confluent Cloud as a fully managed Apache Kafka service directly available on Microsoft Azure. Azure customers can access the extensive library of pre-built connectors, a unified billing model with options to use Azure committed spend on Confluent Cloud, and deeper integrations with Azure services.
The annual Ventana Research Digital Innovation Awards showcase advances in the productivity and potential of business applications, as well as technology that contributes significantly to the improved processes and performance of an organization. Our goal is to recognize technology and vendors that have introduced noteworthy digital innovations to advance business and IT.
Event data can be used to enhance existing processes, but it can also be used to dramatically impact operations, revenue models and the bottom line for manufacturers. Our Benchmark Research shows 95% of manufacturers consider it important to speed the flow of information and improve responsiveness within business processes. In this perspective I’ll share how manufacturers are working with event data to transform their organizations.
I am happy to share insights gleaned from our latest Value Index research, an assessment of how well vendors’ offerings meet buyers’ requirements. The 2021 Ventana Research Value Index: Collaborative Analytics and Data is the distillation of a year of market and product research by Ventana Research. See our prior post for a description of our methodology and included vendors.