It has been more than five years since James Dixon of Pentaho coined the term “data lake.” His original post suggests, “If you think of a data mart as a store of bottled water – cleansed and packaged and structured for easy consumption – the data lake is a large body of water in a more natural state.” The analogy is a simple one, but in my experience talking with many end users there is still mystery surrounding the concept. In this post I’d like to clarify what a data lake is, review the reasons an organization might consider using one and the challenges they present, and outline some developments in software tools that support data lakes.
Topics: Big Data, Business Analytics, Business Intelligence, Data Governance, Data Lake, Data Science, Governance, Risk & Compliance (GRC), Information Management, Predictive Analytics, Social Media, Uncategorized, Strata+Hadoop
Talend recently announced version 5 of its information management platform, which emphasizes unifying its various components. Through a combination of development activities, acquisitions and partnerships, Talend has been steadily building its portfolio of information management capabilities. In addition to its core data integration capabilities, it has added data quality, master data management, application integration and with this release business process management (BPM).
Topics: Big Data, Business Analytics, Data Governance, Data Integration, Data Quality, Governance, Risk & Compliance (GRC), Information Applications, Information Management, Master Data Management, Talend, Strata+Hadoop, Cloud Computing
Kalido recently introduced version 9 of its Information Engine product. The company has been around for 10 years but has had difficulty establishing its identity in the information management market. Kalido was perhaps ahead of its time, partly a vendor of data integration, partly master data management and partly data governance. As an example of the positioning challenge, its core product, Information Engine, while not a data integration tool, could in some cases provide sufficient capabilities to meet an organization’s data integration needs. Its real value, however, comes from authoring and management of information about the user’s data warehouse.
About 30 years ago, perhaps on this very day, I was sitting in front of an Apple II working on a VisiCalc spreadsheet. At the time, I don’t think I even knew who Steve Jobs was. I wasn’t in the software industry yet. I was working for a public accounting firm. The Apple II sat in a corner of the office “typing pool.” For those of you who don’t know what a typing pool was, there was no swimming involved – it was a group of full-time employees with dedicated equipment who did all the typing and word processing tasks of the office.
Topics: Business Analytics, Business Collaboration, Business Intelligence, Business Mobility, Business Performance, Customer & Contact Center, Financial Performance, Governance, Risk & Compliance (GRC), Information Applications, Information Management, Location Intelligence, Mobile, Operational Intelligence, Sales Performance, Social Media, Supply Chain Performance, Sustainability, Visualization, IT Performance Management (ITPM)