It has been more than five years since James Dixon of Pentaho coined the term “data lake.” His original post suggests, “If you think of a data mart as a store of bottled water – cleansed and packaged and structured for easy consumption – the data lake is a large body of water in a more natural state.” The analogy is a simple one, but in my experience talking with many end users there is still mystery surrounding the concept. In this post I’d like to clarify what a data lake is, review the reasons an organization might consider using one and the challenges it presents, and outline some developments in software tools that support data lakes.
Topics: Big Data, Data Science, Predictive Analytics, Social Media, Business Analytics, Business Intelligence, Data Governance, Data Lake, Governance, Risk & Compliance (GRC), Information Management, Uncategorized, Strata+Hadoop
Talend recently announced version 5 of its information management platform, which emphasizes unifying its various components. Through a combination of development activities, acquisitions and partnerships, Talend has been steadily building its portfolio of information management capabilities. In addition to its core data integration capabilities, it has added data quality, master data management, application integration and, with this release, business process management (BPM).
Topics: Big Data, Data Quality, Master Data Management, Talend, Business Analytics, Cloud Computing, Data Governance, Data Integration, Governance, Risk & Compliance (GRC), Information Applications, Information Management, Strata+Hadoop
Kalido recently introduced version 9 of its Information Engine product. The company has been around for 10 years but has had difficulty establishing its identity in the information management market. Kalido was perhaps ahead of its time: partly a data integration vendor, partly a master data management vendor and partly a data governance vendor. As an example of the positioning challenge, its core product, Information Engine, while not a data integration tool, could in some cases provide sufficient capabilities to meet an organization’s data integration needs. Its real value, however, comes from authoring and managing information about the user’s data warehouse.
About 30 years ago, perhaps on this very day, I was sitting in front of an Apple II working on a VisiCalc spreadsheet. At the time, I don’t think I even knew who Steve Jobs was. I wasn’t in the software industry yet. I was working for a public accounting firm. The Apple II sat in a corner of the office “typing pool.” For those of you who don’t know what a typing pool was, there was no swimming involved – it was a group of full-time employees with dedicated equipment who did all the typing and word processing tasks of the office.
Topics: Mobile, Sales Performance, Social Media, Supply Chain Performance, Sustainability, Business Analytics, Business Collaboration, Business Intelligence, Business Mobility, Business Performance, Customer & Contact Center, Financial Performance, Governance, Risk & Compliance (GRC), Information Applications, Information Management, Location Intelligence, Operational Intelligence, Visualization, IT Performance Management (ITPM)