David Menninger's Analyst Perspectives

Data Lakes: Safe Way to Swim in Big Data?

Posted by David Menninger on May 11, 2016 9:20:56 AM

It has been more than five years since James Dixon of Pentaho coined the term “data lake.” His original post suggests, “If you think of a data mart as a store of bottled water – cleansed and packaged and structured for easy consumption – the data lake is a large body of water in a more natural state.” The analogy is a simple one, but in my experience talking with many end users, there is still mystery surrounding the concept. In this post I’d like to clarify what a data lake is, review the reasons an organization might consider using one and the challenges data lakes present, and outline some developments in software tools that support them.

Topics: Big Data, Business Analytics, Business Intelligence, Data Governance, Data Lake, data science, Governance, Risk & Compliance (GRC), Information Management, Predictive Analytics, Social Media, Uncategorized, Strata+Hadoop