A growing number of organizations now have multiple data lakes that use different technologies. These data sources must be integrated into a fused data lake to allow data scientists and other users to access data crucial for analytics.
Generally, there are three solutions: integration by data science tool, by data replication, or by data virtualization.
In this webinar, Rick van der Lans will set out the pros and cons of each of these. The focus will be on how data virtualization technology simplifies access to this new data lake architecture.
- Why organizations have multiple data lakes
- How the three solutions compare
- Key data virtualization features for developing a fused data lake