In the past year, big data players have increasingly adopted the Apache Spark distributed computing architecture. This growth has been fueled by several innovative differentiators for big data applications, such as MapReduce 2.0 (or YARN), provisions for analytic workflows, and efficient use of memory.
As one of the most exciting and widely adopted open-source projects, Apache Spark in-memory clusters are driving new opportunities for application development as well as increased intake of IT infrastructure. Apache Spark is now the most active Apache project, with more than 600 contributions being made in the last 12 months by more than 200 organizations.
In fact, adoption of Spark is outpacing Hadoop adoption because of its accelerated access to big data, according to Databricks’ recent 2015 Spark industry survey. For solution providers, the Apache Spark technology stack is a significant player because it’s one of the core technologies used to modernize data warehouses, a huge segment of the IT industry that accounts for multiple billions in revenue.
In support of this new computing architecture, TIBCO is hosting a webinar on Tuesday, Nov. 17, 2015 at 1 p.m. EST, to discuss one of the most popular components of Spark: Spark SQL, Spark’s module for working with structured data. The webinar will feature Daniel Gutierrez, managing editor of InsideBIGDATA.
During the event, the webinar panel will offer an overview of Spark and discuss why Spark is so hot. The panel will also look at Spark through a Hadoop lens as well as focus on Spark SQL and the TIBCO–Spark connection.