How to Apply Machine Learning and Big Data Analytics to Real-Time Processing

Big data analytics (bigdata) and data mining concept. Businessman think about big data and data mining issues (volume velocity variety variability veracity complexity). Wide banner composition and office in background.
Reading Time: 2 minutes

In March 2016, I had a talk at Voxxed Zurich about “How to Apply Machine Learning and Big Data Analytics to Real Time Processing”.


Finding Insights with TIBCO Spotfire and TERR

Big data is currently a big hype. Large amounts of historical data are stored in Hadoop or other platforms. Data discovery tools and statistical computing are used to draw new knowledge and to find patterns from this data for promotions, cross-selling, fraud detection, or predictive maintenance. TIBCO Spotfire can be used by business users for visualization and data discovery to find these insights.

In addition, a data scientist can support business users by building analytic models with tools and frameworks such as R/TERR, Apache Spark MLlib or H20. Machine learning helps create and train powerful algorithms, which can improve business processes and add business value. These analytic models can then be embedded into TIBCO Spotfire, so that the business user can leverage them without knowing the detailed logic behind an algorithm.

The key challenge is how these insights can then be integrated from historical data to new transactions in real time to make customers happy, increase revenue, prevent fraud, or replace machines before they break.

Putting Analytic Models into Action with TIBCO StreamBase and Live Datamart

Fast Data via stream processing is the solution to embed patterns—which were obtained from analyzing historical data—into future transactions in real time. The following slide deck uses several real world success stories to explain the concepts behind stream processing, respectively streaming analytics, and its relation to Apache Hadoop and other big data platforms. I discuss how patterns and statistical models of R, H20, Spark MLlib, and other technologies (using PMML standard) can be integrated into real-time processing using TIBCO StreamBase. TIBCO Live Datamart is added on top of streaming analytics to enable a live view of the data, as well as proactive human interactions.

Here is the slide deck, which explains how to realize a closed loop from big data analytics across machine learning and analytic models to real-time actions with streaming analytics: