Skip to content

Latest commit

 

History

History
741 lines (545 loc) · 66.8 KB

.README2.md

File metadata and controls

741 lines (545 loc) · 66.8 KB


ClearScape Analytics™ Demonstrations via Jupyter

Welcome to ClearScape Analytics Experience. This service consists of multiple demonstrations of the industry leading in-database analytics that you can run on your own. You can modify them or use them as examples to use with your own tools against our data or small (not sensitive) data you upload. Each notebook will:

  • describe the business situation,
  • will attach the needed data from the cloud, and
  • walk you step-by-step through the use of the ClearScape Analytics functionality.

These are functional demonstrations executed on a tiny platform with small data, but the same functionality is available on all of our platforms up to one with hundreds of nodes and petabytes of data. ClearScape Analytics allows you to apply AI, ML and advanced statistics to your data without the cost and complexity of exporting data. You can develop sophisticated models on other platforms with your favorite tools and import those models to execute in production at massive scale.

If you've never used Jupyter before, we strongly recommend reviewing the First Time User section of Getting Started. You'll find an introduction video with tips on using this platform. There are also tips for you if you just want to look without programming. If you have questions or issues, click here to send an e-mail to ClearScape Analytics Support.

Telco


Table of Contents

Items in italics are coming soon.

Getting Started Industries Business Function Analytic Function 3rd Party Tools
First Time User Automotive Finance Data Preparation AWS SageMaker
I am not a programmer Energy & Natural Resources Marketing Descriptive Statistics Azure ML
Developer Information Financial Feature Engineering Dataiku
Healthcare Generative AI H2O.ai
Manufacturing Geospatial Microsoft PowerBI
Retail Hypothesis testing MicroStrategy
Telco Machine learning R
Travel & Transportation ModelOps SAP Business Objects
Object Storage SAS
Open-and-connected analytics Tableau
Path Analytics Vertex
Text Analysis



Getting Started

First Time User

Introduction Video

Video description how to find demos in the index and folder view, tips on running demos and options for foreign vs local tables used in the demonstrations in your ClearScape Analytics environment.
Information Only

Basic Jupyter Navigation

When running a Jupyter Notebook, there are various indicators that show what is happening. This is a guide to those indicators.
SQL VersionRead Only SQL Version

Back to Table of Contents

I am not a programmer

I am not a programmer

Not everyone that uses this site will want to learn programming. Some will want to review the business cases, look at the steps for the analysis and look at the tables, charts and maps. This is a guide for those people.
Information Only

Back to Table of Contents

Developer Information

Data Loading (Python)

Shows how to use python to load CSV data from local storage and from zipped files
Python Version

Data Loading (SQL)

Shows multiple ways to load data from local CSV files, and cloud files on Google and AWS using the SQL kernel.
SQL Version

Query Service REST API

Demonstration of using REST API calls to Vantage which is useful for web or mobile applications to access and maintain data.
Python Version

SQL Basics in Jupyter

This guide will walk you through writing your first SQL queries in Jupyter. It uses some of the Vantage system tables as a source for the queries.
SQL VersionRead Only SQL Version

teradataml Python Basics

Introduction to Teradataml package for Python including connecting to Vantage, Teradata DataFrames, data manipulation and export to Pandas.
Python VersionRead Only Python Version

Intro to Panda for Python

Provides step-by-step instructions on the basics of using Python Pandas with Jupyter notebooks.
Python VersionRead Only Python Version

Charting and Visualization

Data from queries is brought to life with graphics and charts. This shows how to use the %chart magic command to display results.
SQL VersionRead Only SQL Version

VAL Overview

Vantage Analytics Library (VAL) is a set of over 50 functions for advanced analytics. This provides an overview and links to an 8 minute video overview.
SQL VersionRead Only SQL Version

Data Dictionary

This provides an index to all of the databases used by demo notebooks on this machine, allowing you to use that data for your own notebooks or BI tools.
Python Version

How to Submit Your Demos

It is very easy to submit your demo for publication. Tell us directory with the notebook and referenced files and grant us access to your database. We'll take it from there.
Python Version

Back to Table of Contents


Industries

Automotive

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionRead Only Python-SQL Version

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL VersionRead Only Python-SQL Version

Back to Table of Contents

Energy & Natural Resources

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python VersionRead Only Python Version

Back to Table of Contents

Financial

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python VersionRead Only Python Version

Cash Demand Forecasing

Predicts the future demand for cast in Automatic Teller Machines (ATMs) using Auto Regressive Integrated Moving Average using In-Database functions.
SQL Version

Consumer Complaints

Uses government consumer complaint data with SQL queries and visualizations to identify source of top complaints.
SQL VersionRead Only SQL Version

Credit Card Data Preparation

This shows the use of ClearScape analytics to reduce the pre-processing effort of incoming raw credit card data to prepare for analysis of potential loan defaulters.
SQL VersionRead Only SQL Version

Financial Customer Journey

Uses analytic techniques to find new customers, measure marketing attribution, and maximizing marketing effectiveness
SQL VersionPython-SQL Version

Financial Fraud Detection InDB

Detect financial transaction fraud using powerful in-database machine learning functions
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

Insurance Policy Temporal

Show As-IS/As-Was capabilities of Vantage Temporal to dramatically simplify the SQL and improve performance for analyzing insurance policies versus claims.
SQL VersionRead Only SQL Version

Back to Table of Contents

Healthcare

02 Explore Diabetes Data

This is a guide through the PIMA Diabetes prediction including data exploration and model experimentation.
Python Version

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

Diabetes Prediction via BYOM H2O

This uses BYOM to import a trained H2O model to identify potential diabetes patients. BYOM allows the data scientist to create models in languages they prefer and run at scale inside Vantage without moving data.
Python-SQL Version

Diabetes Prediction via DF and GLM

Decision Forest and Generalized Linear Model are applied to identify factors that indicate diabetes. The earlier the disease is identified, the better the chance of reducing organ damage.
Python VersionPython-SQL Version

Heart Failure Prediction

Machine learning is applied to the complex attributes of patients to help recognize patterns that may lead to heart failure faster than a human may recognize.
Python-SQL VersionRead Only Python-SQL Version

Knee Replacement Attribution

The ClearScape Analytics Attribution function is used to determine the weight of various events that precede the final outcome, in this example, knee replacement.
SQL VersionRead Only SQL Version

Knee Replacement nPath

This uses the ClearScape Analytics nPath® function to provide visuals on the events leading up to the final outcome, in this case, knee replacement.
Python VersionRead Only Python Version

Back to Table of Contents

Manufacturing

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionRead Only Python-SQL Version

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL VersionRead Only Python-SQL Version

Car Complaints

Uses geospatial techniques to locate service centers close to the complaint and outlier detection to detect part defects earlier than expected.
Python-SQL VersionRead Only Python-SQL Version

Green Manufacturing

Uses analytic and ML techniques to predict how long vehicle testing will take based on combination of features installed.
Python-SQL VersionRead Only Python-SQL Version

Predictive Maintenance

Uses ML functions to predict failures to identify and mitigate potential machine failures before they occur.
Python-SQL VersionRead Only Python-SQL Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL VersionRead Only Python-SQL Version

Back to Table of Contents

Retail

Broken Digital Journey

This uses nPath® analysis to show the retail events that prevent the ultimate objective: a purchase. This uses interactive Sankey diagrams to understand the problems.
Video VersionPython VersionRead Only Python Version

Customer Behavior Analysis

Analysis of customer purchase behavior using nPath® analysis in Python with visualization using Sankey diagrams.
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

Deep History via Object Store

This demonstrates integration of local data or foreign tables on GCP and integration across cloud providers to detailed historical sales records on AWS.
SQL Version

Hyper-Personalization

Hyper-personalization creates models from customer interations on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL VersionRead Only Python-SQL Version

K-Means Clustering and ML model

This uses the unsupervised K-Means ML algorithm to identify patterns in retail purchases.
SQL VersionRead Only SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL VersionRead Only Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL VersionRead Only Python-SQL Version

Retail Demand Forecasting

This creates an ARIMA time series model based on holidays and merchandising activities impacting store sales for a hypermarket retailer.
Python VersionRead Only Python Version

Retail Item Demand Forecast

Predicts demand for retail products showing how multiple models can be run concurrently.
Python-SQL VersionRead Only Python-SQL Version

Store Sales Forecast via ARIMA

Forecasts total store sales using ARIMA (AutoRegressive Integrated Moving Average)
Python VersionRead Only Python Version

Store Sales Forecast via Prophet

Uses the Script Table Operator (STO) to run the H20 machine learning library as an extension to ClearScape Analytics. H20 runs in parallel inside the Vantage database without exporting data to another platform.
Python-SQL VersionRead Only Python-SQL Version

Text Term Frequency

Use NGram splitter to analyze comments retail products to determine patterns of words used to describe products.
SQL VersionPython VersionPython-SQL Version

Back to Table of Contents

Telco

Graph Analysis of CDR Records

Uses graph analysis to identify communities and key influencers within Call Data Records. This uses Script Table Operator to invoke external procedures to work inside the Vantage database without exporting data.
Python-SQL VersionRead Only Python-SQL Version

Telco Customer Churn

This uses logistic regression for supervised learning to predict the probability of a customer switching vendors based on usage patterns, billing information demographics and interactions. XGBoost is then used in database to improve the prediction.
Python-SQL VersionRead Only Python-SQL Version

Telco Network Coverage

Demonstrates the ability of Geospatial to show signal strength, coverage areas and travel path of customers through cell tower coverage area.
Python-SQL VersionRead Only Python-SQL Version

Back to Table of Contents

Travel & Transportation

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
SQL VersionPython-SQL Version

Air Passenger Forecasting

Applies Auto Regressive Integrated Moving Average (ARIMA) analysis to forecast airplane passenger volume.
SQL VersionRead Only SQL Version

NYC Taxi Geospatial

Applies geospatial functions to analysis of NYC Taxi pickups and drop off locations.
Python-SQL VersionRead Only Python-SQL Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL VersionRead Only Python-SQL Version

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python VersionRead Only Python Version

Back to Table of Contents


Business Function

Marketing

Hyper-Personalization

Hyper-personalization creates models from customer interations on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL VersionRead Only Python-SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL VersionRead Only Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL VersionRead Only Python-SQL Version

Back to Table of Contents


Analytic Function

Data Preparation

Credit Card Data Preparation

This shows the use of ClearScape analytics to reduce the pre-processing effort of incoming raw credit card data to prepare for analysis of potential loan defaulters.
SQL VersionRead Only SQL Version

Data Prep and Transformation

This demonstrates a subset of the over 100 analytic functions in the teradataml package for Python
Python VersionPython-SQL Version

Outlier Analysis

Outliers in an analysis skew the results and make it difficult to recognize the main patterns. ClearScape Analytics has tools to remove outliers automatically.
SQL VersionRead Only SQL Version

Back to Table of Contents

Descriptive Statistics

VAL Descriptive Statistics

This performs in-database analysis of data values, distribution, histograms, and text field analysis using SQL to access the Vantage Analytics Library.
SQL VersionRead Only SQL Version

VAL teradataml Demo

Demonstrated the use of Teradataml in Python to perform descriptive statistics, transformation, model building model evaluation and scoring.
Python VersionRead Only Python Version

Back to Table of Contents

Geospatial

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
SQL VersionPython-SQL Version

Car Complaints

Uses geospatial techniques to locate service centers close to the complaint and outlier detection to detect part defects earlier than expected.
Python-SQL VersionRead Only Python-SQL Version

NYC Taxi Geospatial

Applies geospatial functions to analysis of NYC Taxi pickups and drop off locations.
Python-SQL VersionRead Only Python-SQL Version

Sensor Data Analytics

Creative application of geospatial to locations of sensors in a research lab and integration of data from tables with detailed recordings on cloud storage.
Python-SQL VersionRead Only Python-SQL Version

Telco Network Coverage

Demonstrates the ability of Geospatial to show signal strength, coverage areas and travel path of customers through cell tower coverage area.
Python-SQL VersionRead Only Python-SQL Version

Back to Table of Contents

Hypothesis testing

VAL Hypothesis Tests

This demonstrates a subset of the 18 hypothesis test in the Vantage Analytics library using SQL such as Parametric, Binomial, Kolmogorov/Smirnoff, Rank, etc.
SQL VersionRead Only SQL Version

Back to Table of Contents

Machine learning

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionRead Only Python-SQL Version

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python VersionRead Only Python Version

Diabetes Prediction via DF and GLM

Decision Forest and Generalized Linear Model are applied to identify factors that indicate diabetes. The earlier the disease is identified, the better the chance of reducing organ damage.
Python VersionPython-SQL Version

Financial Fraud Detection InDB

Detect financial transaction fraud using powerful in-database machine learning functions
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

Graph Analysis of CDR Records

Uses graph analysis to identify communities and key influencers within Call Data Records. This uses Script Table Operator to invoke external procedures to work inside the Vantage database without exporting data.
Python-SQL VersionRead Only Python-SQL Version

Green Manufacturing

Uses analytic and ML techniques to predict how long vehicle testing will take based on combination of features installed.
Python-SQL VersionRead Only Python-SQL Version

Heart Failure Prediction

Machine learning is applied to the complex attributes of patients to help recognize patterns that may lead to heart failure faster than a human may recognize.
Python-SQL VersionRead Only Python-SQL Version

Hyper-Personalization

Hyper-personalization creates models from customer interations on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL VersionRead Only Python-SQL Version

K-Means Clustering and ML model

This uses the unsupervised K-Means ML algorithm to identify patterns in retail purchases.
SQL VersionRead Only SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL VersionRead Only Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL VersionRead Only Python-SQL Version

Predictive Maintenance

Uses ML functions to predict failures to identify and mitigate potential machine failures before they occur.
Python-SQL VersionRead Only Python-SQL Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL VersionRead Only Python-SQL Version

Retail Item Demand Forecast

Predicts demand for retail products showing how multiple models can be run concurrently.
Python-SQL VersionRead Only Python-SQL Version

Store Sales Forecast via ARIMA

Forecasts total store sales using ARIMA (AutoRegressive Integrated Moving Average)
Python VersionRead Only Python Version

Store Sales Forecast via Prophet

Uses the Script Table Operator (STO) to run the H20 machine learning library as an extension to ClearScape Analytics. H20 runs in parallel inside the Vantage database without exporting data to another platform.
Python-SQL VersionRead Only Python-SQL Version

Telco Customer Churn

This uses logistic regression for supervised learning to predict the probability of a customer switching vendors based on usage patterns, billing information demographics and interactions. XGBoost is then used in database to improve the prediction.
Python-SQL VersionRead Only Python-SQL Version

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python VersionRead Only Python Version

VAL Analytics and ML

Demonstration of Vantage Analytic Library scoring and analytic functions like linear regression, decision trees, K-Means clustering, Factor Analysis, etc.
SQL VersionRead Only SQL Version

Vertex AI Integration

In this tutorial, we will show how to integrate Vantage Analytics capabilities in Vertex AI ML Pipelines. Vertix AI is the environment for data scientists to deploy ML models.
Python Version

Back to Table of Contents

ModelOps

00 ModelOps Introduction

This introduction and table of contents introduces you to ModelOps and provides a launch for ModelOps. It is recommended to go through ModelOps deomstrations in sequence.
Information Only

01 ModelOps Getting Started

This introduces the ModelOps methodology, provides an overview video, and a description of navigating the projects, models, and datasets plus a description of monitoring capabilities.
Python Version

02 Explore Diabetes Data

This is a guide through the PIMA Diabetes prediction including data exploration and model experimentation.
Python Version

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

06 ModelOps Project Setup

Shows you how to set up your own GIT repository for models and create a new project in ModelOps associated with your new repository. This step is required for the next notebooks.
Python Version

07 ModelOps Define Functions

For the project you've created in ModelOps, this shows definition of the training function, evaluate function, scoring function, life cycle, and monitoring.
Python Version

08 ModelOps Add H20 to Project

Demonstrates the use of ModelOps to finalize the H2O AI model, train, evaluate, approve, deploy, score and monitor.
Python Version

09 ModelOps Add XGBoost to Project

Uses XGBoost algorithm to generate both Python Joblib and PMML model formats and operationalize them through ModelOps.
Python Version

10 ModelOps Add R gbm Model to Project

Uses the gbm R package to generate both R model formats and operationalize through ModelOps. The gbm R packages extends Freund & Schapire's AadaBoost algorithm and Friedman's Gradient Boosting Machine (gbm).
Python Version

Back to Table of Contents

Object Storage

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL VersionRead Only Python-SQL Version

Deep History via Object Store

This demonstrates integration of local data or foreign tables on GCP and integration across cloud providers to detailed historical sales records on AWS.
SQL Version

Back to Table of Contents

Open-and-connected analytics

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionRead Only Python-SQL Version

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python VersionRead Only Python Version

Dataiku

Discusses how the 3rd party tool DataIku can be used with Vantage.
Information Only

Diabetes Prediction via BYOM H2O

This uses BYOM to import a trained H2O model to identify potential diabetes patients. BYOM allows the data scientist to create models in languages they prefer and run at scale inside Vantage without moving data.
Python-SQL Version

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python VersionRead Only Python Version

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

teradataml Python Basics

Introduction to Teradataml package for Python including connecting to Vantage, Teradata DataFrames, data manipulation and export to Pandas.
Python VersionRead Only Python Version

Vertex AI Integration

In this tutorial, we will show how to integrate Vantage Analytics capabilities in Vertex AI ML Pipelines. Vertix AI is the environment for data scientists to deploy ML models.
Python Version

Back to Table of Contents

Path Analytics

Broken Digital Journey

This uses nPath® analysis to show the retail events that prevent the ultimate objective: a purchase. This uses interactive Sankey diagrams to understand the problems.
Video VersionPython VersionRead Only Python Version

Customer Behavior Analysis

Analysis of customer purchase behavior using nPath® analysis in Python with visualization using Sankey diagrams.
Python VersionPython-SQL VersionRead Only Python VersionRead Only Python-SQL Version

Financial Customer Journey

Uses analytic techniques to find new customers, measure marketing attribution, and maximizing marketing effectiveness
SQL VersionPython-SQL Version

Knee Replacement Attribution

The ClearScape Analytics Attribution function is used to determine the weight of various events that precede the final outcome, in this example, knee replacement.
SQL VersionRead Only SQL Version

Knee Replacement nPath

This uses the ClearScape Analytics nPath® function to provide visuals on the events leading up to the final outcome, in this case, knee replacement.
Python VersionRead Only Python Version

MultiTouch Attribution

Shows rule-based, Statistics, and Algorithmic attribution of the marketing touchpoints leading to conversion. Ten approaches will be demonstrated along with path analysis of effectiveness and cost of conversion.
Python-SQL VersionRead Only Python-SQL Version

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python VersionRead Only Python Version

Back to Table of Contents

Text Analysis

Text Term Frequency

Use NGram splitter to analyze comments retail products to determine patterns of words used to describe products.
SQL VersionPython VersionPython-SQL Version

VAL Descriptive Statistics

This performs in-database analysis of data values, distribution, histograms, and text field analysis using SQL to access the Vantage Analytics Library.
SQL VersionRead Only SQL Version

Back to Table of Contents

Time series analytics

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
SQL VersionPython-SQL Version

Air Passenger Forecasting

Applies Auto Regressive Integrated Moving Average (ARIMA) analysis to forecast airplane passenger volume.
SQL VersionRead Only SQL Version

Carbon Footprint Analytics

A key component of ESG is Carbon Footprint. This demonstrates a part of a solution available from Teradata to integrate multiple data sources to calculate carbon footprint of various corporate activities.
Python-SQL VersionRead Only Python-SQL Version

Cash Demand Forecasing

Predicts the future demand for cast in Automatic Teller Machines (ATMs) using Auto Regressive Integrated Moving Average using In-Database functions.
SQL Version

Consumer Complaints

Uses government consumer complaint data with SQL queries and visualizations to identify source of top complaints.
SQL VersionRead Only SQL Version

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Fourier Transforms

Fourier transformations are demonstrated to filter out noise from signals to allow identification of underlying patterns.
SQL VersionRead Only SQL Version

Insurance Policy Temporal

Show As-IS/As-Was capabilities of Vantage Temporal to dramatically simplify the SQL and improve performance for analyzing insurance policies versus claims.
SQL VersionRead Only SQL Version

NYC Taxi Temporal

Time series data can answer the questions about what was happening at a point in time. This applies Time series and temporal capabilities of vantage to NYC taxi data.
SQL VersionRead Only SQL Version

Retail Demand Forecasting

This creates an ARIMA time series model based on holidays and merchandising activities impacting store sales for a hypermarket retailer.
Python VersionRead Only Python Version

Vantage Query Log Analysis

Analysis of sessions and queries you executed using the built-in logging facilities of Vantage.
SQL VersionRead Only SQL Version

Back to Table of Contents


3rd Party Tools

Dataiku

Dataiku

Discusses how the 3rd party tool DataIku can be used with Vantage.
Information Only

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python VersionRead Only Python Version

Back to Table of Contents

R

10 ModelOps Add R gbm Model to Project

Uses the gbm R package to generate both R model formats and operationalize through ModelOps. The gbm R packages extends Freund & Schapire's AadaBoost algorithm and Friedman's Gradient Boosting Machine (gbm).
Python Version

Back to Table of Contents


©2023 Teradata. All Rights Reserved