Skip to content

Teradata/jupyter-demos

Repository files navigation


#ClearScape Analytics™ Demonstrations via Jupyter

Welcome to ClearScape Analytics Experience. This service consists of multiple demonstrations of the industry leading in-database analytics that you can run on your own. You can modify them or use them as examples to use with your own tools against our data or small (not sensitive) data you upload. Each notebook will:

  • describe the business situation,
  • will attach the needed data from the cloud, and
  • walk you step-by-step through the use of the ClearScape Analytics functionality.

These are functional demonstrations executed on a tiny platform with small data, but the same functionality is available on all of our platforms up to one with hundreds of nodes and petabytes of data. ClearScape Analytics allows you to apply AI, ML and advanced statistics to your data without the cost and complexity of exporting data. You can develop sophisticated models on other platforms with your favorite tools and import those models to execute in production at massive scale.

If you've never used Jupyter before, we strongly recommend reviewing the First Time User section of Getting Started. You'll find an introduction video with tips on using this platform. There are also tips for you if you just want to look without programming. If you have questions or issues, click here to send an e-mail to ClearScape Analytics Support.


Table of Contents

Items in italics are coming soon.

Getting Started Industries Business Function Analytic Function 3rd Party Tools
First Time User Azure Cloud Automotive Finance Data Preparation
I am not a programmer Energy & Natural Resources Marketing Descriptive Statistics Azure ML
Developer Information Financial Feature Engineering Celebrus
Healthcare Generative AI Dataiku
Manufacturing Geospatial H2O.ai
Retail Hypothesis testing Microsoft PowerBI
Telco Machine learning MicroStrategy
Travel & Transportation ModelOps R
Defense Object Storage SAP Business Objects
Open-and-connected analytics SAS
Path Analytics Tableau
Text Analysis Vertex AI
Time series analytics AWS Bedrock



Getting Started

First Time User

Back to Table of Contents

Azure Cloud

Getting Started With Azure Beta

Follow these steps to create your first Azure account and create an Azure Machine Learning workspace to complete our AzureML use cases.
InformationInformation

Back to Table of Contents

First Time User

Introduction Video

Video description how to find demos in the index and folder view, tips on running demos and options for foreign vs local tables used in the demonstrations in your ClearScape Analytics environment.
Information

Basic Jupyter Navigation

When running a Jupyter Notebook, there are various indicators that show what is happening. This is a guide to those indicators.
SQL Version

Back to Table of Contents

I am not a programmer

I am not a programmer

Not everyone that uses this site will want to learn programming. Some will want to review the business cases, look at the steps for the analysis and look at the tables, charts and maps. This is a guide for those people.
Information

Back to Table of Contents

Developer Information

Charting and Visualizations using teradataml Beta

The td_plot method streamlines large-scale visualizations by providing users with efficient tools to create visualizations directly within the Vantage platform
Python Version

Data Loading (Python)

Shows how to use python to load CSV data from local storage and from zipped files
Python Version

Data Loading (SQL)

Shows multiple ways to load data from local CSV files, and cloud files on Google and AWS using the SQL kernel.
SQL Version

Query Service REST API

Demonstration of using REST API calls to Vantage which is useful for web or mobile applications to access and maintain data.
Python Version

Using ClearScape Analytics with openAI Beta

To ensure optimal utilization of the OpenAI API in generative AI notebooks, it is essential to establish the API keys correctly. This concise guide outlines the process of configuring OpenAI API keys for seamless integration across multiple notebooks.
Information

SQL Basics in Jupyter

This guide will walk you through writing your first SQL queries in Jupyter. It uses some of the Vantage system tables as a source for the queries.
SQL Version

teradataml Python Basics

Introduction to Teradataml package for Python including connecting to Vantage, Teradata DataFrames, data manipulation and export to Pandas.
Python Version

Intro to Panda for Python

Provides step-by-step instructions on the basics of using Python Pandas with Jupyter notebooks.
Python Version

Charting and Visualization

Data from queries is brought to life with graphics and charts. This shows how to use the %chart magic command to display results.
SQL Version

VAL Overview

Vantage Analytics Library (VAL) is a set of over 50 functions for advanced analytics. This provides an overview and links to an 8 minute video overview.
SQL Version

Data Dictionary

This provides an index to all of the databases used by demo notebooks on this machine, allowing you to use that data for your own notebooks or BI tools.
Python Version

How to Submit Your Demos

It is very easy to submit your demo for publication. Tell us directory with the notebook and referenced files and grant us access to your database. We'll take it from there.
Python VersionVideo

Back to Table of Contents


Industries

Automotive

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionVideo

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL Version

Car Complaints

Uses geospatial techniques to locate service centers close to the complaint and outlier detection to detect part defects earlier than expected.
Python-SQL Version

Back to Table of Contents

Energy & Natural Resources

Carbon Footprint Analytics

A key component of ESG is Carbon Footprint. This demonstrates a part of a solution available from Teradata to integrate multiple data sources to calculate carbon footprint of various corporate activities.
Python-SQL Version

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python Version

Energy Consumption Forecasting using AzureML

This leverages the power of AzureML and Teradata Vantage to enhance our machine learning capabilities and enable scalable model scoring to forecast energy consumption.
Python Version

Flood Proximity to Climate Risk Analysis

This allows enterprises to rapidly analyze this geographic-related information in real-time at any scale - effectively understanding the impacts of these climate events on entire populations.
Python Version

Back to Table of Contents

Financial

Anomaly Detection of Outstanding Amounts Beta

Anomaly detection in bank accounts can help in identifying unusual patterns, potentially flagging issues like errors or fraudulent activities. Enterprise re-use of Features ensures rapid creation & deployment of models while new Features can be created and used without extensive engineering support.
Python Version

Credit Card Data Preparation

Uses functions from TeradataML Python library to prepare data for analysis using data cleansing, exploration and feature engineering functions.
Python Version

Mortage Calculator using GenAI

Use Retrieval-Augmented Generation (RAG), Lanchain and LLM models to as questions about loans and retrieve relevant data from Vantage.
Python Version

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python Version

Cash Demand Forecasing

Predicts the future demand for cast in Automatic Teller Machines (ATMs) using Auto Regressive Integrated Moving Average (ARIMA) using In-Database functions.
SQL Version

Consumer Complaints

Uses government consumer complaint data with SQL queries and visualizations to identify source of top complaints.
SQL Version

Credit Card Data Preparation

This shows the use of ClearScape analytics to reduce the pre-processing effort of incoming raw credit card data to prepare for analysis of potential loan defaulters.
SQL Version

Digital Identity Management

Combines ClearScape Analytics with Celebrus to track unique customers across web sessions and devices to drive personalized experiences
SQL Version

Financial Customer Journey

Uses analytic techniques to find new customers, measure marketing attribution, and maximizing marketing effectiveness
Python-SQL VersionSQL VersionVideo

Financial Fraud Detection InDB

Detect financial transaction fraud using powerful in-database machine learning functions
Python VersionVideo

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python Version

Flood Proximity to Climate Risk Analysis

This allows enterprises to rapidly analyze this geographic-related information in real-time at any scale - effectively understanding the impacts of these climate events on entire populations.
Python Version

Insurance Policy Temporal

Show As-IS/As-Was capabilities of Vantage Temporal to dramatically simplify the SQL and improve performance for analyzing insurance policies versus claims.
SQL Version

Back to Table of Contents

Healthcare

02 ModelOps Explore Diabetes Data

This is a guide through the PIMA Diabetes prediction including data exploration and model experimentation.
Python Version

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

Data Science 101 with Python

This walks through the Cross-Industry Standard Process for Data Mining (CRISP-DM) from data understanding through modeling and evaluation.
Python Version

Diabetes Prediction via BYOM H2O

This uses BYOM to import a trained H2O model to identify potential diabetes patients. BYOM allows the data scientist to create models in languages they prefer and run at scale inside Vantage without moving data.
Python-SQL Version

Diabetes Prediction via DF and GLM

Decision Forest and Generalized Linear Model are applied to identify factors that indicate diabetes. The earlier the disease is identified, the better the chance of reducing organ damage.
Python Version

Heart Failure Prediction

Machine learning is applied to the complex attributes of patients to help recognize patterns that may lead to heart failure faster than a human may recognize.
Python-SQL Version

Knee Replacement Attribution

The ClearScape Analytics Attribution function is used to determine the weight of various events that precede the final outcome, in this example, knee replacement.
SQL Version

Knee Replacement nPath

This uses the ClearScape Analytics nPath® function to provide visuals on the events leading up to the final outcome, in this case, knee replacement.
Python Version

Parkinson's Disease Prediction

This uses analytics to determine which biomedical voice measurements are significant in identifying people potentially with Parkinson's
Python-SQL Version

Cancer Prediction using the TDAPIClient and VertexAI Beta

Use TDAPIClient to connect to the Google Cloud Vertex API Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Cancer Prediction using Teradata and the SageMaker API Beta

Use the Vantage SageMaker API feature to connect to an Amazon Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Back to Table of Contents

Manufacturing

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionVideo

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL Version

Car Complaints

Uses geospatial techniques to locate service centers close to the complaint and outlier detection to detect part defects earlier than expected.
Python-SQL Version

Green Manufacturing

Uses analytic and ML techniques to predict how long vehicle testing will take based on combination of features installed.
Python Version

Predictive Maintenance

Uses ML functions to predict failures to identify and mitigate potential machine failures before they occur.
Python-SQL Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL Version

Back to Table of Contents

Retail

Natural Language Processing Beta

Use NLP for Sentiment Analysis, perform Kmeans clustering, execute Principal Component Analysis (PCA) using comments received by a Retail Store
Python-SQL Version

Product Recommentations via TDApiClient Beta

Build a product recommendation system using OpenAI embeddings and Vantage in db_function VectorDistance. We will also use Vantage as VectorDB, to store the embeddings.
Python Version

Broken Digital Journey

This uses nPath® analysis to show the retail events that prevent the ultimate objective: a purchase. This uses interactive Sankey diagrams to understand the problems.
Python VersionVideo

Competitor Proximity Geospatial Analysis

This looks at the impact on purchasing when people that shop at our stores live within reasonable drive time to a competitor. This could identify proximity based marketing tactics to target larger promotions to those customers to increase share of their spend.
Python Version

Customer Behavior Analysis

Analysis of customer purchase behavior using nPath® analysis in Python with visualization using Sankey diagrams.
Python VersionSQL Version

Deep History via Object Store

This demonstrates integration of local data or foreign tables on GCP and integration across cloud providers to detailed historical sales records on AWS.
SQL Version

Hyper-Personalization

Hyper-personalization creates models from customer iteractions on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL Version

K-Means Clustering and ML model

This uses the unsupervised K-Means ML algorithm to identify patterns in retail purchases.
SQL Version

Marketing Campaign Effectiveness

Examines the results of campaigns by various customer attributes then uses correlation, outlier elimination, and machine learning to identify the best campaigns.
Python-SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL Version

Product Recommendations via Open Source

Uses FlagEmbedding from HuggingFace plus Vantage in-database functions to assess the vector distance between the product entered and similar products.
Python Version

Product Recommendations via OpenAI

Uses OpenAIEmbeddings and Vantage in-database function to assists consumers in receiving product recommendations
Python Version

Retail Demand Forecasting

This creates an ARIMA time series model based on holidays and merchandising activities impacting store sales for a hypermarket retailer.
Python Version

Retail Item Demand Forecast

Predicts demand for retail products showing how multiple models can be run concurrently.
Python-SQL Version

Sales Forecasting using UAF

A detailed coverage of the analytic steps in sales forecasting including data preparation, exploration, seasonal normalizing, model creation, validation, and forecasting.
Python Version

Store Sales Forecast via ARIMA

Forecasts total store sales using ARIMA (AutoRegressive Integrated Moving Average)
Python Version

Store Sales Forecast via Prophet

Uses the Script Table Operator (STO) to run the H20 machine learning library as an extension to ClearScape Analytics. H20 runs in parallel inside the Vantage database without exporting data to another platform.
Python-SQL Version

Text Term Frequency

Use NGram splitter to analyze comments retail products to determine patterns of words used to describe products.
Python VersionPython-SQL VersionSQL Version

Product Recommendation via AWS Bedrock Beta

Use AWS Bedrock, Embedding from HuggingFace and Vantage in-DB functions to assist in providing product recommendations to develop a recipe assistant chatbot.
Python Version

Back to Table of Contents

Telco

Telco 5G Quality of Experience

Quality of service analysis of 5G vs 4G video for different situations such as Indoor, outdoor, and moble.
SQL Version

Graph Analysis of CDR Records

Uses graph analysis to identify communities and key influencers within Call Data Records. This uses Script Table Operator to invoke external procedures to work inside the Vantage database without exporting data.
Python-SQL Version

Telco Customer Churn

This uses logistic regression for supervised learning to predict the probability of a customer switching vendors based on usage patterns, billing information demographics and interactions. XGBoost is then used in database to improve the prediction.
Python VersionVideo

Telco Network Coverage

Demonstrates the ability of Geospatial to show signal strength, coverage areas and travel path of customers through cell tower coverage area.
Python-SQL Version

Back to Table of Contents

Travel & Transportation

Chatbot using GenAI with insurance PDF document

Uses TD_VectorDistance in Vantage to create a chatbot using Retrieval-Augmented Generation (RAG), Langchain, and LLM to answer questions using a 24 page travelers insurance policy PDF as source data to answer questions about coverage.
Python VersionPython-SQL Version

Shipping Time Prediction Beta

Use ClearScape Analytics to develop a robust system that can reliably estimate delivery dates, accounting for handling time, transit time, and other relevant variables affecting the actual delivery timeframe.
Python-SQL Version

Supply Chain Visibility

Shows the integration of data from warehouse, trucking company, and customer Order Management Systems (OMS) using temporal to reconcile different update frequencies and show rerouting when warehouse is unavailable.
Python Version

Vehicle Routing for Delivery

Demonstrates advanced analytics to find lowest cost routes to deliver a large volume of packages by a fleet of vehicles.
Python-SQL Version

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
Python-SQL VersionSQL Version

Air Passenger Forecasting

Applies Auto Regressive Integrated Moving Average (ARIMA) analysis to forecast airplane passenger volume.
SQL Version

NYC Taxi Geospatial

Applies geospatial functions to analysis of NYC Taxi pickups and drop off locations.
Python-SQL Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL Version

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python Version

Back to Table of Contents

Defense

Signal Processing and Classification Beta

Use the Unbounded Array Framework in ClearScape Analytics to classify sonar signals. Extract and engineer features to use in training and scoring our models.
Python Version

Back to Table of Contents


Business Function

Marketing

Competitor Proximity Geospatial Analysis

This looks at the impact on purchasing when people that shop at our stores live within reasonable drive time to a competitor. This could identify proximity based marketing tactics to target larger promotions to those customers to increase share of their spend.
Python Version

Customer 360

This integrates data about customers from multiple sources using multiple matching techniquest to create the "Golden Customer Record" and calculate LTV.
SQL Version

Digital Identity Management

Combines ClearScape Analytics with Celebrus to track unique customers across web sessions and devices to drive personalized experiences
SQL Version

Generative AI Question Answering

Uses OpenAI to translate english language questions into SQL queries to run against a table of marketing data. This includes a link to a notebook with instructions for getting the required OpenAI API key.
Python VersionVideo

Hyper-Personalization

Hyper-personalization creates models from customer iteractions on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL Version

Marketing Campaign Effectiveness

Examines the results of campaigns by various customer attributes then uses correlation, outlier elimination, and machine learning to identify the best campaigns.
Python-SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL Version

Store Sales Forecast via ARIMA

Forecasts total store sales using ARIMA (AutoRegressive Integrated Moving Average)
Python Version

Back to Table of Contents


Analytic Function

Data Preparation

Credit Card Data Preparation

Uses functions from TeradataML Python library to prepare data for analysis using data cleansing, exploration and feature engineering functions.
Python VersionSQL Version

Data Prep and Transformation

This demonstrates a subset of the over 100 analytic functions in the teradataml package for Python
Python VersionPython-SQL Version

Outlier Analysis

Outliers in an analysis skew the results and make it difficult to recognize the main patterns. ClearScape Analytics has tools to remove outliers automatically.
SQL Version

Back to Table of Contents

Descriptive Statistics

Sales Forecasting using UAF

A detailed coverage of the analytic steps in sales forecasting including data preparation, exploration, seasonal normalizing, model creation, validation, and forecasting.
Python VersionPython Version

VAL Descriptive Statistics

This performs in-database analysis of data values, distribution, histograms, and text field analysis using SQL to access the Vantage Analytics Library.
SQL Version

VAL teradataml Demo

Demonstrated the use of Teradataml in Python to perform descriptive statistics, transformation, model building model evaluation and scoring.
Python Version

Back to Table of Contents

Generative AI

Chatbot using GenAI with insurance PDF document

Uses TD_VectorDistance in Vantage to create a chatbot using Retrieval-Augmented Generation (RAG), Langchain, and LLM to answer questions using a 24 page travelers insurance policy PDF as source data to answer questions about coverage.
Python VersionPython-SQL Version

Mortage Calculator using GenAI

Use Retrieval-Augmented Generation (RAG), Lanchain and LLM models to as questions about loans and retrieve relevant data from Vantage.
Python Version

Product Recommentations via TDApiClient Beta

Build a product recommendation system using OpenAI embeddings and Vantage in db_function VectorDistance. We will also use Vantage as VectorDB, to store the embeddings.
Python Version

Generative AI Question Answering

Uses OpenAI to translate english language questions into SQL queries to run against a table of marketing data. This includes a link to a notebook with instructions for getting the required OpenAI API key.
Python VersionVideo

Product Recommendations via Open Source

Uses FlagEmbedding from HuggingFace plus Vantage in-database functions to assess the vector distance between the product entered and similar products.
Python Version

Product Recommendations via OpenAI

Uses OpenAIEmbeddings and Vantage in-database function to assists consumers in receiving product recommendations
Python Version

Grocery Recommendations using GenAI Beta

Use context-based product recommendations powered by GenAI Large Language Models to enhance a shopping experience.
Python-SQL Version

Product Recommendation via AWS Bedrock Beta

Use AWS Bedrock, Embedding from HuggingFace and Vantage in-DB functions to assist in providing product recommendations to develop a recipe assistant chatbot.
Python Version

Back to Table of Contents

Geospatial

Supply Chain Visibility

Shows the integration of data from warehouse, trucking company, and customer Order Management Systems (OMS) using temporal to reconcile different update frequencies and show rerouting when warehouse is unavailable.
Python Version

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
Python-SQL VersionSQL Version

Car Complaints

Uses geospatial techniques to locate service centers close to the complaint and outlier detection to detect part defects earlier than expected.
Python-SQL Version

Competitor Proximity Geospatial Analysis

This looks at the impact on purchasing when people that shop at our stores live within reasonable drive time to a competitor. This could identify proximity based marketing tactics to target larger promotions to those customers to increase share of their spend.
Python Version

Flood Proximity to Climate Risk Analysis

This allows enterprises to rapidly analyze this geographic-related information in real-time at any scale - effectively understanding the impacts of these climate events on entire populations.
Python Version

NYC Taxi Geospatial

Applies geospatial functions to analysis of NYC Taxi pickups and drop off locations.
Python-SQL Version

Sensor Data Analytics

Creative application of geospatial to locations of sensors in a research lab and integration of data from tables with detailed recordings on cloud storage.
Python-SQL Version

Telco Network Coverage

Demonstrates the ability of Geospatial to show signal strength, coverage areas and travel path of customers through cell tower coverage area.
Python-SQL Version

Back to Table of Contents

Hypothesis testing

Competitor Proximity Geospatial Analysis

This looks at the impact on purchasing when people that shop at our stores live within reasonable drive time to a competitor. This could identify proximity based marketing tactics to target larger promotions to those customers to increase share of their spend.
Python Version

Data Science 101 with Python

This walks through the Cross-Industry Standard Process for Data Mining (CRISP-DM) from data understanding through modeling and evaluation.
Python Version

Sales Forecasting using UAF

A detailed coverage of the analytic steps in sales forecasting including data preparation, exploration, seasonal normalizing, model creation, validation, and forecasting.
Python Version

VAL Hypothesis Tests

This demonstrates a subset of the 18 hypothesis test in the Vantage Analytics library using SQL such as Parametric, Binomial, Kolmogorov/Smirnoff, Rank, etc.
SQL Version

Back to Table of Contents

Machine learning

Anomaly Detection of Outstanding Amounts Beta

Anomaly detection in bank accounts can help in identifying unusual patterns, potentially flagging issues like errors or fraudulent activities. Enterprise re-use of Features ensures rapid creation & deployment of models while new Features can be created and used without extensive engineering support.
Python Version

Sales Forecasting with Vantage vs SAS

This demo walks through how a typical SAS user would use sales data to build a simple sales forecasting model and then will showcase how we can achieve the same using Vantage ClearScape Analytics.
Python-SQL VersionSQL Version

Shipping Time Prediction Beta

Use ClearScape Analytics to develop a robust system that can reliably estimate delivery dates, accounting for handling time, transit time, and other relevant variables affecting the actual delivery timeframe.
Python-SQL Version

Telco 5G Quality of Experience

Quality of service analysis of 5G vs 4G video for different situations such as Indoor, outdoor, and moble.
SQL Version

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionVideo

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python Version

Diabetes Prediction via DF and GLM

Decision Forest and Generalized Linear Model are applied to identify factors that indicate diabetes. The earlier the disease is identified, the better the chance of reducing organ damage.
Python Version

Financial Fraud Detection InDB

Detect financial transaction fraud using powerful in-database machine learning functions
Python VersionVideo

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python Version

Graph Analysis of CDR Records

Uses graph analysis to identify communities and key influencers within Call Data Records. This uses Script Table Operator to invoke external procedures to work inside the Vantage database without exporting data.
Python-SQL Version

Green Manufacturing

Uses analytic and ML techniques to predict how long vehicle testing will take based on combination of features installed.
Python Version

Heart Failure Prediction

Machine learning is applied to the complex attributes of patients to help recognize patterns that may lead to heart failure faster than a human may recognize.
Python-SQL Version

Hyper-Personalization

Hyper-personalization creates models from customer iteractions on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL Version

K-Means Clustering and ML model

This uses the unsupervised K-Means ML algorithm to identify patterns in retail purchases.
SQL Version

Marketing Campaign Effectiveness

Examines the results of campaigns by various customer attributes then uses correlation, outlier elimination, and machine learning to identify the best campaigns.
Python-SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL Version

Parkinson's Disease Prediction

This uses analytics to determine which biomedical voice measurements are significant in identifying people potentially with Parkinson's
Python-SQL Version

Predictive Maintenance

Uses ML functions to predict failures to identify and mitigate potential machine failures before they occur.
Python-SQL Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL Version

Retail Item Demand Forecast

Predicts demand for retail products showing how multiple models can be run concurrently.
Python-SQL Version

Store Sales Forecast via ARIMA

Forecasts total store sales using ARIMA (AutoRegressive Integrated Moving Average)
Python Version

Store Sales Forecast via Prophet

Uses the Script Table Operator (STO) to run the H20 machine learning library as an extension to ClearScape Analytics. H20 runs in parallel inside the Vantage database without exporting data to another platform.
Python-SQL Version

Telco Customer Churn

This uses logistic regression for supervised learning to predict the probability of a customer switching vendors based on usage patterns, billing information demographics and interactions. XGBoost is then used in database to improve the prediction.
Python VersionVideo

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python Version

VAL Analytics and ML

Demonstration of Vantage Analytic Library scoring and analytic functions like linear regression, decision trees, K-Means clustering, Factor Analysis, etc.
SQL Version

Vertex AI Integration

In this tutorial, we will show how to integrate Vantage Analytics capabilities in Vertex AI ML Pipelines. Vertex AI is the environment for data scientists to deploy ML models.
Python Version

Hyperparameter Tuning using the Titanic Passenger Dataset Beta

Create a predictive algorithm that can identify whethr or not passengers on the Titanic survived the ship's sinking.
Python Version

Cancer Prediction using the TDAPIClient and VertexAI Beta

Use TDAPIClient to connect to the Google Cloud Vertex API Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Cancer Prediction using Teradata and the SageMaker API Beta

Use the Vantage SageMaker API feature to connect to an Amazon Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Back to Table of Contents

ModelOps

01 ModelOps Getting Started

This introduces the ModelOps methodology, provides an overview video, and a description of navigating the projects, models, and datasets plus a description of monitoring capabilities.
Python Version

02 ModelOps Explore Diabetes Data

This is a guide through the PIMA Diabetes prediction including data exploration and model experimentation.
Python Version

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

06 ModelOps Project Setup

Shows you how to set up your own GIT repository for models and create a new project in ModelOps associated with your new repository. This step is required for the next notebooks.
Python Version

07 ModelOps Define Functions

For the project you've created in ModelOps, this shows definition of the training function, evaluate function, scoring function, life cycle, and monitoring.
Python Version

08 ModelOps Add H20 to Project

Demonstrates the use of ModelOps to finalize the H2O AI model, train, evaluate, approve, deploy, score and monitor.
Python Version

09 ModelOps Add XGBoost to Project

Uses XGBoost algorithm to generate both Python Joblib and PMML model formats and operationalize them through ModelOps.
Python Version

10 ModelOps Add R gbm Model to Project

Uses the gbm R package to generate both R model formats and operationalize through ModelOps. The gbm R packages extends Freund & Schapire's AadaBoost algorithm and Friedman's Gradient Boosting Machine (gbm).
Python Version

11 ModelOps In-Database XGBoost using Git

This notebook will cover the Operationalization of the PIMA diabetes use case with Python using the Teradata In-database XGBoost model.
Python Version

ModelOps Introduction & List of Notebooks

This introduction and table of contents introduces you to ModelOps and provides a launch for ModelOps. It is recommended to go through ModelOps demonstrations in sequence.
Information

Back to Table of Contents

Object Storage

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL Version

Deep History via Object Store

This demonstrates integration of local data or foreign tables on GCP and integration across cloud providers to detailed historical sales records on AWS.
SQL Version

Sensor Data Analytics

Creative application of geospatial to locations of sensors in a research lab and integration of data from tables with detailed recordings on cloud storage.
Python-SQL Version

Back to Table of Contents

Open-and-connected analytics

Vehicle Routing for Delivery

Demonstrates advanced analytics to find lowest cost routes to deliver a large volume of packages by a fleet of vehicles.
Python-SQL Version

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionVideo

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python Version

Dataiku

Discusses how the 3rd party tool DataIku can be used with Vantage.
Information

Diabetes Prediction via BYOM H2O

This uses BYOM to import a trained H2O model to identify potential diabetes patients. BYOM allows the data scientist to create models in languages they prefer and run at scale inside Vantage without moving data.
Python-SQL Version

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python Version

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python Version

teradataml Python Basics

Introduction to Teradataml package for Python including connecting to Vantage, Teradata DataFrames, data manipulation and export to Pandas.
Python Version

Vertex AI Integration

In this tutorial, we will show how to integrate Vantage Analytics capabilities in Vertex AI ML Pipelines. Vertex AI is the environment for data scientists to deploy ML models.
Python Version

Back to Table of Contents

Path Analytics

Broken Digital Journey

This uses nPath® analysis to show the retail events that prevent the ultimate objective: a purchase. This uses interactive Sankey diagrams to understand the problems.
Python VersionVideo

Customer Behavior Analysis

Analysis of customer purchase behavior using nPath® analysis in Python with visualization using Sankey diagrams.
Python VersionSQL Version

Financial Customer Journey

Uses analytic techniques to find new customers, measure marketing attribution, and maximizing marketing effectiveness
Python-SQL VersionSQL VersionVideo

Knee Replacement Attribution

The ClearScape Analytics Attribution function is used to determine the weight of various events that precede the final outcome, in this example, knee replacement.
SQL Version

Knee Replacement nPath

This uses the ClearScape Analytics nPath® function to provide visuals on the events leading up to the final outcome, in this case, knee replacement.
Python Version

MultiTouch Attribution

Shows rule-based, Statistics, and Algorithmic attribution of the marketing touchpoints leading to conversion. Ten approaches will be demonstrated along with path analysis of effectiveness and cost of conversion.
Python-SQL Version

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python Version

Back to Table of Contents

Text Analysis

Natural Language Processing Beta

Use NLP for Sentiment Analysis, perform Kmeans clustering, execute Principal Component Analysis (PCA) using comments received by a Retail Store
Python-SQL Version

Customer 360

This integrates data about customers from multiple sources using multiple matching techniquest to create the "Golden Customer Record" and calculate LTV.
SQL Version

Text Term Frequency

Use NGram splitter to analyze comments retail products to determine patterns of words used to describe products.
Python VersionPython-SQL VersionSQL Version

VAL Descriptive Statistics

This performs in-database analysis of data values, distribution, histograms, and text field analysis using SQL to access the Vantage Analytics Library.
SQL Version

Back to Table of Contents

Time series analytics

Supply Chain Visibility

Shows the integration of data from warehouse, trucking company, and customer Order Management Systems (OMS) using temporal to reconcile different update frequencies and show rerouting when warehouse is unavailable.
Python Version

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
Python-SQL VersionSQL Version

Air Passenger Forecasting

Applies Auto Regressive Integrated Moving Average (ARIMA) analysis to forecast airplane passenger volume.
SQL Version

Carbon Footprint Analytics

A key component of ESG is Carbon Footprint. This demonstrates a part of a solution available from Teradata to integrate multiple data sources to calculate carbon footprint of various corporate activities.
Python-SQL Version

Cash Demand Forecasing

Predicts the future demand for cast in Automatic Teller Machines (ATMs) using Auto Regressive Integrated Moving Average (ARIMA) using In-Database functions.
SQL Version

Consumer Complaints

Uses government consumer complaint data with SQL queries and visualizations to identify source of top complaints.
SQL Version

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Fourier Transforms

Fourier transformations are demonstrated to filter out noise from signals to allow identification of underlying patterns.
SQL Version

Insurance Policy Temporal

Show As-IS/As-Was capabilities of Vantage Temporal to dramatically simplify the SQL and improve performance for analyzing insurance policies versus claims.
SQL Version

NYC Taxi Temporal

Time series data can answer the questions about what was happening at a point in time. This applies Time series and temporal capabilities of vantage to NYC taxi data.
SQL Version

Retail Demand Forecasting

This creates an ARIMA time series model based on holidays and merchandising activities impacting store sales for a hypermarket retailer.
Python Version

Vantage Query Log Analysis

Analysis of sessions and queries you executed using the built-in logging facilities of Vantage.
SQL Version

Back to Table of Contents

Unbounded Array Framework

Signal Processing and Classification Beta

Use the Unbounded Array Framework in ClearScape Analytics to classify sonar signals. Extract and engineer features to use in training and scoring our models.
Python Version

Back to Table of Contents

Prediction Analysis

Cancer Prediction using the TDAPIClient and VertexAI Beta

Use TDAPIClient to connect to the Google Cloud Vertex API Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Back to Table of Contents


3rd Party Tools

AWS SageMaker

Cancer Prediction using Teradata and the SageMaker API Beta

Use the Vantage SageMaker API feature to connect to an Amazon Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python VersionPython Version

Back to Table of Contents

Azure ML

Energy Consumption Forecasting using AzureML

This leverages the power of AzureML and Teradata Vantage to enhance our machine learning capabilities and enable scalable model scoring to forecast energy consumption.
Python Version

Back to Table of Contents

Celebrus

Digital Identity Management

Combines ClearScape Analytics with Celebrus to track unique customers across web sessions and devices to drive personalized experiences
SQL Version

Back to Table of Contents

Dataiku

Dataiku

Discusses how the 3rd party tool DataIku can be used with Vantage.
Information

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python Version

Back to Table of Contents

H2O.ai

Back to Table of Contents

R

ModelOps Introduction & List of Notebooks

This introduction and table of contents introduces you to ModelOps and provides a launch for ModelOps. It is recommended to go through ModelOps demonstrations in sequence.
InformationInformation

10 ModelOps Add R gbm Model to Project

Uses the gbm R package to generate both R model formats and operationalize through ModelOps. The gbm R packages extends Freund & Schapire's AadaBoost algorithm and Friedman's Gradient Boosting Machine (gbm).
Python Version

Back to Table of Contents

SAS

Sales Forecasting with Vantage vs SAS

This demo walks through how a typical SAS user would use sales data to build a simple sales forecasting model and then will showcase how we can achieve the same using Vantage ClearScape Analytics.
Python-SQL VersionSQL Version

Back to Table of Contents

AWS Bedrock

Product Recommendation via AWS Bedrock Beta

Use AWS Bedrock, Embedding from HuggingFace and Vantage in-DB functions to assist in providing product recommendations to develop a recipe assistant chatbot.
Python Version

Back to Table of Contents


Language

Python

Anomaly Detection of Outstanding Amounts Beta

Anomaly detection in bank accounts can help in identifying unusual patterns, potentially flagging issues like errors or fraudulent activities. Enterprise re-use of Features ensures rapid creation & deployment of models while new Features can be created and used without extensive engineering support.
Python Version

Charting and Visualizations using teradataml Beta

The td_plot method streamlines large-scale visualizations by providing users with efficient tools to create visualizations directly within the Vantage platform
Python Version

Chatbot using GenAI with insurance PDF document

Uses TD_VectorDistance in Vantage to create a chatbot using Retrieval-Augmented Generation (RAG), Langchain, and LLM to answer questions using a 24 page travelers insurance policy PDF as source data to answer questions about coverage.
Python VersionPython-SQL Version

Credit Card Data Preparation

Uses functions from TeradataML Python library to prepare data for analysis using data cleansing, exploration and feature engineering functions.
Python Version

ModelOps Introduction & List of Notebooks

This introduction and table of contents introduces you to ModelOps and provides a launch for ModelOps. It is recommended to go through ModelOps demonstrations in sequence.
Information

Mortage Calculator using GenAI

Use Retrieval-Augmented Generation (RAG), Lanchain and LLM models to as questions about loans and retrieve relevant data from Vantage.
Python Version

Natural Language Processing Beta

Use NLP for Sentiment Analysis, perform Kmeans clustering, execute Principal Component Analysis (PCA) using comments received by a Retail Store
Python-SQL Version

Product Recommentations via TDApiClient Beta

Build a product recommendation system using OpenAI embeddings and Vantage in db_function VectorDistance. We will also use Vantage as VectorDB, to store the embeddings.
Python Version

Sales Forecasting with Vantage vs SAS

This demo walks through how a typical SAS user would use sales data to build a simple sales forecasting model and then will showcase how we can achieve the same using Vantage ClearScape Analytics.
Python-SQL VersionSQL Version

Shipping Time Prediction Beta

Use ClearScape Analytics to develop a robust system that can reliably estimate delivery dates, accounting for handling time, transit time, and other relevant variables affecting the actual delivery timeframe.
Python-SQL Version

Supply Chain Visibility

Shows the integration of data from warehouse, trucking company, and customer Order Management Systems (OMS) using temporal to reconcile different update frequencies and show rerouting when warehouse is unavailable.
Python Version

Vehicle Routing for Delivery

Demonstrates advanced analytics to find lowest cost routes to deliver a large volume of packages by a fleet of vehicles.
Python-SQL Version

11 ModelOps In-Database XGBoost using Git

This notebook will cover the Operationalization of the PIMA diabetes use case with Python using the Teradata In-database XGBoost model.
Python Version

Grocery Recommendations using GenAI Beta

Use context-based product recommendations powered by GenAI Large Language Models to enhance a shopping experience.
Python-SQL Version

Hyperparameter Tuning using the Titanic Passenger Dataset Beta

Create a predictive algorithm that can identify whethr or not passengers on the Titanic survived the ship's sinking.
Python Version

Signal Processing and Classification Beta

Use the Unbounded Array Framework in ClearScape Analytics to classify sonar signals. Extract and engineer features to use in training and scoring our models.
Python Version

Cancer Prediction using the TDAPIClient and VertexAI Beta

Use TDAPIClient to connect to the Google Cloud Vertex API Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Product Recommendation via AWS Bedrock Beta

Use AWS Bedrock, Embedding from HuggingFace and Vantage in-DB functions to assist in providing product recommendations to develop a recipe assistant chatbot.
Python Version

01 ModelOps Getting Started

This introduces the ModelOps methodology, provides an overview video, and a description of navigating the projects, models, and datasets plus a description of monitoring capabilities.
Python Version

02 ModelOps Explore Diabetes Data

This is a guide through the PIMA Diabetes prediction including data exploration and model experimentation.
Python Version

03 ModelOps Operationalize PMML

Covers the ModelOps operationalizing of Bring-your-own-model (BOYM) importing a model via PMML. PMML allows exchange predictive models produced by data mining and machine learning algorithms.
Python Version

04 Modelops Operationalize ONNX

Covers the ModelOps operationalizing of the ONNX model format for BYOM for the Diabetes use case. ONNX is an efficient model format primarily related to neural networks.
Python Version

05 ModelOps Operationalize H2O

Covers the ModelOps operationalizing of the H2O model format for BYOM for the Diabetes use case. H2O is an open source, distributed in-memory machine learning library with linear scalability.
Python Version

06 ModelOps Project Setup

Shows you how to set up your own GIT repository for models and create a new project in ModelOps associated with your new repository. This step is required for the next notebooks.
Python Version

07 ModelOps Define Functions

For the project you've created in ModelOps, this shows definition of the training function, evaluate function, scoring function, life cycle, and monitoring.
Python Version

08 ModelOps Add H20 to Project

Demonstrates the use of ModelOps to finalize the H2O AI model, train, evaluate, approve, deploy, score and monitor.
Python Version

09 ModelOps Add XGBoost to Project

Uses XGBoost algorithm to generate both Python Joblib and PMML model formats and operationalize them through ModelOps.
Python Version

10 ModelOps Add R gbm Model to Project

Uses the gbm R package to generate both R model formats and operationalize through ModelOps. The gbm R packages extends Freund & Schapire's AadaBoost algorithm and Friedman's Gradient Boosting Machine (gbm).
Python Version

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
Python-SQL VersionSQL Version

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionVideo

Banking Customer Churn

Uses a combination of Vantage Analytics Library to prepare data, using machine learning in python and importing the resulting PMML model into Vantage for scoring.
Python Version

Battery Defect Analysis

Uses local data (or foreign tables on GCP) to analyze patterns of battery failure, then links to data on AWS for detailed battery measurement.
Python-SQL Version

Broken Digital Journey

This uses nPath® analysis to show the retail events that prevent the ultimate objective: a purchase. This uses interactive Sankey diagrams to understand the problems.
Python VersionVideo

Car Complaints

Uses geospatial techniques to locate service centers close to the complaint and outlier detection to detect part defects earlier than expected.
Python-SQL Version

Carbon Footprint Analytics

A key component of ESG is Carbon Footprint. This demonstrates a part of a solution available from Teradata to integrate multiple data sources to calculate carbon footprint of various corporate activities.
Python-SQL Version

Competitor Proximity Geospatial Analysis

This looks at the impact on purchasing when people that shop at our stores live within reasonable drive time to a competitor. This could identify proximity based marketing tactics to target larger promotions to those customers to increase share of their spend.
Python Version

Customer Behavior Analysis

Analysis of customer purchase behavior using nPath® analysis in Python with visualization using Sankey diagrams.
Python VersionSQL Version

Data Dictionary

This provides an index to all of the databases used by demo notebooks on this machine, allowing you to use that data for your own notebooks or BI tools.
Python Version

Data Dictionary Raw

This provides linkage to a larger set of databases and tables than are currently used by the demos in Jupyter.
Python Version

Data Loading (Python)

Shows how to use python to load CSV data from local storage and from zipped files
Python Version

Data Prep and Transformation

This demonstrates a subset of the over 100 analytic functions in the teradataml package for Python
Python VersionPython-SQL Version

Data Science 101 with Python

This walks through the Cross-Industry Standard Process for Data Mining (CRISP-DM) from data understanding through modeling and evaluation.
Python Version

Diabetes Prediction via BYOM H2O

This uses BYOM to import a trained H2O model to identify potential diabetes patients. BYOM allows the data scientist to create models in languages they prefer and run at scale inside Vantage without moving data.
Python-SQL Version

Diabetes Prediction via DF and GLM

Decision Forest and Generalized Linear Model are applied to identify factors that indicate diabetes. The earlier the disease is identified, the better the chance of reducing organ damage.
Python Version

Energy Consumption Forecasting

This combines machine learning and BYOM to forecast energy consumption using Vantage to score the model at scale without having to export all data.
Python Version

Energy Consumption Forecasting Dataiku

Demonstration of using Dataiku with Vantage. Instructions provided for use with your Dataiku copy + screen shots if you don't have Dataiku. PMML model from Dataiku is imported to Vantage for execution and scoring.
Python Version

Energy Consumption Forecasting using AzureML

This leverages the power of AzureML and Teradata Vantage to enhance our machine learning capabilities and enable scalable model scoring to forecast energy consumption.
Python Version

Financial Customer Journey

Uses analytic techniques to find new customers, measure marketing attribution, and maximizing marketing effectiveness
Python-SQL VersionSQL VersionVideo

Financial Fraud Detection InDB

Detect financial transaction fraud using powerful in-database machine learning functions
Python VersionVideo

Financial Fraud Detection VIA BYOM

A model that was developed externally is imported into Vantage for evaluation and execution at scale to detect fraud.
Python Version

Flood Proximity to Climate Risk Analysis

This allows enterprises to rapidly analyze this geographic-related information in real-time at any scale - effectively understanding the impacts of these climate events on entire populations.
Python Version

Generative AI Question Answering

Uses OpenAI to translate english language questions into SQL queries to run against a table of marketing data. This includes a link to a notebook with instructions for getting the required OpenAI API key.
Python VersionVideo

Graph Analysis of CDR Records

Uses graph analysis to identify communities and key influencers within Call Data Records. This uses Script Table Operator to invoke external procedures to work inside the Vantage database without exporting data.
Python-SQL Version

Green Manufacturing

Uses analytic and ML techniques to predict how long vehicle testing will take based on combination of features installed.
Python Version

Heart Failure Prediction

Machine learning is applied to the complex attributes of patients to help recognize patterns that may lead to heart failure faster than a human may recognize.
Python-SQL Version

How to Submit Your Demos

It is very easy to submit your demo for publication. Tell us directory with the notebook and referenced files and grant us access to your database. We'll take it from there.
Python VersionVideo

Hyper-Personalization

Hyper-personalization creates models from customer iteractions on multiple channels to determine the "Next Best Offer" for the individual.
Python-SQL Version

Intro to Panda for Python

Provides step-by-step instructions on the basics of using Python Pandas with Jupyter notebooks.
Python Version

Knee Replacement nPath

This uses the ClearScape Analytics nPath® function to provide visuals on the events leading up to the final outcome, in this case, knee replacement.
Python Version

Marketing Campaign Effectiveness

Examines the results of campaigns by various customer attributes then uses correlation, outlier elimination, and machine learning to identify the best campaigns.
Python-SQL Version

Multi-Touch Attribution for Business Analyst

This is a somewhat simplified version of the Multi-Touch Attribution demonstration focused on the interests of the Business Analyst vs the Data Scientist.
Python-SQL Version

Multi-Touch Attribution for Data Scientist

Demonstrates attribution of customer behavior via single touch and multi-touch rule-based models, and using statistical, and algorithmic models. Multiple approaches are demonstrated since each has strengths and limitations.
Python-SQL Version

MultiTouch Attribution

Shows rule-based, Statistics, and Algorithmic attribution of the marketing touchpoints leading to conversion. Ten approaches will be demonstrated along with path analysis of effectiveness and cost of conversion.
Python-SQL Version

NYC Taxi Geospatial

Applies geospatial functions to analysis of NYC Taxi pickups and drop off locations.
Python-SQL Version

Parkinson's Disease Prediction

This uses analytics to determine which biomedical voice measurements are significant in identifying people potentially with Parkinson's
Python-SQL Version

Predictive Maintenance

Uses ML functions to predict failures to identify and mitigate potential machine failures before they occur.
Python-SQL Version

Product Recommendations via Open Source

Uses FlagEmbedding from HuggingFace plus Vantage in-database functions to assess the vector distance between the product entered and similar products.
Python Version

Product Recommendations via OpenAI

Uses OpenAIEmbeddings and Vantage in-database function to assists consumers in receiving product recommendations
Python Version

Query Service REST API

Demonstration of using REST API calls to Vantage which is useful for web or mobile applications to access and maintain data.
Python Version

Remaining Useful Life Forecasting

Applies machine learning to predict Remaining Useful Life (RUL) of jet engines, allowing scheduling of maintenance and replacement before failure occurs and reduces the cost of maintenance and replacement.
Python-SQL Version

Retail Demand Forecasting

This creates an ARIMA time series model based on holidays and merchandising activities impacting store sales for a hypermarket retailer.
Python Version

Retail Item Demand Forecast

Predicts demand for retail products showing how multiple models can be run concurrently.
Python-SQL Version

Sales Forecasting using UAF

A detailed coverage of the analytic steps in sales forecasting including data preparation, exploration, seasonal normalizing, model creation, validation, and forecasting.
Python Version

Sensor Data Analytics

Creative application of geospatial to locations of sensors in a research lab and integration of data from tables with detailed recordings on cloud storage.
Python-SQL Version

Store Sales Forecast via ARIMA

Forecasts total store sales using ARIMA (AutoRegressive Integrated Moving Average)
Python Version

Store Sales Forecast via Prophet

Uses the Script Table Operator (STO) to run the H20 machine learning library as an extension to ClearScape Analytics. H20 runs in parallel inside the Vantage database without exporting data to another platform.
Python-SQL Version

Telco Customer Churn

This uses logistic regression for supervised learning to predict the probability of a customer switching vendors based on usage patterns, billing information demographics and interactions. XGBoost is then used in database to improve the prediction.
Python VersionVideo

Telco Network Coverage

Demonstrates the ability of Geospatial to show signal strength, coverage areas and travel path of customers through cell tower coverage area.
Python-SQL Version

teradataml Python Basics

Introduction to Teradataml package for Python including connecting to Vantage, Teradata DataFrames, data manipulation and export to Pandas.
Python Version

Text Term Frequency

Use NGram splitter to analyze comments retail products to determine patterns of words used to describe products.
Python VersionPython-SQL VersionSQL Version

Train Delay Path Analysis

Uses nPath displays to show relationship of delays and predictive models to anticipate potential delays and enable proactive planning.
Python Version

VAL teradataml Demo

Demonstrated the use of Teradataml in Python to perform descriptive statistics, transformation, model building model evaluation and scoring.
Python Version

Vertex AI Integration

In this tutorial, we will show how to integrate Vantage Analytics capabilities in Vertex AI ML Pipelines. Vertex AI is the environment for data scientists to deploy ML models.
Python Version

Back to Table of Contents

SQL

Chatbot using GenAI with insurance PDF document

Uses TD_VectorDistance in Vantage to create a chatbot using Retrieval-Augmented Generation (RAG), Langchain, and LLM to answer questions using a 24 page travelers insurance policy PDF as source data to answer questions about coverage.
Python VersionPython-SQL Version

Sales Forecasting with Vantage vs SAS

This demo walks through how a typical SAS user would use sales data to build a simple sales forecasting model and then will showcase how we can achieve the same using Vantage ClearScape Analytics.
Python-SQL VersionSQL Version

Shipping Time Prediction Beta

Use ClearScape Analytics to develop a robust system that can reliably estimate delivery dates, accounting for handling time, transit time, and other relevant variables affecting the actual delivery timeframe.
Python-SQL Version

Telco 5G Quality of Experience

Quality of service analysis of 5G vs 4G video for different situations such as Indoor, outdoor, and moble.
SQL Version

Grocery Recommendations using GenAI Beta

Use context-based product recommendations powered by GenAI Large Language Models to enhance a shopping experience.
Python-SQL Version

4D Analytics on bike sharing

Demonstration of Geospatial and TimeSeries using Austin bike trip data between 2014 and 2017.
Python-SQL VersionSQL Version

Air Passenger Forecasting

Applies Auto Regressive Integrated Moving Average (ARIMA) analysis to forecast airplane passenger volume.
SQL Version

Basic Jupyter Navigation

When running a Jupyter Notebook, there are various indicators that show what is happening. This is a guide to those indicators.
SQL Version

Cash Demand Forecasing

Predicts the future demand for cast in Automatic Teller Machines (ATMs) using Auto Regressive Integrated Moving Average (ARIMA) using In-Database functions.
SQL Version

Charting and Visualization

Data from queries is brought to life with graphics and charts. This shows how to use the %chart magic command to display results.
SQL Version

Consumer Complaints

Uses government consumer complaint data with SQL queries and visualizations to identify source of top complaints.
SQL Version

Credit Card Data Preparation

This shows the use of ClearScape analytics to reduce the pre-processing effort of incoming raw credit card data to prepare for analysis of potential loan defaulters.
SQL Version

Customer 360

This integrates data about customers from multiple sources using multiple matching techniquest to create the "Golden Customer Record" and calculate LTV.
SQL Version

Data Loading (SQL)

Shows multiple ways to load data from local CSV files, and cloud files on Google and AWS using the SQL kernel.
SQL Version

Deep History via Object Store

This demonstrates integration of local data or foreign tables on GCP and integration across cloud providers to detailed historical sales records on AWS.
SQL Version

Digital Identity Management

Combines ClearScape Analytics with Celebrus to track unique customers across web sessions and devices to drive personalized experiences
SQL Version

Financial Customer Journey

Uses analytic techniques to find new customers, measure marketing attribution, and maximizing marketing effectiveness
Python-SQL VersionSQL VersionVideo

Fourier Transforms

Fourier transformations are demonstrated to filter out noise from signals to allow identification of underlying patterns.
SQL Version

Insurance Policy Temporal

Show As-IS/As-Was capabilities of Vantage Temporal to dramatically simplify the SQL and improve performance for analyzing insurance policies versus claims.
SQL Version

K-Means Clustering and ML model

This uses the unsupervised K-Means ML algorithm to identify patterns in retail purchases.
SQL Version

Knee Replacement Attribution

The ClearScape Analytics Attribution function is used to determine the weight of various events that precede the final outcome, in this example, knee replacement.
SQL Version

NYC Taxi Temporal

Time series data can answer the questions about what was happening at a point in time. This applies Time series and temporal capabilities of vantage to NYC taxi data.
SQL Version

Outlier Analysis

Outliers in an analysis skew the results and make it difficult to recognize the main patterns. ClearScape Analytics has tools to remove outliers automatically.
SQL Version

SQL Basics in Jupyter

This guide will walk you through writing your first SQL queries in Jupyter. It uses some of the Vantage system tables as a source for the queries.
SQL Version

Text Term Frequency

Use NGram splitter to analyze comments retail products to determine patterns of words used to describe products.
Python VersionPython-SQL VersionSQL Version

VAL Analytics and ML

Demonstration of Vantage Analytic Library scoring and analytic functions like linear regression, decision trees, K-Means clustering, Factor Analysis, etc.
SQL Version

VAL Descriptive Statistics

This performs in-database analysis of data values, distribution, histograms, and text field analysis using SQL to access the Vantage Analytics Library.
SQL Version

VAL Hypothesis Tests

This demonstrates a subset of the 18 hypothesis test in the Vantage Analytics library using SQL such as Parametric, Binomial, Kolmogorov/Smirnoff, Rank, etc.
SQL Version

VAL Overview

Vantage Analytics Library (VAL) is a set of over 50 functions for advanced analytics. This provides an overview and links to an 8 minute video overview.
SQL Version

Vantage Query Log Analysis

Analysis of sessions and queries you executed using the built-in logging facilities of Vantage.
SQL Version

Back to Table of Contents


Solution Accelerator

Anomaly Detection

Anomaly Detection of Outstanding Amounts Beta

Anomaly detection in bank accounts can help in identifying unusual patterns, potentially flagging issues like errors or fraudulent activities. Enterprise re-use of Features ensures rapid creation & deployment of models while new Features can be created and used without extensive engineering support.
Python Version

Anomaly Detection

Evaluates potential failures in spot welds based on voltage anomalies during the welding process.
Python-SQL VersionVideo

Back to Table of Contents

Enterprise Feature Store

Back to Table of Contents


Other

Beta Review

Anomaly Detection of Outstanding Amounts Beta

Anomaly detection in bank accounts can help in identifying unusual patterns, potentially flagging issues like errors or fraudulent activities. Enterprise re-use of Features ensures rapid creation & deployment of models while new Features can be created and used without extensive engineering support.
Python VersionPython Version

Charting and Visualizations using teradataml Beta

The td_plot method streamlines large-scale visualizations by providing users with efficient tools to create visualizations directly within the Vantage platform
Python Version

Natural Language Processing Beta

Use NLP for Sentiment Analysis, perform Kmeans clustering, execute Principal Component Analysis (PCA) using comments received by a Retail Store
Python-SQL Version

Product Recommentations via TDApiClient Beta

Build a product recommendation system using OpenAI embeddings and Vantage in db_function VectorDistance. We will also use Vantage as VectorDB, to store the embeddings.
Python Version

Shipping Time Prediction Beta

Use ClearScape Analytics to develop a robust system that can reliably estimate delivery dates, accounting for handling time, transit time, and other relevant variables affecting the actual delivery timeframe.
Python-SQL Version

Getting Started With Azure Beta

Follow these steps to create your first Azure account and create an Azure Machine Learning workspace to complete our AzureML use cases.
Information

Using ClearScape Analytics with openAI Beta

To ensure optimal utilization of the OpenAI API in generative AI notebooks, it is essential to establish the API keys correctly. This concise guide outlines the process of configuring OpenAI API keys for seamless integration across multiple notebooks.
Information

Hyperparameter Tuning using the Titanic Passenger Dataset Beta

Create a predictive algorithm that can identify whethr or not passengers on the Titanic survived the ship's sinking.
Python Version

Signal Processing and Classification Beta

Use the Unbounded Array Framework in ClearScape Analytics to classify sonar signals. Extract and engineer features to use in training and scoring our models.
Python Version

Grocery Recommendations using GenAI Beta

Use context-based product recommendations powered by GenAI Large Language Models to enhance a shopping experience.
Python-SQL Version

Cancer Prediction using Teradata and the SageMaker API Beta

Use the Vantage SageMaker API feature to connect to an Amazon Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Cancer Prediction using the TDAPIClient and VertexAI Beta

Use TDAPIClient to connect to the Google Cloud Vertex API Endpoint to orchestrate Extreme Gradient XG Boost model training and score the model in SageMaker. Deploy the solution's ML model.
Python Version

Product Recommendation via AWS Bedrock Beta

Use AWS Bedrock, Embedding from HuggingFace and Vantage in-DB functions to assist in providing product recommendations to develop a recipe assistant chatbot.
Python Version

Back to Table of Contents


©2023 Teradata. All Rights Reserved